Algorithm Synthesis Tool

Download Report

Transcript Algorithm Synthesis Tool

SLAAC Team Meeting 3-99

Ultra Wideband Coherent RF

Mark Dunham

Multi Dimensional Imaging

Jeff Bloch

LANL Challenge Problems

Kevin McCabe [email protected]

505-667-0728 LANL/SLAAC DARPA Collaboration

MultiDimensional Image Processing

Real Time Processing of Multi/Hyper Spectral or Time domain Data Cubes

CALIOPE

Kurt Moore HIRIS/MTI John Szymanski James Theiler SHS RULLI

Hyperspectral Demonstrations

DAPS

ReConfigurable Computing Hardware: IP51/RCA-2/DARPA/(RCA-3)/Commercial Mike Caffrey, Phil Blain, Noor Khalsa, Tony Rose, Tony Nelson RCC Architecture Development/Deployment Hardware/Software Environment Mike Caffrey, Tony Salazaar, John Layne, Jan Friego Classification/Compression/Recognition Algorithms for fixed point RCC Hardware John Szymanski, James Theiler, Jeff Bloch, Kurt Moore, Chris Brislawn Steven Brumby Reid Porter,Simon Perkins

DII:

“Rapid Feature Identification Using RCC Technology and Genetic Algorithms” Jeff Bloch, John Szymanski University Collaborations: BYU, UT,...

Kurt Moore DARPA Collaboration: RCC HW/SW Tool Evaluation Kevin McCabe

Ultra-Wide Band Radio Frequency

Real Time Processing wide band RF data

ALDEBARAN

Mark Dunham Bellatrix Scott Robinson Capella R. Dingler Cibola Mike Caffrey FORTE’ V-SENSOR

LANL/SLAAC DARPA Collaboration

Collaboration Phase 1 • Represent Challenge Problems – Ultra-Wide Band RF (UWBRF) Signal Processing – Multi-Dimensional Image Processing (MDIP) • Provide specific challenge problem descriptions to ACS investigators – MDIP: http://nis-www.lanl.gov/nis-projects/daps/ – UWBRF: http://www.lanl.gov/rcc/ • Seek out and collaborate with ACS investigators whose work matches our need – ISI - DRP and RRP – Northwestern - Matlab compiler – Ptolemy - System level analysis tool • Validate Hardware or Software Strategies

LANL/SLAAC DARPA Collaboration

Collaboration Phase 2 Technology Insertion • MDIP Rapid Feature Identification Project (RFIP) – Multi-dimensional image processing via algorithms derived in real time for rapid searching of archival information and high bandwidth sensor data streams • Bellatrix UWBRF Signal Compression – Wideband Signal Compressor for ID-1 Compatible Tape Recorders – Airborne environment • Capella UWBRF Accelerated Analysis Tool – Acceleration of government algorithms to make real time analysis feasible Additional Potential Challenge area • Plume Detection – Airborne LIDAR based sensor to detect and analyze plumes

LANL/SLAAC DARPA Collaboration

RFIP Jeff Bloch, 505-665-2568, [email protected]

John Szymanski, [email protected], 505-665-9371 James Theiler, [email protected], 505-665-5682

LANL/SLAAC DARPA Collaboration

RFIP • Objective – Manipulate image processing steps carried out on RCC hardware to develop remote sensing algorithms for classifying and identifying features of interest to an image analyst.

– Provide software suite and hardware to work in host workstation(s) – Search speeds comparable to archive retrieval times – Platform and RCC hardware independence – Scalable within a platform and via networked platforms • Approach – Rapid evolution of feature recognition procedure via: • Hardware accelerator containing tunable image processing operators • Software engine that parallelizes a search and manipulates accelerated operators to maximize performance against truth data

LANL/SLAAC DARPA Collaboration

RFIP • Current Plan – Develop non-real time demonstration (proof of concept) of an algorithm to evolve image classification procedures for identifying features of interest (fully funded by RFIP) • Select image operators amenable to RCC acceleration • Select algorithm framework and software • Select simulation environment (IDL, Perl, etc.) • Select test datasets • Develop, write, and execute an all software demonstration – Demonstrate ability of RCC to dramatically speed up image processing steps (RFIP - SLAAC partnership) • Select demonstration RCC hardware (SLAAC-1 insertion opportunity) • Develop tunable operator architecture in VHDL • Select some image operators and code them in VHDL • Develop software engine to drive a single RCC • Benchmark accelerated operators against software operators

LANL/SLAAC DARPA Collaboration

RFIP • Future Plan (2 DII proposals being submitted this month) – Fully develop RCC accelerated workstation • Add to selection image operators amenable to RCC acceleration and code them in VHDL • Refine algorithm framework and software • Define and procure RCC computer suitable for analysts workstation • Broaden test datasets • Benchmark against all software solution – Demonstrate parallelizability and scalability of approach on multiple workstations • Implement prior all software solution across multiple workstations • Develop a parallel execution scheme – Accelerated algorithm evolution against one truth data set – Accelerated processing of search data • Develop advanced user interface • Benchmark against single workstation all software solution

LANL/SLAAC DARPA Collaboration

RFIP • RFIP Project Status – Non-real time demonstration functional • Ability to demonstrate a limited number of operators • Further refinement of tunable operators approach ongoing • Rapid evolution of a Water Finding Procedure demonstrated • Benchmark of all software solution to be done – Current RFIP funding ends 12/99 • RFIP SLAAC collaboration effort – Demonstration of RCC • Candidate image operators selected • Tunable operator architecture concept under development • Targeting of RCC hardware to be done • Develop software engine to drive a single RCC to be done • Benchmark against software operators to be done – Limited demonstration to prove principle by 12/99

LANL/SLAAC DARPA Collaboration

The Tunable Operator Architecture Control Line Inputs Multi-Spectral Image Channel Inputs l Spectra Spatial Spatial l Spectra l Spectra l Spectra l Spectra l Spectra Feature 1 Threshold Spatial Feature 2 Threshold l Spectra Feature 3 Threshold Fitness Function Fitness Output Ground Truth Training Data

LANL/SLAAC DARPA Collaboration

Control Line Inputs The Tunable Spatial Operator Selected Input Channel P1 P1 P1 P1 P1 P1 FIFO of Image Width FIFO of Image Width P1 P1 P1 Correlation Multipliers and Accumulators Programmable Mask m1 m2 m3 m4 m5 m6 m7 m8 m9

LANL/SLAAC DARPA Collaboration

The Tunable Spectral Operator Selected Input Channels Control Line Inputs Adder / Subtractor Adder / Subtractor Constant K Arithmetic Array *, /, sqr, sqrt

LANL/SLAAC DARPA Collaboration

RFIP - SLAAC collaboration SLAAC technology • Hardware – SLAAC-1 RRP with Linux driver • LANL has VXI based RCC in use just in last few months BUT!

– VXI form factor not suitable for RFIP workstations – Have only a primitive board support package • SLAAC-1 advantages – PCI form factor with Linux driver matches RFIP workstations – Runtime library is key to research into scalability proposed by RFIP team – Still under discussion but: Xilinx architecture in SLAAC-1 may have advantages over Altera architecture for tunable operator concept under development

LANL/SLAAC DARPA Collaboration

RFIP - SLAAC collaboration SLAAC technology • Software – Runtime library is key to research into scalability proposed by RFIP team • Long-term vision Clusters of work-stations employing accelerated hardware to allow: 1) Rapid development of new tools, and constant refinement of existing tools, for analysts mining large data bases for timely information.

2) Greater acceleration by distributing inherently parallelizable processing

LANL/SLAAC DARPA Collaboration

RFIP - SLAAC collaboration Goals • Long Term Goals (beyond 12/99) – Determine best architecture for MDIP class of problems • Single RRP in a workstation • Operators accelerated at least 10x – Demonstrate scalability • Multiple RRPs in a single workstation • Multiple workstations with 1 RRP each • 9 month Insertion Plan – Map tunable operator architecture into SLAAC-1 – Target VHDL operators to Xilinx 40150 – Develop interface to software engine on host

LANL/SLAAC DARPA Collaboration

Bellatrix

Scott Robinson, 505-665-1954, [email protected]

John Layne, 505-667-5137, [email protected]

Mark Dunham, 505-667-0045, [email protected]

LANL/SLAAC DARPA Collaboration

Bellatrix • Objective – To demonstrate the ability to continuously record wideband data for the COMBAT SENT program.

– Apply lossy compression while still preserving the signal characteristics required by the analyst. – Demonstrate a novel algorithm for lossy compression of wideband signals so that 40 MHz @ 12 bits can be recorded at 50 MB/s with upgrade path to 70MHz.

– Devise hardware solution for higher resolution and up to 200 MHz bandwidth under light signal conditions. – Provide a scalable platform that can be used for R&D on new WB processing tasks after delivery of the compression system in anticipation of NextGen architecture.

LANL/SLAAC DARPA Collaboration

Bellatrix • Approach – Develop three signal processing algorithms for RCC acceleration • Sub-Band Coding Compression • Homomorphic Compression • Burst Digitization Compression – Initially target a 100Mss, 12 bit channel recorded onto an ID-1 tape.

– Apply lossy compression techniques to convert 150 Mbytes/second of incoming data to 50 Mbytes/second outgoing to tape.

LANL/SLAAC DARPA Collaboration

BELLATRIX 1.0 WB Compression

Sub-Band Coding (V. 2)

VXI Chassis ApCom 1610 IF/IF Converter 160 MHz IF LANL RCA-2 FPGA Computer I/ O 1 I/ O 2 I/ O 3 40 MHz analog BW 100Mss, 12 bit Digitizer Mezzanine (under development ) FPDP Celerity A256 ID-1 Tape Interface VXI PPC Con troller ID-1 Sony DIR-1000H Tape Ethernet 10baseT Control

LANL/SLAAC DARPA Collaboration

BELLATRIX 1.0 WB Compression

Sub-Band Coding with SLAAC-1

VXI Chassis ApCom 1610 IF/IF Converter 160 MHz IF 40 MHz analog BW Celerity FPDP A256 ID-1 Tape Interface ID-1 VXI PPC Con troller Ethernet 10baseT Control 100Mss, 12 bit Digitizer Mezzanine (under development) I/O 1 I/O 2 SLAAC-1 FPGA Computer Sony DIR-1000H Tape Pentium PCI Slot Computer

LANL/SLAAC DARPA Collaboration

BELLATRIX 1.0 WB Compression

Homomorphic or Burst Digitization (V. 2)

VXI Chassis ApCom 1610 IF/IF Converter 160 MHz IF LANL RCA-2 FPGA Computer I/ O 1 I/ O 2 I/ O 3 40MHz analog BW QC-64 LANL RCA-2 FPGA Computer I/ O 1 I/ O 2 I/ O 3 FPDP Celerity A256 ID-1 Tape Interface VXI PPC Con troller 100Mss, 12 bit Digitizer Mezzanine (under development ) CRI Peg-80 FFT CPU QC-64 CRI Peg-80 FFT CPU Pentium PC WinNT OS ID-1 Ethernet 10baseT Control Industrial PCI Chassis & Backplane Sony DIR-1000H Tape

LANL/SLAAC DARPA Collaboration

Bellatrix • Plan and Status – Software models of lossy compression techniques have been developed.

• Accomplishments: Demonstration of experimental algorithms on Blackbeard and FORTE signals; analysis of rate-distortion characteristics and effects of data quantization on exploitability – Validate lossy compression models against actual data in process.

– Develop a 12-bit, 100 Msps A/D converter input card for the RCA-2 using the new Analog Devices AD9432.

– Implement Sub-Band Coding compression technique on RCC for initial flight demonstration Sept. 1999.

– Follow-on demos dependent on success of initial demo – Eventually add demodulation, cross-correlation delay estimation, parameterization, set-on, and SNOI removal.

LANL/SLAAC DARPA Collaboration

Bellatrix - SLAAC collaboration Goals – Evaluate suitability of RRP architecture for UWB problem • High rate systolic streaming data – Collaborate with developers of Nextgen architecture

LANL/SLAAC DARPA Collaboration

Wideband Compression via Burst Digitization Input Signal S j S i S k S l i j k l

Spectrum Memory

W X

FFT(N+K) Adaptive Thresholds

t

S

Compressed Output

Activity Rules

SAVE?

Freqs

LANL/SLAAC DARPA Collaboration

2 i+j Homomorphic Compression Algorithm Positive Frequencies only: (Analytic Signal Format)

S

T 0 0 0

D

m Baseline Threshold N/2 Discrete Components

f

nyq

LANL/SLAAC DARPA Collaboration

• •

Joint Time Frequency Compression of Wideband RF Signals

Developers: Chris Brislawn (CIC-3), Shane Crockett (student, USNA). Example: spectrogram of FORTE data (L); after 4:1 compression (R).

LANL/SLAAC DARPA Collaboration

Lossy Compression of Wideband RF

Time Based Compression (Burst Digitization)

Well suited for pulse-like signals with low duty factor Performance depends on detection of pulse presence Weak SNR cases need sophisticated triggering

Frequency Domain Compression (Homomorphic + Thresholding)

Well suited to long duration signals and complex signal mixtures Simple versions of algorithm can provide 5X compression & high fidelity Higher compression ratios tend to round fast rise/fall times on pulses

Compression Through Sub-Band Coding

Joint localization of signal in time and frequency Adaptive bit allocation and scalar quantizer design Compression generally removes signal “noise”

LANL/SLAAC DARPA Collaboration

Capella-2

Scott Robinson, 505-665-1954, [email protected]

Robert Dingler, 505-665-3483, [email protected]

Steve White, 505-667-4623, [email protected]

Tony Salazar, 505-667-2508, [email protected]

Mark Dunham, 505-667-0045, [email protected]

LANL/SLAAC DARPA Collaboration

Capella-2 Objective • Provide 1000 lines/sec minimum, 10,000 lines/sec goal, of Government Spectrum & A Raster displays for quick look data searches. • Implement selected routines for 40 MHz real time analysis.

• Allow key concept demonstrations of a Modular Coherent UWB Processor, including SNOI removal, set-on, demod, and cross correlation.

• Demonstrate that pre-D processing can yield superior Pd, de-interleave, and metrics in real time, with respect to PDW methods.

LANL/SLAAC DARPA Collaboration

Dataflow Block Diagram

FFT Time Frequency Filters IFFT ALPHA 4100 RCC Pulse Parameterizer RCC Synchronous Video Integration

LANL/SLAAC DARPA Collaboration

Capella-2 Software Environment

60 MB/S RAID

PCI Bus 1

Ethernet FPDP Out Dec Alpha 4100 4-Processor Host

PCI Bus 2

FPDP In Calculex Tape/GigaFlash System

Control and Status Control - Text commands sent to socket via TCP/IP National Inst.

Pentium PC Running NT 4.0

MXI Control SW

CRI FFT Board

RCC Boards

Daughter Cards

VXI Crate “Black Box”

“Black Box” Software Functions:

Initialize, Load Flex File, Set Registers, Start Processing, Stop Processing, Report Status

C++ MFC Routines DLL HW Library Routines MXI Control SW

SW Model

LANL/SLAAC DARPA Collaboration

Basic Workstation Accelerator System

SVGA Alta Ether Ether PCI A Waterfall Video FPDP To External Network 10bT

SYS RAM

DEC Alpha 4100

70 MB/s RAID UWB Digitizer 50 MB/s Tape

PCI B U-SCSI U-SCSI Alta ID-1 FPDP

VXI Crate

LANL/SLAAC DARPA Collaboration

Review

LANL/SLAAC DARPA Collaboration

• Highest Priority MDIP Algorithm Needs: – Spectral and Spatial Classification in real time – Spectral matched filter algorithms – “K-Means” style classification algorithm – Plume detection – Rare signal or signature detection

LANL/SLAAC DARPA Collaboration

• Highest Priority UWB Algorithm Needs: – Find a coding algorithm/process to compress information bandwidth through an FFT – Decompose a non-linear RF chirp into an efficient wavelet or multi-resolution expansion – Apply image processing/recognition algorithms to streaming time-frequency images to find objects of interest.

– Identify fast methods of classification and correlation suitable for FPGA implementation

LANL/SLAAC DARPA Collaboration

Myrinet-2560/SAN Compatible I/O Interface Development Status

March 3, 1999 Douglas E. Patrick NIS-4 Space Instrumentation and Systems Engineering Mail Stop D448 Los Alamos National Laboratory (505)-665-1203 [email protected]

LANL/SLAAC DARPA Collaboration

A Few Current Myrinet/SAN Interface Design Efforts by others

• Lockheed/Sanders: LANAI processor based Common Node Adapter (CNA) • Lockheed/Martin Astronautics: FPGA driven I/O design using FI32 SAN/FIFO interface Version 1.3 (currently not using any Packet or Header info) • Air Force Research Laboratory: FPGA driven I/O similar to LMCO but uses AFRL Packets

LANL/SLAAC DARPA Collaboration

Myrinet Interface Design Goals

• Leverage off of LMCO and AFRL Design • Maintain as much Myrinet-2560/SAN Compatibility as possible (within reason) • Maintain protocol and packet compatibility with those that we will be interfacing with (AFRL, LMCO, etc..) • At a minimum, be able to easily reconfigure (via FPGA reprogramming) for mission specific protocol(s)/packet(s).

LANL/SLAAC DARPA Collaboration

LANL/SLAAC DARPA Collaboration

SLAAC-3

LANL/SLAAC DARPA Collaboration

Desirable architectural elements

Overall Architecture

• Distinct from current COTS RCCs • Careful trade study of PE-PE connectivity vs. PE-Memory – Bus widths – Data Broadcast between one input and all PEs – Independent addressibility • Anticipate direction of reconfigurability features

LANL/SLAAC DARPA Collaboration

Desirable architectural elements

Input/Output

• High Speed IO of flexible type – Mezzanine card with standard interface to RCC – Directly connected to PEs – Capable of wide 64 bit data • Ability to split and combine data streams for parallelizability and scalability – 3 IO Ports, 2 in and 1 out or vice versa – 1 IO connected to all PEs • IO dataflow decoupled from PE by FIFOs

LANL/SLAAC DARPA Collaboration

Desirable architectural elements

Memory

• Multiple parallel memory banks local to each PE – Independently addressable – Parallel access – 18 bit data • Ability to split and merge data streams for parallelizability and scalability – 3 IO Ports, 2 in and 1 out or vice versa – 1 IO connected to all PEs • IO dataflow decoupled from PE by FIFOs

LANL/SLAAC DARPA Collaboration

Desirable architectural elements

Memory

• Shared memory banks between adjacent PEs – Connected via crossbar switches • Ability to split and merge data streams for parallelizability and scalability – 3 IO Ports, 2 in and 1 out or vice versa – 1 IO connected to all PEs • IO dataflow decoupled from PE by FIFOs

Datapaths

• Broadcast bus between one input all PEs

LANL/SLAAC DARPA Collaboration

Desirable architectural elements

• 6U VME64 Board (+3.3V included in this standard) • Mezzanine cards for flexibility • Simple Fast interface from PE to back-plane • Simple VME Interface Controller • Configuration Manager and local configuration memory • +2.5V or +1.8V Need these for future FPGAs • Independent clock with skew control

LANL/SLAAC DARPA Collaboration