An Engineering Research Center for Integrated Sensing and

Download Report

Transcript An Engineering Research Center for Integrated Sensing and

Center for
Subsurface Sensing & Imaging Systems
Overview of Image
and Data Information
Management in
CenSSIS
David Kaeli
Northeastern University
Boston, MA
[email protected]
Overview of the Strategic Research Plan
Bio-Med
L3
Enviro-Civil
S2 S3
S1
S4
S5
Validating
L2 TestBEDs
R2
L1 Fundamental
Science
R3
R1
Image and Data
Information
Management
R3 Research Thrust Overview
 Utilize enabling hardware and software
technologies to address CenSSIS barriers
 Pursue research in enabling technologies
 Develop a common set of tools and techniques to
address SSI problems:
 Hardware parallelization and acceleration
 Software toolboxes
 Image database management and tools
Toolboxes
CenSSIS Middleware Tools
 Parallelization of MATLAB, C/C++ and Fortran codes
using Message Passing Interface (MPI) – a software
pathway to exploiting GRID-level resources
 Utilizing MPI-2 to address barriers in I/O performance
 Building on existing Grid Middleware such as Globus
Toolkit, MPICH-G2 and GridPort
 Presently illustrating the impact of the GRID on system
level projects (tomosynthesis reconstruction)
MPI
MATLAB
C/C++
Fortran
Parallelization
MPICH-G2
UPC
Air
Impact on CenSSIS Applications
Mine
Soil
 Reduced the runtime of a single-body Steepest Descent Fast
Multipole Method (SDFMM) application by 74% on a 32-node
Beowulf cluster
Scattered Light Simulation
• Hot-path parallelization
• Data restructuring
• Obtained superlinear speedup of Ellipsoid
Algorithm run on a 16-node IBM SP2
• Matlab-to-C compliation
• Hot-path parallelization
Runtime in seconds
• Matlab-to-C compliation
• Hot-path parallelization
100000
Original
10000
1000
Matlab-to-C
100
Hot path
parallelization
10
1
Ellipsoid Algorithm Speedup
(versus serial C version)
Speedup
 Reduced the runtime of a Monte Carlo
scattered light simulation by 98% on
a 16-node Silicon Graphics Origin 2000
Speedup
20
15
10
5
0
1
2
4
8
16
Number of Nodes
64-vector
1024-vector
256-vector
linear speedup
Tomographic mammography
 3D image reconstruction from x-ray projections
 Used to detect and diagnose breast cancer
 Based on well developed mammography techniques
 Exposes tissue structure using multiple projections from
different angles
 Advantages
 Accuracy: provides at least as much useful information
than x-ray film
 Flexibility: digital image manipulation, digital storage
 Structural information: using layered images
 Safe: low-dose x-ray
 Lower cost: compared to MRI
Image acquisition/reconstruction process
 Acquisition: 11 uniform angular
samples along Y-axis
 X-ray projection: breast tissue density
absorption radiograph
 Algorithm: constrained non-linear
convergence and iterative process
 Uses a Maximum Likelihood Estimation
Y
X-ray source
Initialization
Set 3D volume
Forward
Compute projections
3D volume
Backward
Correct 3D volume
Exit
Yes
Y
No
Satisfied ?
Z
X-ray
projections
X
detector
Parallelization approaches
 Reduce communication data
 Segmentation along Y-axis
 Using redundant computation to replace communication
 Segmenting along x-ray beam
First approach:
Non inter-communication
(more computation, less
communication)
Overlap area
Second approach:
Overlap with
inter-communication
exchange data
Third approach:
Non-overlap with
inter-communication
(less computation, more
communication)
Tomosynthesis Acceleration
•Input data set: phantom
1600x2034x45
Phantom data test results using nonoverlap method on 32 CPUs
• Serial implementations runs in 23 hours on a P4 machine
350
• Platforms:
– SGI Altix system
– UIUC NCSA Titan cluster
250
– P4 cluster at MGH
• Number of processors: 32
Computation: SGI Altix with
Itanium 2 processor outperforms
the other CPUs
 Currently moving this work to
the GRID and the Pittsburgh
Supercomputer Center
 Prototype running on our GRID
system at NU
Time (sec)
– UIUC NCSA IBM p690
– UMich Hypnos cluster
File IO
Collect
Inter-comm
Sync
Backward
Forward
300
200
150
100
50
0
P4 cluster
Hypnos
cluster
Titan
cluster
Platform
IBM p690
SGI Altix
Field Programmable Gate Arrays for
Subsurface Imaging
 Backprojection for Computed Tomography image
reconstruction
 Sponsored by Mercury Computer
 Finite Difference Time Domain (FDTD) in hardware
 Collaboration with Humanitarian Demining project
 Retinal Vascular Tracing in real time
 Collaboration with Real-time Retinal Imaging project
 Phase Unwrapping
 Collaboration with 3-D Fusion Microscope project
 Diverse problems, similar solutions:
FPGAs are particularly well suited for accelerating image processing
and image understanding algorithms
Retinal Vascular Tracing: Register 2-D Image to
3-D in Real Time
Objective
To accelerate an existing retinal
vascular tracing (RVT) algorithm by
implementing computation of template
responses in reconfigurable hardware
PCI
PCI BUS
BUS
RESULTS
MEMORY1
DESIGN
BLOCK RAM
Direction
of of
Direction
blood
vessel
blood
vessel
IMAGE
MEMORY0
FPGA
HOST
“Smart Camera”
FIREBIRD BOARD
Some Recent Publications on
Parallelization
• “Execution-Driven Simulation of Network Storage Systems,” Y. Wang and D.
Kaeli, Proceedings of the 12th ACM/IEEE International Symposium on
Modeling, Analysis of Computer and Telecommunication Systems, October
2004, pp. 604-611.
• “Profile-guided File Paritioning on Beowulf Clusters,” Y. Wang and D. Kaeli,
Journal o f Cluster Computing, Special Issue on Parallel I/O, to appear,
• “An Object-oriented Parallel Library,” C. Oaurrauri and D. Kaeli, International
Journal of High Performance of Computing and Networking, to appear.
• “Digital Tomosynthesis Mammography using a Parallelized Maximum
Likelihood Reconstruction Method,” T. Wu, R. Moore, E. Rafferty, D. Koppans,
J. Zhang, W. Meleis and D. Kaeli, Medical Imaging, 5368, February 2004.
• “Mapping and characterization of applications in Heterogeneous Distributed
Systems,” J. Yeckle and W. Rivera , To appear in Proceed. of the 7th World
Multiconference on Systemics, Cybernetics and Informatics (SCI2003).
• “Profile-Guided I/O Partitioning,” Y. Wang and D. Kaeli, Proceedings of the 17th
ACM International Symposium on Supercomputing, June 2003, pp. 252-260.
• “Source-Level Transformations to Apply I/O Data Partitioning,” Y. Wang and D.
Kaeli, Proceedings of the IEEE Workshop on Storage Network Architecture
and Parallel IO, Oct. 2003, pp. 12-21.
Held
again
in
2004
CenSSIS Solutionware – UPRM/NU/RPI
Toolbox Development
Toolboxes
 Support the development of CenSSIS Solutionware that
demonstrates our “Diverse Problems – Similar Solutions”
model
 Develop Toolboxes that support research and education
 Establish software development and testing standards for
CenSSIS
Image and Sensor Data Database
 Develop an web-accessible image database for CenSSIS
that enables efficient searching and querying of images,
metadata and image content
 Develop image feature tagging capabilities
Status of the CenSSIS Toolboxes
 Hyperspectral Image Analysis Toolbox
(HIAT)
HIAT
 October 2004
 Multiview Tomography Toolbox (MVT)
 fddlib:
 January 2003 (v. 1.0)
 July 2003 (v. 1.1)
MVT
 mvt:
 October 2004
 Rensselaer Generalized Registration
Library (RGRL)
RGRL
 September 2004
New toolbox: Improving the quality of
radiation oncology @ MGH
 Developed a 4D (3D + including time)
visualization browser tool kit
 Visualize Computed Tomography (CT)
images, organ outlines (wire contours) and
the isodose lines (treatment dosage)
 Present all this information in a user friendly
interface
4-D Visualization of Lung Tumors
Dosage
4-D Visualization
The Future for CenSSIS Toolboxes
SCIRun
Collaboration with the University of Utah
CenSSIS Image Database System
 Deliver an web-accessible database for
CenSSIS that enables efficient searching
and querying of images, sensor data,
metadata and image content
More that 4000 metadata-rich images/datasets
presently available online (> 10,000 by 2006)
 Database Characteristics:
• Relational complex queries (Oracle9i)
• Data security, reliability and layered user privileges
mouse
embryo
• Efficient search and query of image content and metadata
• Content-based image tagging using XML
• Indexing algorithms (2D, 3D, and 4D)
• Explore object relational technology to handle collections
3
4
2
1
CenSSIS Image Database System
CenSSIS Image Database System
CenSSIS Image Database System
Utilize Machine
Learning
algorithms to
improve query
view
CenSSIS Image Database System
Provides
data
description
associated
with initial
collection,
but does not
allow for
further
elaboration
or
annotation.
Image Annotation
 Provide the ability to markup image with searchable
features
 Enable image database to be more effectively datamined
<xml version=“1.0” encoding=“UTF-8”>
<embryo>
<description> Embryo developmental
stages</description>
<feature label=“1” xPos1=“29” yPos1=“33”
xPos2=“48” yPos2=“50”> 1 cell embryo </feature>
<feature label=“2” xPos1=“50” yPos1=“28”
xPos8=“70” yPos2=“40”> 2 cell embryo
</feature>
<feature label=“3” xPos1= “5” yPos1= “5”
xPos2=“25 yPos2=“20”> 4 cell embryo </feature>
</embryo>
XML and Java
• XML (Extensible Markup Language)
• Provides maximum flexibility and portability
• Well-supported standard
• Powerful querying tools available in Oracle
• The Java2 Platform
• Cross-platform compatibility
• Standard web-browser interface
• Native XML support
Image Tagging
A raw image file from the CenSSIS Database
• QUERY: I want to be able to add to this image textual
annotations, providing my medical team with questions about
particular ROIs:
• Difficult to describe regions in an image
• Difficult to pinpoint specific features in images
• Global image metadata too coarse to facilitate low-level tagging
Image Tagging
Image with tags
• Metadata associated with specific areas
• Query for specific image features
The Image Tagging Interface
Drawing
Tools
Tag
Options
View Options
Tags and XML
<feature type="Ellipse" label="4 Cell Stage">
<ellipse>
<xCenter> 101 </xCenter>
<yCenter> 58 </yCenter>
<xRadius> 79 </xRadius>
<yRadius> 46 </yRadius>
</ellipse>
<note> [custom XML tags go here] </note>
<annotator> awilliam </annotator>
</feature>
The Future Role of Image Annotation
 Provide a vehicle for natural collaboration
• A richer set of metadata to enable more detailed
queries
• Potential to perform extensive data mining on image
content
• An eye toward content-based image retrieval
 Tumor tracking paper recently accepted to SIGMOD 2005
The CenSSIS Image Database System
 Hosts the image and sensor data of CenSSIS (>500
images online)
 http://censsis-db1.ece.neu.edu/
 Provides metadata indexed image searching
 Uses XML tags to allow for easy information
interchange
 Evolved into a project-based management system,
allowing users to organize their data hierarchically
 Key issue: how do we develop collaboration tools
that increase the value of data stored in the
database?
 Presently exploring how best to integrate both
visualization and image annotation into the existing
framework (NIH proposal)
CenSSIS Image and Data Information
Management
 Addressing key research barriers in computational
efficiency, embedded computing and image/sensor data
management
 Exploiting Grid resources to enable new discovery in SSI
applications
 Producing a image/data repository and software-engineered
Subsurface Sensing and Imaging Toolsets
 Developing enabling tools targeting system-level projects
• Near real-time reconstruction and visualization
• Visualization of complex motion
• Predicting motion in image data using database indexing
techniques
MVT