Transcript Document
The Open Earth Framework (OEF)
A Data Integration Environment for Earth
Sciences
G. Randy Keller - Univ. Oklahoma
Matt Fouch - Arizona State Univ.
Chris Crosby – SDSC
Chaitan Baru – SDSC
Dave Nadeu – SDSC
John Moreland - SDSC
Motivations
• Integration of multidisciplinary data sets is essential to
understanding the complex processes operating at the
Earth’s surface and within its interior.
• Our current ability to collect massive amounts of digital
data, develop structural models from these data, and
generate high-resolution numerical models of dynamics is
very well developed.
• Conversely, our ability to quantitatively integrate these
data and models into holistic interpretations of Earth
systems is poorly developed.
EarthScope Program http://www.earthscope.org
•
•
•
•
•
•
Study the three dimensional structure and evolution of the North American Continent
3.2 km borehole into the San Andreas Fault
• 400 transportable seismic stations
1099 permanent GPS stations
occupying 2000 sites
74 borehole strainmeters
• 30 magnetotelluric systems
5 laser strainmeters
• 100 campaign GPS stations
100 Permanent seismic stations
• 2146 campaign seismic stations
2
1000s km high resolution topography/InSAR
• Geochronology
3
EarthScope Data Portal
portal.earthscope.org
Data layers - Input to aid in the construction of
3-D and ultimately 4-D models
DEM (USGS, SRTM)
Geology (mostly 1:500,000)
Landsat 7 / ASTER / LIDAR
Petrology/Geochron (e.g. NAVDAT)
Drilling data (State surveys, USGS)
Magnetic anomalies
Gravity anomalies
……….
To construct 3-D models, start
with tomography; add gravity,
geologic interfaces, seismic
interfaces, ….
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES
Provide input
for modeling
of processes
www.geongrid.org
A Scientific Effort Vector
Background
Research
Background
Research
Data Collection and
Compilation
Software Issues
Data
Collection
and
Compilation
Software Issues
Science
Science
Science - Analysis, Modeling, Interpretation, Discovery
Data Integration as a Workflow
• All data integration activities can be characterized generically as
workflows that typically involve running the data through a series of
processing stages to:
–
–
–
–
Find the data
prepare the data,
remove outliers,
format and filter data,
– grid the data
– derive other data products,
– visualize the results,
– produce a proposed “model” for a given region
• The integration process is necessarily iterative, leading to
progressively refined earth system models, but it is rife with
possible stumbling points and inefficiencies.
Data Integration Choke Points
• Frustrations and inefficiencies that come from wrangling and
integrating disparate data to build a coherent model.
• Bottlenecks in the process of going from disparate sets of data to
integrated models are workflow “choke points” that stall processing
when data does not flow easily from one stage to the next.
Data Collection
Backand
• Custom
ad hoc software “hacks”
to stitch together
tools and push
Science
ground
Compilation
pastResearch
these choke points.
Software Issues
• Working through and tolerating such choke points is sometimes
viewed as a “rite of passage” and necessary training.
• Learning to manage and prepare data is useful, but ultimately time
would be better spent on analyzing data and building
comprehensive models.
“PhotoShop Science”
• Choke point at the very end of the workflow, just shy of a
publication-worthy diagram.
• Screen shots & outputs from different tools overlain them by hand
in software such as PhotoShop to obtain the desired result.
• Difficult to reproduce a similar figure with improved data, and we
have to regenerate the figure from the beginning, through all of the
workflows again.
• “PhotoShop science” is a problem to be solved rather than lived
with.
Seismic
tomography result
from the CD-ROM
project
Ken Dueker, University of Wyoming
The OEF
Start with a
tomographic
model
Add
topography
Add interfaces
based on
geophysics
Finish and
place in a
regional
Add geologic
context
detail
Use open source software and openly available data
3-D is
essential in
geology
Is this a plume, an
intrusion, a salt
dome, or a reef?
The ultimate goal in geophysics is..
Construction of 3-D volumes with as many physical
properties as possible assigned to each volume element
Discontinuities are also important, and we need to
be able to insert them and manipulate them
Figure by M. Billen,
UCDavis
We also want the
results to be
compatible with
various modeling
programs (e.g.
groundwater,
geodynamics)
A number of geophysical techniques can produce 3-D voxel
models (e.g., tomography), and others produce interfaces. The
big challenges are to include interfaces in voxel-based models
and to be able edit and visualize the models as one proceeds.
Reflection
seismology
provides an image
of the subsurface
whose geologic
interpretation is
often obvious.