GML Data Models and Web Services for GPS and Earthquake Catalogs Marlon Pierce, Galip Aydin Community Grids Lab, Indiana University [email protected].

Download Report

Transcript GML Data Models and Web Services for GPS and Earthquake Catalogs Marlon Pierce, Galip Aydin Community Grids Lab, Indiana University [email protected].

GML Data Models and Web
Services for GPS and
Earthquake Catalogs
Marlon Pierce, Galip Aydin
Community Grids Lab,
Indiana University
[email protected]
QuakeSim Applications


Several QuakeSim codes work directly with
observational data.
Examples discussed at ACES include
• GeoFEST, VirtualCalifornia, Simplex, and Disloc all depend
upon fault models.
• RDAHMM and Pattern Informatics codes use seismic
catalogs.
• RDAHMM primarily used with GPS data

Problem: We need to provide a way to integrate
these codes with the online data repositories.
• QuakeTables Fault Database was developed
• What about GPS and Earthquake Catalogs?
• Many formats, data available in tars or files, not searchable,
not easy to integrate with applicaitons

Solution: use databases to store catalog data; use
XML (GML) as exchange data format; use Web
Services for data exchanges, invoking queries, and
filtering data.
What Are Web Services?

Web Services are not web
pages, CGI, or Servlets
Web Services framework is
a way for doing distributed
computing with XML.


XML provides crosslanguage support
Suitable for both human
and application clients
Appl
Web
Server
WSDL
SOAP
WSDL
SOAP
Web
Server
WSDL
• WSDL: Defines interfaces
to functions of remote
components.
• SOAP: Defines the
message format that you
exchange between
components.
Browser
WSDL

JDBC
DB
Geographical Information Service
(GIS) Data Formats and Services


OpenGIS Consortium is an international group for defining
GIS data formats and services.
Main data format language is the XML-based GML.
• Subdivided into schemas for drawing maps, representing
features, observations, …


First Step: design GML schemas and build specialized Web
Services for GPS and Earthquake data.
OGC also defines services.
• Services include Web Features Services, Web Map Services,
and similar.
• These are currently pre-Web Service, based on HTTP Post, but
they are being revised to comply with WS standards.

Next Step: Implement OGC compatible Web Services for
this problem.
• Also build services to interact with QuakeTables Fault DB.
GML and Existing Data Formats




GPS or seismic data used in this project
are retrieved from different URLs and
have different text formats.
Seismic data formats
•
SCSN, SCEDC, Dinger-Shearer, Haukkson
•
JPL, SOPAC, USGS
GPS data formats
We defined 2 GML Schemas to unify
these
• http://grids.ucs.indiana.edu/~gaydin/servo

A summary of all supported formats and
data sources can also be found there.
So We Built It

First version of the
system available
• Tried XML databases
but performance was
awful
• Currently database uses
MySQL

Download results are
in GML, but we can
convert to appropriate
text formats.
Search DB For Earthquake
Catalogs
Results
Search XML DB For GPS Catalogs
1
Use Ours or Set Up Your Own

URL to access our browser interface:
• http://gf3.ucs.indiana.edu:6060/cce/sql/

URL to download and set up your
own
• http://complexity.ucs.indiana.edu/~gaydin/cce
/install/install.html
Fault Quest: QuakeTables+OGC
Web Map Service Demo
http://rio.ucs.indiana.edu:8080/wmsClient/
Conclusions


This is a little discussion with a big
conclusion-If you want to build iSERVO or
something like it, data access
services are an important
foundation.
GML Schemas as Data Models for
Services



Fault and GPS Schemas are based on GMLFeature object.
Seismicity Schema is based on GML-Observation
object.
Working schema available from
http://grids.ucs.indiana.edu/~gaydin/schemas/
Browser Interface
JSP + Client Stubs
DB Service 1
Job Sub/Mon
And File
Services
Viz Service
JDBC
DB
Host 1
Operating and
Queuing
Systems
Host 2
RIVA
Host 3
QuakeSim Portal Shots
Other Issues

We want to abstract the data storage system to allow
simple federation of relational and XML databases
• UK e-Science’s OGSA-DAI project is an interesting but
complicated example.


We’d like to simplify this approach
Metadata is also important
• Useful for capturing data pedigree and validation.



“This fault data generated with Simplex by Jay Parker using the
parameters….”
“Those 1935 Fault measurements aren’t so good.”
We have developed some general applications for metadata
management
• Newsgroups, citations, references, glossaries as examples.
• Would like to apply to scientific metadata
Future Directions

We are interested in Semantic Web markups (particularly
RDFS) to provide metadata descriptions of
•
•
•
•




Instruments
Data sets
Computing hardware
Applications/codes
We want this to form the basis for building composite
services.
Infrastructure improvements: reliable, fault tolerant grid
infrastructure needed as grid components come and go.
Component based portals: reuse portal interfaces between
projects.
ISERVO: International collaborations with Australia, Japan,
and possibly other countries
• Through ACES: APEC Cooperation for Earthquake Simulation
Acknowledgements






Community Grids: Geoffrey Fox,
Choonhan Youn, Galip Aydin,
Mehmet Aktas
NASA JPL: Andrea Donnellan (PI),
Jay Parker, Peggy Li, Robert Granat
UC-Davis: John Rundle
UC-Irvine: Lisa Grant
USC: Dennis Mcleod
Brown: Terry Tullis
Problems: Data Access and
Sharing, Code Integration


Codes all use custom text formats for
describing input and output.
Input and output data often combined
with code-specific information.
• Number of iterations, array sizes, etc.

Data files often created by hand from
journals, online repositories
• Online repositories themselves use differing
formats

Challenges are to develop common data
formats, access services, and client
query tools.
Web Services for Data Access and
Computing Service Invocation

Web services:
• WSDL: Interface definition language, describes your
service

“GeoFEST may be invoked with these input types”
• SOAP: Transport envelope for remote procedure
calls/messages


“Invoke GeoFEST with this set of input”
Together, WSDL and SOAP are useful for
manipulating, returning XML data values
• So GML schemas act as our data models and return
values

Status: built several general purpose services
• Remotely executing codes, monitoring queuing systems,
manipulating/moving files around, describing
applications, storing portal session values, accessing
data bases of faults,…

Work underway to build data services
QuakeSim Basics




Under development in collaboration with
researchers at JPL, UC-Davis, USC, and
Brown University.
Geoscientists develop simulation codes,
analysis and visualization tools.
We need a way to bind distributed codes,
tools, and data sets.
We need a way to deliver it to a larger
audience
• Instead of downloading and installing the code,
use it as a remote service.
What’s the Problem?

Data sources typically were provided in single downloads
• Tar bundles or text
• This has changed for SCEC catalogs since we developed this
project.
• SCIGN is adopting a Web Services approach for GPS data.

Formats defined but presented as text
• Use XML to re-format the data.
• Buys us investment in many XML manipulation, validation, and
messaging tools.


We wanted to use databases to store and manage the
information.
This makes the data queryable
• Retrieve all entries > 1970
• Retrieve all entries with M>3.0
Data Sources Summary

A summary of all supported
formats can be found here
• http://grids.ucs.indiana.edu/~gaydin/
servo


Information about supported
Earthquake catalog formats can be
found in
http://www.data.scec.org/
Information about supported GPS
data formats can be found in
http://www.scign.org
What Are Web Services?
Web services framework is
a way for doing distributed
computing with XML.


XML provides crosslanguage support
Suitable for both human
and application clients
Appl
Web
Server
WSDL
SOAP
WSDL
SOAP
Web
Server
WSDL
• WSDL: Defines interfaces
to functions of remote
components.
• SOAP: Defines the
message format that you
exchange between
components.
Browser
WSDL

JDBC
DB
Delivering Data for Human and
Application Consumption




We still have to get the results to the
(remote) client.
The client may be a user or an application.
Web Services provide a way to do this.
Note Web Services are NOT
• Web pages
• Servlets
• CGI scripts