Transcript Document

MGDS PROJECT OVERVIEW AND SAMPLE METADATA
PROJECT TEAM:
Joyce Alsop
* Robert Arko
Suzanne Carbotte (lead)
Dale Chayes
John Diebold
Vicki Ferrini
Andrew Goodwillie
* Kerstin Lehnert
Andrew Melkonian
Suzanne O’Hara
William Ryan
R.A. Weissel
MGDS Project Overview and Sample Metadata (Arko)
SESAR–IGSN Workshop (February 26-27, 2007)
OUTLINE
1. PROJECT OVERVIEW
2. CURRENT HOLDINGS
3. DATA MODEL
4. METADATA SUBMISSION
5. CHALLENGES
OVERVIEW: MISSION STATEMENT
Design and maintain an integrated data repository for MG&G communities:
• Ridge 2000 Program
• MARGINS Program
• U.S. Antarctic Program
• Legacy - Multibeam Synthesis
• Seismic Reflection
Joint funding from NSF OCE + EAR + OPP
OVERVIEW: SCOPE AND PARTNERS
Data from marine and terrestrial realms
Data from all disciplines - biological, physical, chemical, geological
Project partners:
• WHOI (Ridge 2000 Program)
• TAMU (MARGINS Program)
• RPSC (U.S. Antarctic Program)
• NGDC, CCOM (Legacy - Multibeam Synthesis)
• UTIG (Seismic Reflection)
Collaborative partners:
• DLESE (education modules)
• MMI (community/ontology development)
• SESAR (sample registration)
OVERVIEW: SCIENTIFIC RATIONALE
• Ensure ability to verify research results
• Preserve expensive/unique/unrepeatable data
• Supplement traditional publication methods
• Facilitate cross-disciplinary research
• Increase data availability to non-specialists
• Enable automated analysis + synthesis
OVERVIEW: SYSTEM COMPONENTS
PRODUCTS
• Metadata catalog (1500+ collections)
• Data repository (210,000+ files totaling 5+ TB; partnership with SDSC)
• Global syntheses (e.g. multi-resolution DEM)
SERVICES
• Web portals (search + download)
• GeoMapApp® (integrate + visualize data from multiple sources)
• Web services (OAI, OGC, etc.)
CURRENT HOLDINGS:
SOLID EARTH SAMPLES
50 NEW DATA SETS
OVER 3500 SAMPLES
(growing rapidly…)
COLLECTION            TYPE      INVESTIGATORS
AT03-24               Rock      Fisher
AT03-38               Rock      Fisher
AT11-07               Rock      Perfit
AT11-07               Rock      Schouten
AT11-09               Rock      Blake
AT11-09               Rock      Reysenbach
AT11-09               Rock      Von Damm
AT11-09               Sediment  Inderbitzen
AT11-10               Rock      Vetriani
AT11-20               Rock      Edwards
AT11-26               Rock      Vetriani
AT15-06               Rock      Perfit, Sievert, Haymon
AT15-09               Rock      Kelley
AT15-12               Rock      Bright
COOK06MV              Rock      Fryer
COOK07MV              Rock      Bloomer
DANA01RR              Rock      Lonsdale
DANA02RR              Rock      Lonsdale
DANA07RR              Rock      Lonsdale
DANA08RR              Rock      Lonsdale
EW0004                Rock      Sinton
EW0104                Sediment  Underwood
KM0417                Rock      Langmuir
KM0502                Sediment  Kuehl
KM0503                Sediment  Alexander
KN182-13              Rock      Forsyth
Mariana_Forearc_2002  Rock      Reagan
MGLN07MV              Rock      Langmuir
TAN0613               Sediment  Alexander
TCS06NH               Rock      Perfit
TCS06NH               Rock      Perfit, Rubin
TN154                 Rock      Fryer
TN154                 Sediment  Fryer
TUIM05MV              Rock      Tivey
VANC02MV              Sediment  Underwood, Spinelli
VANC13MV              Sediment  Ogston
VANC14MV              Sediment  Nittrouer
VANC15MV              Sediment  Nittrouer
VANC16MV              Sediment  Ogston
VANC19MV              Sediment  Ogston
VANC20MV              Sediment  Nittrouer
VANC21MV              Sediment  Goni
VANC21MV              Sediment  Nittrouer
VANC22MV              Sediment  Driscoll
VANC23MV              Sediment  Driscoll
VANC27MV              Sediment  Ogston
VANC28MV              Sediment  Nittrouer
VANC29MV              Sediment  Nittrouer
VANC30MV              Sediment  Nittrouer
WF2983                Rock      Gill
DATA MODEL:
MULTIPLE WEB PORTALS TO SERVE DIFFERENT COMMUNITIES
SINGLE INTEGRATED DATABASE BACKEND
DATA MODEL:
COLLECTION > SET > OBJECT
COLLECTION (registration = ?)
• Field
• Observatory
• Expedition
• Derived
SET (registration = STD-DOI)
• group of data objects having common provenance
OBJECT (registration = IGSN)
• Data File
• Real-time
• Processed
• Sample
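The three-level hierarchy on this slide can be sketched as a few Python dataclasses. This is a minimal illustration, not the MGDS schema: the class names, field names, and the identifiers in the usage example are hypothetical. Only the level names, the type lists, and the registration schemes (STD-DOI for sets, IGSN for objects) are taken from the slide.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Type lists copied from the slide; all other names are illustrative assumptions.
COLLECTION_TYPES = {"Field", "Observatory", "Expedition", "Derived"}
OBJECT_TYPES = {"Data File", "Real-time", "Processed", "Sample"}


@dataclass
class DataObject:
    """Lowest level: a single data object (registration = IGSN)."""
    object_type: str
    name: str
    igsn: Optional[str] = None

    def __post_init__(self) -> None:
        if self.object_type not in OBJECT_TYPES:
            raise ValueError(f"unknown object type: {self.object_type!r}")


@dataclass
class DataSet:
    """Middle level: objects with common provenance (registration = STD-DOI)."""
    provenance: str
    doi: Optional[str] = None
    objects: List[DataObject] = field(default_factory=list)


@dataclass
class Collection:
    """Top level; the slide leaves its registration scheme open ("?")."""
    collection_id: str
    collection_type: str
    sets: List[DataSet] = field(default_factory=list)

    def __post_init__(self) -> None:
        if self.collection_type not in COLLECTION_TYPES:
            raise ValueError(f"unknown collection type: {self.collection_type!r}")

    def count_objects(self) -> int:
        """Total number of objects across all sets in this collection."""
        return sum(len(s.objects) for s in self.sets)


# Hypothetical usage: one expedition holding a single set of two samples.
rocks = DataSet(
    provenance="dredge haul, station 1",
    objects=[DataObject("Sample", "rock A"), DataObject("Sample", "rock B")],
)
cruise = Collection("EXAMPLE-01", "Expedition", sets=[rocks])
print(cruise.count_objects())  # 2
```

Keeping the registration identifier optional at each level reflects that an item may be catalogued before it is registered; that ordering is an assumption of this sketch.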
DATA MODEL:
COLLECTION METADATA
related collections
collection aliases (at other repositories)
platform/operator
funding agency/awards
project titles/urls
science party (field + lab personnel)
lat/lon bins
location (physio features, place names)
supporting documents (cruise reports etc.)
references (citations)
DATA MODEL:
ACQUISITION EVENTS
1. LAUNCH (independent, navigated)
   • daughter platforms e.g. Submersible, Drone, Small Boat
2. LINE (navigated)
   • towed platforms e.g. Camera, MCS, TowYo
3. STATION (only start/stop)
   • lowered platforms e.g. Core, Grab, CTD, BLISP
   • towed platforms e.g. Dredge, Net
   • deployed platforms e.g. OBS, Marker, Float, Probe
Events can be nested (e.g. Dive > Station)
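One way to picture nested acquisition events is a small tree in which each event points to its parent. The sketch below is an assumption-laden illustration: the `Event` class, its attributes, and the `path` format are hypothetical, while the three event kinds and the navigated versus start/stop distinction come from the slide.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Event kinds from the slide, mapped to whether the event is navigated
# (LAUNCH, LINE) or records only start/stop (STATION).
EVENT_KINDS = {"LAUNCH": True, "LINE": True, "STATION": False}


@dataclass(eq=False)  # compare by identity; avoids recursing through parent/children
class Event:
    kind: str
    platform: str
    parent: Optional["Event"] = field(default=None, repr=False)
    children: List["Event"] = field(default_factory=list, repr=False)

    def __post_init__(self) -> None:
        if self.kind not in EVENT_KINDS:
            raise ValueError(f"unknown event kind: {self.kind!r}")
        if self.parent is not None:
            # Register this event under its parent to record the nesting.
            self.parent.children.append(self)

    @property
    def navigated(self) -> bool:
        return EVENT_KINDS[self.kind]

    def path(self) -> str:
        """Walk up to the outermost event and render the nesting chain."""
        parts = []
        node = self
        while node is not None:
            parts.append(f"{node.kind}:{node.platform}")
            node = node.parent
        return " > ".join(reversed(parts))


# Hypothetical usage: a station occupied during a submersible dive.
dive = Event("LAUNCH", "Submersible")
station = Event("STATION", "Marker", parent=dive)
print(station.path())  # LAUNCH:Submersible > STATION:Marker
```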
DATA MODEL:
SAMPLE METADATA
collection_id
sample_id
sample_name (investigator’s pet name)
parent_id
data_type (e.g. “Rock Sample”)
sample_type (e.g. “Igneous: Volcanic: Mafic”)
launch_id
line_id
station_id ---> station_type (e.g. “Bottom: Towed”) + station_platform (e.g. “Dredge”)
start_date
start_longitude/latitude/elevation
stop_date
stop_longitude/latitude/elevation
navfix_type
local_origin/units (e.g. for dive programs)
start_local_x/y
stop_local_x/y
location_id (physiographic feature)
tectonic_setting (e.g. “Back-Arc Basin”)
investigator_id
contact_id
contributor_id
repository_id (holds authoritative metadata)
facility_id (holds physical sample)
other/details
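As a rough illustration, a subset of the fields listed on this slide could be carried in a record type with a few basic sanity checks. The field names follow the slide; the `SampleRecord` class, the `problems` method, the choice of which fields are optional, and the assumption that coordinates are decimal degrees within ±180 and ±90 are not from the original.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class SampleRecord:
    # Field names follow the slide; only a subset is shown here.
    sample_id: str
    sample_name: str
    collection_id: str
    data_type: str
    sample_type: str
    station_id: Optional[str] = None
    parent_id: Optional[str] = None
    start_date: Optional[date] = None
    start_longitude: Optional[float] = None
    start_latitude: Optional[float] = None
    stop_date: Optional[date] = None
    stop_longitude: Optional[float] = None
    stop_latitude: Optional[float] = None

    def problems(self) -> List[str]:
        """Return human-readable descriptions of basic inconsistencies."""
        found = []
        # Assumption: longitudes are decimal degrees in [-180, 180].
        for name in ("start_longitude", "stop_longitude"):
            value = getattr(self, name)
            if value is not None and not -180.0 <= value <= 180.0:
                found.append(f"{name} out of range: {value}")
        # Assumption: latitudes are decimal degrees in [-90, 90].
        for name in ("start_latitude", "stop_latitude"):
            value = getattr(self, name)
            if value is not None and not -90.0 <= value <= 90.0:
                found.append(f"{name} out of range: {value}")
        if self.start_date and self.stop_date and self.stop_date < self.start_date:
            found.append("stop_date precedes start_date")
        return found


# Hypothetical record; identifiers, dates, and coordinates are made up.
rec = SampleRecord(
    sample_id="EX-001",
    sample_name="big black rock",
    collection_id="EXAMPLE-01",
    data_type="Rock Sample",
    sample_type="Igneous: Volcanic: Mafic",
    start_date=date(2000, 1, 1),
    stop_date=date(2000, 1, 2),
    start_longitude=10.0,
    start_latitude=20.0,
)
print(rec.problems())  # []
```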
DATA MODEL:
CONTROLLED VOCABULARIES
(both types and identifiers)
collection_id
collection_type
data_type
device_type
dive_type
feature_id
feature_type
format_id
initiative_id
language_id
launch_platform_type
launch_type
line_platform_type
line_type
location_id
nav_type
organization_id
person_id
platform_id
platform_type
role_id
role_type
station_platform_type
station_type
status_id
(and still growing…)
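A controlled vocabulary can be thought of as a set of allowed terms per field, against which submitted values are checked. The sketch below is illustrative only: the `VOCABULARIES` table and the `unknown_terms` helper are hypothetical, and the term sets contain only terms quoted elsewhere in these slides, so they are far from complete.

```python
from typing import Dict, List, Set, Tuple

# Only terms quoted on the slides are listed; the real vocabularies
# would be larger and maintained in the database.
VOCABULARIES: Dict[str, Set[str]] = {
    "data_type": {"Rock Sample"},
    "station_type": {"Bottom: Towed"},
    "station_platform_type": {
        "Core", "Grab", "CTD", "BLISP", "Dredge", "Net",
        "OBS", "Marker", "Float", "Probe",
    },
}


def unknown_terms(record: Dict[str, str]) -> List[Tuple[str, str]]:
    """Return (field, value) pairs whose value is not in the field's vocabulary.

    Fields without a registered vocabulary are not checked.
    """
    return [
        (name, value)
        for name, value in record.items()
        if name in VOCABULARIES and value not in VOCABULARIES[name]
    ]


record = {
    "data_type": "Rock Sample",
    "station_type": "Bottom: Towed",
    "station_platform_type": "dredge",  # wrong case, so not an exact match
    "sample_name": "free text, not vocabulary-controlled",
}
print(unknown_terms(record))  # [('station_platform_type', 'dredge')]
```

Exact, case-sensitive matching is a deliberate simplification here; a production system might normalize case or map synonyms before rejecting a term.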
METADATA SUBMISSION: POLICY
Records made public immediately:
• people/projects/awards
• primary navigation
• catalog of acquisition events
• catalog of data sets
• catalog of samples
METADATA SUBMISSION: FORMS
1. Contact chief scientist in advance; designate science party liaison
2. Follow up with liaison (60 days)
3. Register/submit data sets to appropriate partner repositories
METADATA SUBMISSION: FORMS
Example: Sediment Cores (based on LDEO Repository log sheet)
CHALLENGES:
1. Metadata form submission
   • completeness
   • consistent identifiers and formats
2. Globally unique identifiers
3. Evolving/shared vocabularies
   • Physiographic Feature (gazetteer + local features e.g. Vents)
   • Tectonic Setting
   • Sample Type (domain specific)
   • Station Platform/Type
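The completeness challenge for submitted metadata forms could be measured with a simple check like the one below. This is a hedged sketch: the `REQUIRED_FIELDS` selection, both helper functions, and the sample rows are hypothetical, and the slides do not say which fields MGDS treats as mandatory.

```python
from typing import Dict, Iterable, List, Optional

# Hypothetical choice of required fields, named after the sample metadata slide.
REQUIRED_FIELDS = ("sample_id", "collection_id", "data_type", "station_id", "start_date")


def missing_fields(row: Dict[str, Optional[str]],
                   required: Iterable[str] = REQUIRED_FIELDS) -> List[str]:
    """List required fields that are absent, None, or blank in one form row."""
    return [name for name in required if not (row.get(name) or "").strip()]


def completeness(rows: List[Dict[str, Optional[str]]]) -> float:
    """Fraction of required cells that are filled across all rows (1.0 if no rows)."""
    total = len(rows) * len(REQUIRED_FIELDS)
    if total == 0:
        return 1.0
    missing = sum(len(missing_fields(row)) for row in rows)
    return (total - missing) / total


# Made-up form rows: the second one is missing three required values.
rows = [
    {"sample_id": "EX-001", "collection_id": "EXAMPLE-01", "data_type": "Rock Sample",
     "station_id": "ST-1", "start_date": "2000-01-01"},
    {"sample_id": "EX-002", "collection_id": "EXAMPLE-01", "data_type": "",
     "station_id": None},
]
print(missing_fields(rows[1]))  # ['data_type', 'station_id', 'start_date']
print(completeness(rows))       # 0.7
```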
Questions?
_____________
marine-geo.org