Transcript Document
MGDS PROJECT TEAM: PROJECT OVERVIEW AND SAMPLE METADATA
MGDS Project Overview and Sample Metadata (Arko) 1 of 18
Joyce Alsop
* Robert Arko
Suzanne Carbotte (lead)
Dale Chayes
John Diebold
Vicki Ferrini
Andrew Goodwillie
* Kerstin Lehnert
Andrew Melkonian
Suzanne O’Hara
William Ryan
R.A. Weissel
SESAR–IGSN Workshop (February 26-27, 2007)

OUTLINE
1. PROJECT OVERVIEW
2. CURRENT HOLDINGS
3. DATA MODEL
4. METADATA SUBMISSION
5. CHALLENGES

OVERVIEW: MISSION STATEMENT
Design and maintain an integrated data repository for MG&G communities:
• Ridge 2000 Program
• MARGINS Program
• U.S. Antarctic Program
• Legacy - Multibeam Synthesis
• Seismic Reflection
Joint funding from NSF OCE + EAR + OPP

OVERVIEW: SCOPE AND PARTNERS
Data from marine and terrestrial realms
Data from all disciplines - biological, physical, chemical, geological
Project partners:
• WHOI (Ridge 2000 Program)
• TAMU (MARGINS Program)
• RPSC (U.S. Antarctic Program)
• NGDC, CCOM (Legacy - Multibeam Synthesis)
• UTIG (Seismic Reflection)
Collaborative partners:
• DLESE (education modules)
• MMI (community/ontology development)
• SESAR (sample registration)

OVERVIEW: SCIENTIFIC RATIONALE
• Ensure ability to verify research results
• Preserve expensive/unique/unrepeatable data
• Supplement traditional publication methods
• Facilitate cross-disciplinary research
• Increase data availability to non-specialists
• Enable automated analysis + synthesis

OVERVIEW: SYSTEM COMPONENTS
PRODUCTS
• Metadata catalog (1500+ collections)
• Data repository (210,000+ files total 5+ TB - partnership with SDSC)
• Global syntheses (e.g.
multi-resolution DEM)
SERVICES
• Web portals (search + download)
• GeoMapApp® (integrate + visualize data from multiple sources)
• Web services (OAI, OGC, etc.)

CURRENT HOLDINGS: SOLID EARTH SAMPLES
50 NEW DATA SETS
OVER 3500 SAMPLES (growing rapidly…)

COLLECTION            TYPE      INVESTIGATORS
AT03-24               Rock      Fisher
AT03-38               Rock      Fisher
AT11-07               Rock      Perfit
AT11-07               Rock      Schouten
AT11-09               Rock      Blake
AT11-09               Rock      Reysenbach
AT11-09               Rock      Von Damm
AT11-09               Sediment  Inderbitzen
AT11-10               Rock      Vetriani
AT11-20               Rock      Edwards
AT11-26               Rock      Vetriani
AT15-06               Rock      Perfit, Sievert, Haymon
AT15-09               Rock      Kelley
AT15-12               Rock      Bright
COOK06MV              Rock      Fryer
COOK07MV              Rock      Bloomer
DANA01RR              Rock      Lonsdale
DANA02RR              Rock      Lonsdale
DANA07RR              Rock      Lonsdale
DANA08RR              Rock      Lonsdale
EW0004                Rock      Sinton
EW0104                Sediment  Underwood
KM0417                Rock      Langmuir
KM0502                Sediment  Kuehl
KM0503                Sediment  Alexander
KN182-13              Rock      Forsyth
Mariana_Forearc_2002  Rock      Reagan
MGLN07MV              Rock      Langmuir
TAN0613               Sediment  Alexander
TCS06NH               Rock      Perfit
TCS06NH               Rock      Perfit, Rubin
TN154                 Rock      Fryer
TN154                 Sediment  Fryer
TUIM05MV              Rock      Tivey
VANC02MV              Sediment  Underwood, Spinelli
VANC13MV              Sediment  Ogston
VANC14MV              Sediment  Nittrouer
VANC15MV              Sediment  Nittrouer
VANC16MV              Sediment  Ogston
VANC19MV              Sediment  Ogston
VANC20MV              Sediment  Nittrouer
VANC21MV              Sediment  Goni
VANC21MV              Sediment  Nittrouer
VANC22MV              Sediment  Driscoll
VANC23MV              Sediment  Driscoll
VANC27MV              Sediment  Ogston
VANC28MV              Sediment  Nittrouer
VANC29MV              Sediment  Nittrouer
VANC30MV              Sediment  Nittrouer
WF2983                Rock      Gill

DATA MODEL: MULTIPLE
WEB PORTALS TO SERVE DIFFERENT COMMUNITIES
SINGLE INTEGRATED DATABASE BACKEND

DATA MODEL:
[Diagram labels: COLLECTION, SET, OBJECT]
COLLECTION (registration = ?)
• Field
• Observatory
• Expedition
• Derived
SET (registration = STD-DOI)
• group of data objects having common provenance
OBJECT (registration = IGSN)
• Data File
• Real-time
• Processed
• Sample

DATA MODEL: COLLECTION METADATA
related collections
collection aliases (at other repositories)
platform/operator
funding agency/awards
project titles/urls
science party (field + lab personnel)
lat/lon bins
location (physio features, place names)
supporting documents (cruise reports etc.)
references (citations)

DATA MODEL: ACQUISITION EVENTS
1. LAUNCH (independent, navigated)
   • daughter platforms e.g. Submersible, Drone, Small Boat
2. LINE (navigated)
   • towed platforms e.g. Camera, MCS, TowYo
3. STATION (only start/stop)
   • lowered platforms e.g. Core, Grab, CTD, BLISP
   • towed platforms e.g. Dredge, Net
   • deployed platforms e.g. OBS, Marker, Float, Probe
Events can be nested (e.g. Dive > Station)

DATA MODEL: SAMPLE METADATA
collection_id
sample_id
sample_name (investigator’s pet name)
parent_id
data_type (e.g. “Rock Sample”)
sample_type (e.g. “Igneous: Volcanic: Mafic”)
launch_id
line_id
station_id ---> station_type (e.g. “Bottom: Towed”) + station_platform (e.g. “Dredge”)
start_date
start_longitude/latitude/elevation
stop_date
stop_longitude/latitude/elevation
navfix_type
local_origin/units (e.g. for dive programs)
start_local_x/y
stop_local_x/y
location_id (physiographic feature)
tectonic_setting (e.g.
“Back-Arc Basin”)
investigator_id
contact_id
contributor_id
repository_id (holds authoritative metadata)
facility_id (holds physical sample)
other/details

DATA MODEL: CONTROLLED VOCABULARIES
(both types and identifiers)
collection_id
collection_type
data_type
device_type
dive_type
feature_id
feature_type
format_id
initiative_id
language_id
launch_platform_type
launch_type
line_platform_type
line_type
location_id
nav_type
organization_id
person_id
platform_id
platform_type
role_id
role_type
station_platform_type
station_type
status_id
(and still growing…)

METADATA SUBMISSION: POLICY
Records made public immediately:
• people/projects/awards
• primary navigation
• catalog of acquisition events
• catalog of data sets
• catalog of samples

METADATA SUBMISSION: FORMS
1. Contact chief scientist in advance
   designate science party liaison
2. Follow up with liaison (60 days)
3. Register/submit data sets to appropriate partner repositories

METADATA SUBMISSION: FORMS
Example: Sediment Cores (based on LDEO Repository log sheet)

CHALLENGES:
1. Metadata form submission
   • completeness
   • consistent identifiers and formats
2. Globally unique identifiers
3. Evolving/shared vocabularies
   • Physiographic Feature (gazetteer + local features e.g. Vents)
   • Tectonic Setting
   • Sample Type (domain specific)
   • Station Platform/Type

Questions?
marine-geo.org
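
The sample metadata fields and controlled vocabularies on the DATA MODEL slides, together with the completeness and consistency challenge under CHALLENGES, could be sketched roughly as below. This is a minimal Python sketch under stated assumptions, not the MGDS schema or code. The names SampleRecord, VOCABULARIES, REQUIRED and check_record are hypothetical, the vocabularies contain only the "e.g." values quoted on the slides, and the choice of required fields is a guess, since the slides do not say which fields are mandatory.

```python
from dataclasses import dataclass, fields
from typing import List, Optional

# Hypothetical vocabularies: only the example values quoted on the slides.
VOCABULARIES = {
    "data_type": {"Rock Sample"},
    "sample_type": {"Igneous: Volcanic: Mafic"},
    "station_type": {"Bottom: Towed"},
    "station_platform": {"Dredge"},
    "tectonic_setting": {"Back-Arc Basin"},
}

# Assumption: the slides do not state which fields are mandatory.
REQUIRED = ("collection_id", "sample_id", "data_type", "sample_type")


@dataclass
class SampleRecord:
    """A subset of the sample metadata fields named on the slides."""
    collection_id: Optional[str] = None
    sample_id: Optional[str] = None
    sample_name: Optional[str] = None
    parent_id: Optional[str] = None
    data_type: Optional[str] = None
    sample_type: Optional[str] = None
    station_id: Optional[str] = None
    station_type: Optional[str] = None
    station_platform: Optional[str] = None
    tectonic_setting: Optional[str] = None


def check_record(record: SampleRecord) -> List[str]:
    """Return a list of problems; an empty list means the record passed."""
    problems = []
    # Completeness: every required field must be filled in.
    for name in REQUIRED:
        if not getattr(record, name):
            problems.append(f"missing required field: {name}")
    # Consistency: vocabulary-controlled fields must use a known term.
    for f in fields(record):
        value = getattr(record, f.name)
        allowed = VOCABULARIES.get(f.name)
        if value is not None and allowed is not None and value not in allowed:
            problems.append(f"{f.name}: {value!r} not in controlled vocabulary")
    return problems


if __name__ == "__main__":
    # Placeholder identifiers, not real MGDS or IGSN values.
    record = SampleRecord(collection_id="C1", sample_id="S1",
                          data_type="Rock Sample", sample_type="Basalt")
    print(check_record(record))
```

Returning a list of problems rather than raising on the first one lets a submission form report every gap to the science party liaison in a single pass.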