GBIF Data Access and Database Interoperability

Download Report

Transcript GBIF Data Access and Database Interoperability

GBIF Data Access and
Database Interoperability
2003 Work Programme Overview
Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability
[email protected]
Indaiatuba, Brazil, 19 October, 2002
GBIF Node Responsibilities
GBIF Portal
1.
2.
3.
1.
2.
3.
4.
Network
Standards
Tools
1.
2.
1.
2.
Data
Node
Data
Metadata
3.
Network
Standards
Tools
Consolidated Data
Identify (local) Data Nodes
Forward registration
metadata from Data Nodes
National Language Interfaces
Participant
1.
2.
Encourage participation
Manage registration of
Data Nodes
GBIF Data Access and
Database Interoperability
Node Architecture
Collection Data Node
HTML
Presentation
Service
Specimen Specimen
Specimen
Index Data Summary Data
Detail
(3-5 fields) (20-30 fields) (full data)
Node Data Services
Collection
Database
Adaptor
Collection
Database
Adaptor
Collection
Database
Collection
Database
WSDL
Service
Descriptions
Data
Repository
…
…
Metadata
Services
General Resource Node
WSDL
Service
Descriptions
General Resource
Data
HTML
Presentation
Service
Node Data Services
Resource
Database
Adaptor
Resource
Database
Adaptor
Resource
Database
Resource
Database
Data
Repository
…
…
Metadata
Services
GBIF Aggregated Data Portal
HTML
Aggregated General
Resource Data
Aggregated
Specimen Data
Name Data
Geographic
Service Data
Presentation
Service
Data Aggregation Services
General
Resource
Index
Specimen
Index
UDDI
Service Registry
Indexing
Service
WSDL Service
Descriptions
General
Index Specimen
Resource Requests Index Data
Index Data
(3-5 fields)
Name
Services
(ECAT)
Specimen
Name
Data Summary Data
(20-30 fields)
Participant Node
HTML
Presentation
Service
General Resource Specimen
Data
Data
Name
Data
WSDL Service
Descriptions
Registry
Management
Portal Services
UDDI Service
Registry
Data Services
from GBIF Portal
Specimen Data
from Collection
Data Nodes
WSDL Service
Descriptions
Data Flow within GBIF Network
User Browser
HTML Data
HTML Data
GBIF
Portal
Participant
Node
Aggregated
Data
Aggregated
Data
Service
Metadata
Service
Metadata
Detailed
Specimen Data
Collection Node
Participant
Node
Service
Metadata
Specimen
Index Data
Service
Metadata
Collection Nodes
Detailed
Specimen
Data
Example Interactions: Collection View
User Browser
HTML Data
Collection Node
Example Interactions: Global View
User Browser
HTML Data
GBIF
Portal
Specimen
Summary
Data
Collection Node
Collection Node
Example Interactions: Participant View
User Browser
HTML Data
Participant
Node
Aggregated
Specimen Index
Data
Specimen
Summary Data
GBIF
Portal
Collection Node
Collection Node
Example Interactions: Participant Network
User Browser
HTML Data
GBIF
Portal
Specimen Summary
Data
Specimen Data
(Network-specific
Format)
Specimen
Summary Data
Participant
Network Node
Network Collection Nodes
Collection Node
Example Interactions: Duplicated Node
User Browser
One of the Collection Nodes is linked to the
GBIF portal by multiple paths. The
intervening nodes must retain all attribution
details from the original provider to ensure
that duplicate records may be recognised.
HTML Data
GBIF
Portal
Specimen Summary
Data
Specimen Data
(Network-specific
Format)
Participant Network Collection Nodes
Specimen
Summary Data
Collection Node
GBIF Data Access and
Database Interoperability
Walkthrough of Registration for
new Data Node
1. Data Node submits Metadata
User Browser
Participant
Node
GBIF Portal
WSDL description of the Data
Node’s holdings, registered
via UDDI
The Node Manager uses the Metadata
Services component to define the data
held by the node and to submit this data to
a Participant Node (either the Country
Node for the Data Node or a Node with
related interests.
Data Node
2. Participant Node authorises Metadata
User Browser
The Participant Node Manager uses the
Registry Management component to define
the data held by the node and to submit
this data to a Participant Node (either the
Country Node for the Data Node or a Node
with related interests.
Participant
Node
WSDL description of the Data
Node’s holdings, propagated
to GBIF Portal Registry
Data Node
GBIF Portal
3. GBIF Portal indexes Data Node
User Browser
The GBIF Portal requests the Index data
required to use the Data Node when
processing user requests, and the Data
Node responds with Index data
Participant
Node
GBIF Portal
DiGIR Scan request to
retrieve index data
DiGIR Response containing
index data
Data Node
GBIF Data Access and
Database Interoperability
Walkthrough of User Request for
Data
1. User submits Query to Participant Node
The User completes a form on the
Participant Node web site (or, alternatively
on the GBIF Portal). The request is
handled by a Java servlet in the Participant
Node.
User Browser
Request for Specimen data
for given species (HTML form
submission)
Participant
Node
GBIF Portal
Data Node
2. Participant Node queries GBIF Portal
User Browser
DiGIR Request for ABCD
Specimen data for given
species within geographic
scope of Participant Node
Participant
Node
GBIF Portal
The Participant Node Services component
adds any parameters specific to the Node
(e.g. to filter to records specific to a given
country) and forwards a DiGIR request to
the GBIF Portal. The request is handled
by a Java servlet in the Portal Services
component
Data Node
3. GBIF Portal returns Matches
User Browser
Participant
Node
GBIF Portal
DiGIR Response containing
summary detail and access
information for all Specimen
records matching request.
Data Node
The Portal Services component searches
the Index data to determine the locations of
all matching Specimens held by Data
Nodes. The results are returned in XML as
a DiGIR response.
4. Participant Node returns Matches
User Browser
HTML Response containing
summary detail and URLs to
access detailed information
for all Specimen records
matching request.
Participant
Node
GBIF Portal
The Presentation Services component
transforms the XML response into HTML
for the user, including URLs to access the
detailed data from the Data Nodes.
Data Node
5. User requests Detailed Data
The User selects a URL to access detailed
data for a Specimen.
User Browser
Participant
Node
GBIF Portal
HTML Request for
detail for a
Specimen record.
Data Node
6. Data Node returns Detailed Data
User Browser
Participant
Node
GBIF Portal
HTML Response with
detailed data formatted
for browser.
Note: The requests for detail could
alternatively be directed to either the
Participant Node or the GBIF Portal, either
of which could then format XML into HTML
appropriately. The model illustrated is the
simplest.
The Data Node processes the request (and
produces an XML response) and formats
the XML into HTML using the Presentation
Services component.
Data Node
GBIF DADI Work Programme
1. Establish the data standards and interchange
mechanisms required to integrate species-level and
specimen-level data within the GBIF Network - $90,000
2. Identify and develop the foundational components of the
network as early as possible - $70,000
3. Provide tool kits to assist node managers to bring their
databases online as rapidly as possible - $140,000
4. Promote a community (and open-source) development
model for GBIF components
GBIF Standards Requirements
Requirements for 2003
• Federated Data Access (DiGIR)
• Collection Data Exchange Schema (ABCD)
• Name Data Exchange Schema (Taxonomic Names)
• Geographical Services (Spatial Data)
Future requirements:
• Structured Descriptive Data (SDD)
Also required:
• Description of user requirements for GBIF network