Metadata Standards in GEOSS Clearinghouse

Kai Liu, Phil Yang, Huayi Wu, Wenwen Li, Zhenlong Li, Tim Miao, Qunying Huang,
Center for Intelligent Spatial Computing, George Mason University
Douglas Nebert
Federal Geographic Data Committee (FGDC)
Contents
1. Geospatial Metadata and Clearinghouse
2. GEOSS Clearinghouse
3. GEOSS Clearinghouse Administration and Functions
4. Metadata Transformation in Harvesting
5. Metadata Creating
6. Metadata Search
1. Geospatial Metadata and Clearinghouse
 Metadata: “Data about Data”
 Geospatial metadata: used to describe how geospatial & attribute data was collected & processed into its final form
 Allows agencies to:
 Organize and maintain an organization's data investment
 Share data set information through data catalogs and clearinghouses
 Provide access for download or display and analysis through web services
 Allows users to:
 Discover,
 evaluate, and
 access geodata resources.
 Standards
 Allows communication between peers;
 Minimizes data integration time … costs
 Permits data operations under known conditions
 Numerous Geographic Metadata standards exist:
 Dublin Core,
 FGDC CSDGM,
 ISO19115 (UML)
 ISO19139 (XML)
 OGC WxS Capability Document
Dublin Core
 Year 1995: NCSA/OCLC Metadata workshop, Dublin, Ohio
 15 elements in Simple Dublin Core Metadata, all optional and all repeatable:
 Title
 Contributor
 Source
 Subject
 Date
 Language
 Description
 Type
 Relation
 Creator
 Format
 Coverage
 Publisher
 Identifier
 Rights
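A minimal sketch of how such a record could be assembled, using only the Python standard library. The `dc` namespace URI and the element names come from the Dublin Core element set; the `record` wrapper element and all sample values are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace
ET.register_namespace("dc", DC_NS)


def build_dc_record(fields):
    """Build a Simple Dublin Core record from (element, value) pairs.

    Every element is optional and repeatable, so a list of pairs is used
    instead of a dict (a dict could not hold two 'subject' entries).
    """
    record = ET.Element("record")  # hypothetical wrapper element
    for name, value in fields:
        child = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        child.text = value
    return record


# Illustrative values only.
sample = build_dc_record([
    ("title", "Sample land cover dataset"),
    ("creator", "Example Agency"),
    ("subject", "land cover"),
    ("subject", "remote sensing"),  # repeated element
    ("date", "2000-08-21"),
])
xml_text = ET.tostring(sample, encoding="unicode")
```

Because no element is mandatory, the same function accepts an empty list and simply returns an empty wrapper.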
FGDC CSDGM
 CSDGM is the current US Federal Metadata standard.
 Standard Organized into 10 sections
 Section 1: Identification Information
 Section 2: Data Quality Information
 Section 3: Spatial Data Organization Information
 Section 4: Spatial Reference Information
 Section 5: Entity and Attribute Information
 Section 6: Distribution Information
 Section 7: Metadata Reference Information
 Section 8: Citation Information
 Section 9: Time Period Information
 Section 10: Contact Information
FGDC CSDGM Example
ISO 19115
 International consensus being built on structured metadata within a broader Geomatics Standard under ISO Technical Committee 211 (TC 211)
 UML abstract model
 ISO 19115 reflects FGDC, TC 287, ANZLIC and other inputs
 Specifies content guidance (field vocabulary and structures), like FGDC
 ANSI adoption 12/24/2003
ISO 19115 Organization
[Diagram of the ISO 19115 metadata packages. Labels: Metadata, Metadata Contact, Data Identification Info, Content Information, Distribution Info, Spatial Representation, Online Source, Data Quality Info, Reference System, Language, Character Set, Hierarchy, Standard Name, Version, Date, Responsible Party, Citation, Metadata Maintenance, Metadata Constraints, Application Schema Info, Portrayal Catalog Info, Extension Information. Legend: Required, Conditional, or Optional.]
Core ISO Metadata
 Title
 Abstract
 Reference Date
 Format
 Responsible Party
 Extent
 Geographic location
 Representation Type
 Language
 Character set
 Topic Category
 Scale
 Reference System
 Lineage
 On-line Resource
ISO 19139
 XML Schema implementation of ISO 19115
Metadata Storage and Access
Storage:
 File system
 Database
Access: browse, request
 CSW
 SRU
 HtmlParser
 GetCapabilities
 Local Filesystem
Clearinghouse
2. GEOSS Clearinghouse
 The GEOSS Clearinghouse is a common search facility for the intergovernmental Group on Earth Observations (GEO), based on GeoNetwork opensource.
 Through its harvesting mechanism, the GEOSS Clearinghouse collects all types of metadata for Earth observation (EO) data, services, and other resources.
 Through distributed search of harvested metadata or registered metadata catalogues, EO data, services, and related resources can be discovered and accessed.
 The GEOSS Clearinghouse supports a variety of geospatial standards:
 Dublin Core, FGDC CSDGM, and ISO 19115/19139 for metadata,
 CSW, RSS, and SRU for search,
 WMS and related OGC standards for data access and visualization, as linked from the metadata.
GEOSS Clearinghouse Main Page
Supports Multiple languages
Architecture
 The GEOSS Clearinghouse is a web application with a service-based architecture built on Jeeves (Java Easy Engine for Very Effective Systems). Jeeves makes it possible to publish simple systems on the Internet with little effort and without knowledge of the Java language.
 All interfaces are expressed through XML and XSL files.
Jeeves Architecture
Business Logic
3. GEOSS Clearinghouse Administration and Functions
 Administration
 User Management
 Search
 Harvesting
 Metadata Edit and Management
 Visualization
Administration
User Management
Harvesting
Metadata Edit and Management
Editing
Metadata Visualization
4. Metadata Transformation in Harvesting
 Harvesting is the process of collecting remote metadata and storing it locally for faster access.
 Currently, about 26,000 metadata records have been harvested into the GEOSS Clearinghouse.
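The collect-and-store-locally idea can be sketched as follows. This is an illustration under stated assumptions, not the clearinghouse's actual harvester: the `harvest` function, the `fake_remote` source, and the use of a date stamp to detect changes are all hypothetical.

```python
from typing import Callable, Dict, Iterable, Tuple

# (identifier, date stamp, raw XML) as delivered by a remote catalogue
RemoteRecord = Tuple[str, str, str]


def harvest(fetch_remote: Callable[[], Iterable[RemoteRecord]],
            local_store: Dict[str, Tuple[str, str]]) -> Dict[str, int]:
    """Copy remote records into local_store, keyed by identifier.

    A record is written when it is new or when its date stamp differs
    from the stored copy; otherwise it is skipped.
    """
    stats = {"added": 0, "updated": 0, "unchanged": 0}
    for identifier, date_stamp, xml_text in fetch_remote():
        stored = local_store.get(identifier)
        if stored is None:
            stats["added"] += 1
        elif stored[0] != date_stamp:
            stats["updated"] += 1
        else:
            stats["unchanged"] += 1
            continue
        local_store[identifier] = (date_stamp, xml_text)
    return stats


def fake_remote():
    """Stand-in for a remote catalogue (a real one would be CSW, WAF, ...)."""
    return [
        ("rec-1", "2010-01-01", "<metadata/>"),
        ("rec-2", "2010-02-01", "<metadata/>"),
    ]


store = {"rec-1": ("2009-12-01", "<metadata/>")}
result = harvest(fake_remote, store)
```

Once the records are in `store`, searches run against the local copy and no longer depend on the remote catalogue being reachable.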
Where Does GEOSS Harvest From?
The GEOSS Clearinghouse harvests through the following protocols:
 Z39.50 "GeoProfile" (manually-initiated, under testing)
 CSW 2.0.2 baseline, AP ISO, ebRIM with no extensions
 WebDAV, sitemaps, and Web Accessible Folders (WAF)
 OGC GetCapabilities (WMS, WFS, WCS) endpoints
 Local file access for batch ingest of packaged, static metadata
The following protocols are also available to the system:
 THREDDS Catalog
 OAI-PMH
 ISO 23950 "SRU"
 GeoNetwork "native"
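To illustrate what harvesting from an OGC GetCapabilities endpoint involves, the sketch below reads a service title and its named layers from a capabilities document. The sample document is a heavily trimmed, made-up WMS 1.1.1 response, and `summarize_capabilities` is a hypothetical helper; WMS 1.3.0 documents use XML namespaces and would need namespace-aware paths.

```python
import xml.etree.ElementTree as ET

# Trimmed, illustrative WMS 1.1.1 capabilities document.
SAMPLE_CAPABILITIES = """<WMT_MS_Capabilities version="1.1.1">
  <Service>
    <Name>OGC:WMS</Name>
    <Title>Example map service</Title>
  </Service>
  <Capability>
    <Layer>
      <Title>Root layer</Title>
      <Layer>
        <Name>landcover</Name>
        <Title>Land cover</Title>
      </Layer>
      <Layer>
        <Name>rivers</Name>
        <Title>Rivers</Title>
      </Layer>
    </Layer>
  </Capability>
</WMT_MS_Capabilities>"""


def summarize_capabilities(xml_text):
    """Return the service title and the named layers of a WMS document."""
    root = ET.fromstring(xml_text)
    service_title = root.findtext("Service/Title")
    # Only layers with a direct Name child can be requested,
    # so unnamed grouping layers are skipped.
    layers = [
        {"name": layer.findtext("Name"), "title": layer.findtext("Title")}
        for layer in root.iter("Layer")
        if layer.find("Name") is not None
    ]
    return {"service_title": service_title, "layers": layers}


summary = summarize_capabilities(SAMPLE_CAPABILITIES)
```

A harvester could turn each entry in `summary["layers"]` into its own metadata record, with the service title as shared context.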
Harvest From Example: CSR
Harvest From Example: USGS WAF
Metadata in the following three standards can be harvested directly:
 Dublin Core
 FGDC CSDGM
 ISO 19115, ISO 19139
Other formats must be converted to ISO 19139.
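Deciding whether a harvested record can be taken directly or needs conversion starts with recognizing its format. A minimal sketch, assuming the format can be told from the root element alone: `metadata` is the FGDC CSDGM XML root and `gmd:MD_Metadata` the ISO 19139 root, while treating `simpledc` as a Dublin Core root and the returned labels are assumptions for this example.

```python
import xml.etree.ElementTree as ET

GMD_NS = "http://www.isotc211.org/2005/gmd"
CSW_NS = "http://www.opengis.net/cat/csw/2.0.2"


def detect_format(xml_text):
    """Guess the metadata format of a record from its root element."""
    tag = ET.fromstring(xml_text).tag
    if tag == f"{{{GMD_NS}}}MD_Metadata":
        return "iso19139"
    if tag == "metadata":  # FGDC CSDGM XML root element
        return "fgdc-csdgm"
    if tag in (f"{{{CSW_NS}}}Record", "simpledc"):  # "simpledc" is assumed
        return "dublin-core"
    # Anything else would need a stylesheet transform to ISO 19139.
    return "unknown"
```

For example, `detect_format("<metadata><idinfo/></metadata>")` yields `"fgdc-csdgm"`, and an unrecognized root yields `"unknown"`.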
Metadata Formats Transformation
The following "Full" metadata formats are recognized and parsed into ISO 19139 metadata for ingest:
 ISO 19115, 19119 XML (no transform, per 19139)
 ebRIM common information model XML, including the CSR form
 FGDC CSDGM Version 2 XML (no transform)
 CSW csw:Record XML (no transform)
 OGC GetCapabilities XML and other XML
Stylesheets are used for the metadata format transformation:
Original Metadata -> Stylesheet (*.xsl) -> ISO Metadata
 Stylesheet example:
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0"
xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:template match="metadata">
<uuid><xsl:value-of select="mdFileID"/></uuid>
</xsl:template>
</xsl:stylesheet>
Metadata Transformation in CSR Harvesting
 CSR is short for the GEOSS Component and Service Registry.
 ebRIMcomponent2ISO19139.xsl is used to translate component records:
<gmd:fileIdentifier>
<gco:CharacterString>
<xsl:value-of
select="csw:GetRecordByIdResponse/rim:Organization/@*[name()='id']"/>
</gco:CharacterString>
</gmd:fileIdentifier>
 Likewise, ebRIMservice2ISO19139.xsl is used to translate service records:
<gmd:fileIdentifier>
<gco:CharacterString>
<xsl:value-of
select="csw:GetRecordByIdResponse/rim:Service/@*[name()='id']"/>
</gco:CharacterString>
</gmd:fileIdentifier>
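The mapping performed by the service fragment above can be mirrored in Python for readers who want to test it outside an XSLT processor. This is a stand-in written with `xml.etree.ElementTree`, not the stylesheet itself; the sample response and the `file_identifier_from_ebrim` helper are hypothetical.

```python
import xml.etree.ElementTree as ET

NS = {
    "csw": "http://www.opengis.net/cat/csw/2.0.2",
    "rim": "urn:oasis:names:tc:ebxml-regrep:xsd:rim:3.0",
}
GMD = "http://www.isotc211.org/2005/gmd"
GCO = "http://www.isotc211.org/2005/gco"
ET.register_namespace("gmd", GMD)
ET.register_namespace("gco", GCO)

# Illustrative response; a real CSR record carries many more elements.
SAMPLE_RESPONSE = """<csw:GetRecordByIdResponse
    xmlns:csw="http://www.opengis.net/cat/csw/2.0.2"
    xmlns:rim="urn:oasis:names:tc:ebxml-regrep:xsd:rim:3.0">
  <rim:Service id="urn:example:service:42"/>
</csw:GetRecordByIdResponse>"""


def file_identifier_from_ebrim(xml_text, rim_element="Service"):
    """Copy the id of a rim:* element into a gmd:fileIdentifier element."""
    root = ET.fromstring(xml_text)  # the csw:GetRecordByIdResponse element
    node = root.find(f"rim:{rim_element}", NS)
    if node is None:
        raise ValueError(f"no rim:{rim_element} element in response")
    file_id = ET.Element(f"{{{GMD}}}fileIdentifier")
    ET.SubElement(file_id, f"{{{GCO}}}CharacterString").text = node.get("id")
    return file_id


element = file_identifier_from_ebrim(SAMPLE_RESPONSE)
```

Passing `rim_element="Organization"` would follow the component fragment instead.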
Example: Original Rim Metadata
Example: The Same Record in
GEOSS Clearinghouse
Demo: Harvesting
5. Metadata Creating
 When creating metadata, a template can be chosen. Templates such as the following have been developed:
FGDC template
Dublin Core Template
6. Metadata Search
 Local search
Using Lucene to index metadata
 Remote search
CSW Search
http://clearinghouse.cisc.gmu.edu/srv/en/csw
RSS Search
http://clearinghouse.cisc.gmu.edu/srv/en/rss.search
SRU Search
http://clearinghouse.cisc.gmu.edu/srv/en/portal.sru
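As a small illustration of a remote request, the sketch below builds a key-value-pair GetCapabilities URL for the CSW endpoint listed above. The `service`, `version`, and `request` parameters are standard CSW 2.0.2; the helper name is hypothetical, nothing is sent over the network, and whether the endpoint is still online is not assumed.

```python
from urllib.parse import urlencode

CSW_ENDPOINT = "http://clearinghouse.cisc.gmu.edu/srv/en/csw"  # from the slide


def csw_get_capabilities_url(endpoint=CSW_ENDPOINT):
    """Build a CSW 2.0.2 GetCapabilities URL in key-value-pair form."""
    params = {"service": "CSW", "version": "2.0.2", "request": "GetCapabilities"}
    return f"{endpoint}?{urlencode(params)}"


url = csw_get_capabilities_url()
```

The capabilities response is the usual first step for a client, since it advertises the operations and queryables the catalogue supports.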
Advanced Search
ISO Query-ables
Name | Definition | Data Type | Property Mapping to Information Model
Subject | The topic of the content of the resource | CharacterString | MD_Metadata.identificationInfo.AbstractMD_Identification.descriptiveKeywords.MD_Keywords.keyword plus MD_Metadata.identificationInfo.MD_DataIdentification.topicCategory
Title | A name given to the resource | CharacterString | MD_Metadata.identificationInfo.AbstractMD_Identification.citation.CI_Citation.title
Abstract | A summary of the content of the resource | CharacterString | MD_Metadata.identificationInfo.AbstractMD_Identification.abstract
AnyText | A target for full-text search of character data types in a catalogue | CharacterString | Whole resource text.
Format | The physical or digital manifestation of the resource | CharacterString | MD_Metadata.distributionInfo.MD_Distribution.distributionFormat.MD_Format.name
Identifier | A unique reference to the record within the catalogue | Identifier | MD_Metadata.fileIdentifier
ISO Query-ables (contd.)
Name | Definition | Data Type | Property Mapping to Information Model
Modified | Date on which the record was created or updated within the catalogue | Date-8601, example: 2000-08-21 | MD_Metadata.dateStamp.Date
Type | The nature or genre of the content of the resource. Type can include general categories, genres or aggregation levels of content | Codelist: dataset, datasetcollection, service, application | MD_Metadata.hierarchyLevel.MD_ScopeCode/@codeListValue. If MD_Metadata.hierarchyLevel is missing, ‘Type’ will be considered as “Dataset” (default).
BoundingBox | A bounding box for identifying a geographic area of interest | BoundingBox | BoundingBox: WestBoundLongitude, SouthBoundLatitude, EastBoundLongitude, NorthBoundLatitude
CRS | Geographic Coordinate Reference System (Authority and ID) for the BoundingBox | Identifier | CRS Authority: MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.referenceSystemIdentifier.RS_Identifier.codeSpace; ID: MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.referenceSystemIdentifier.RS_Identifier.code; Version: MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.referenceSystemIdentifier.RS_Identifier.version
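A rough sketch of how a few of these queryables could be evaluated against an ISO 19139 record, including the rule that a missing hierarchyLevel counts as a dataset. The paths are the usual ISO 19139 XML encoding of the UML properties in the table; the `queryable_value` helper, the lowercase `"dataset"` default, and the trimmed sample record (not schema-valid) are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

NS = {
    "gmd": "http://www.isotc211.org/2005/gmd",
    "gco": "http://www.isotc211.org/2005/gco",
}

# Queryable name -> ElementTree path in the ISO 19139 encoding.
QUERYABLE_PATHS = {
    "Identifier": "gmd:fileIdentifier/gco:CharacterString",
    "Title": ("gmd:identificationInfo/*/gmd:citation/gmd:CI_Citation"
              "/gmd:title/gco:CharacterString"),
    "Abstract": "gmd:identificationInfo/*/gmd:abstract/gco:CharacterString",
}


def queryable_value(record, name):
    """Return the value of one queryable for a parsed gmd:MD_Metadata."""
    if name == "Type":
        code = record.find("gmd:hierarchyLevel/gmd:MD_ScopeCode", NS)
        # A missing hierarchyLevel is treated as a dataset, per the table.
        return code.get("codeListValue") if code is not None else "dataset"
    return record.findtext(QUERYABLE_PATHS[name], namespaces=NS)


# Trimmed, illustrative record without a hierarchyLevel element.
SAMPLE = ET.fromstring("""<gmd:MD_Metadata
    xmlns:gmd="http://www.isotc211.org/2005/gmd"
    xmlns:gco="http://www.isotc211.org/2005/gco">
  <gmd:fileIdentifier>
    <gco:CharacterString>example-uuid-0001</gco:CharacterString>
  </gmd:fileIdentifier>
  <gmd:identificationInfo>
    <gmd:MD_DataIdentification>
      <gmd:citation><gmd:CI_Citation><gmd:title>
        <gco:CharacterString>Sample dataset</gco:CharacterString>
      </gmd:title></gmd:CI_Citation></gmd:citation>
      <gmd:abstract>
        <gco:CharacterString>Illustrative abstract.</gco:CharacterString>
      </gmd:abstract>
    </gmd:MD_DataIdentification>
  </gmd:identificationInfo>
</gmd:MD_Metadata>""")
```

With this sample, `queryable_value(SAMPLE, "Type")` falls back to the default because the record has no hierarchyLevel.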
Example 1: CSW POST Request
<?xml version="1.0" encoding="UTF-8"?>
<csw:GetRecords
    xmlns="http://www.opengis.net/cat/csw/2.0.2"
    xmlns:csw="http://www.opengis.net/cat/csw/2.0.2"
    xmlns:gmd="http://www.isotc211.org/2005/gmd"
    xmlns:ogc="http://www.opengis.net/ogc"
    xmlns:gml="http://www.opengis.net/gml"
    xmlns:rim="urn:oasis:names:tc:ebxml-regrep:xsd:rim:3.0"
    service="CSW"
    version="2.0.2"
    outputFormat="application/xml"
    outputSchema="http://www.isotc211.org/2005/gmd"
    resultType="results"
    startPosition="1"
    maxRecords="5">
  <csw:Query typeNames="csw:Record">
    <csw:ElementSetName>full</csw:ElementSetName>
    <csw:Constraint version="1.1.0">
      <ogc:Filter>
        <ogc:PropertyIsLike escapeChar="\" singleChar="?" wildCard="*">
          <ogc:PropertyName>Type</ogc:PropertyName>
          <ogc:Literal>dataset</ogc:Literal>
        </ogc:PropertyIsLike>
      </ogc:Filter>
    </csw:Constraint>
  </csw:Query>
</csw:GetRecords>
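One way a client might prepare this request, sketched with the Python standard library. The endpoint URL is the CSW address given earlier in the slides; the template is a shortened form of the request above, and the `build_get_records_request` helper and its parameters are hypothetical. The request object is only constructed here; sending it assumes network access and a live endpoint.

```python
import urllib.request
from xml.sax.saxutils import escape

CSW_ENDPOINT = "http://clearinghouse.cisc.gmu.edu/srv/en/csw"  # from the slides

# Shortened form of the GetRecords request, with two placeholders.
GET_RECORDS_TEMPLATE = """<?xml version="1.0" encoding="UTF-8"?>
<csw:GetRecords xmlns:csw="http://www.opengis.net/cat/csw/2.0.2"
    xmlns:ogc="http://www.opengis.net/ogc"
    service="CSW" version="2.0.2" resultType="results"
    outputSchema="http://www.isotc211.org/2005/gmd"
    startPosition="1" maxRecords="{max_records}">
  <csw:Query typeNames="csw:Record">
    <csw:ElementSetName>full</csw:ElementSetName>
    <csw:Constraint version="1.1.0">
      <ogc:Filter>
        <ogc:PropertyIsLike escapeChar="\\" singleChar="?" wildCard="*">
          <ogc:PropertyName>Type</ogc:PropertyName>
          <ogc:Literal>{type_value}</ogc:Literal>
        </ogc:PropertyIsLike>
      </ogc:Filter>
    </csw:Constraint>
  </csw:Query>
</csw:GetRecords>"""


def build_get_records_request(type_value, max_records=5, endpoint=CSW_ENDPOINT):
    """Prepare (but do not send) a CSW GetRecords POST request."""
    body = GET_RECORDS_TEMPLATE.format(
        type_value=escape(type_value),  # keep the literal XML-safe
        max_records=int(max_records),
    ).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={"Content-Type": "application/xml"},
        method="POST",
    )


request = build_get_records_request("dataset")
# To send it (network access and a live endpoint are assumed):
# with urllib.request.urlopen(request, timeout=30) as response:
#     print(response.read().decode("utf-8"))
```

Changing `type_value` to `"service"` would search for service records instead, using another value from the Type codelist.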
Example 2: RSS Request
Example 3: SRU Request
GEOSS Clearinghouse Demonstration
http://clearinghouse.cisc.gmu.edu/geonetwork/
Acknowledgements & Further Info
 Website: The clearinghouse is operating at http://clearinghouse.cisc.gmu.edu/geonetwork
 Contacts:
Chaowei Phil Yang: [email protected]
Douglas Nebert: [email protected]
 The project is supported by an FGDC grant, the ESIP Products & Services and Air Quality Committees, and a NASA Geospatial Interoperability grant.
Kai Liu, Chaowei Phil Yang, Huayi Wu, Wenwen Li,
Zhenlong Li, Tim Miao, Qunying Huang, Douglas Nebert
Center for Intelligent Spatial Computing
George Mason University & FGDC
http://cisc.gmu.edu/
Let’s Get Started
Multilanguage
Metadata
Let’s Check the metadata
Let’s check the metadata standards
Let’s check the metadata standards
Let’s check the metadata in XML
Categories
Recent Changes
Advanced Search
Create New Metadata
Create New Metadata(1)
Create New Metadata(2)
Create New Metadata(3)
Create New Metadata(4)
Create New Metadata(5)
Create New Metadata(5)
Create New Metadata(6)