Metadata Standards in GEOSS Clearinghouse
Download
Report
Transcript Metadata Standards in GEOSS Clearinghouse
Kai Liu, Phil Yang, Huayi Wu, Wenwen Li, Zhenlong Li, Tim Miao, Qunying Huang,
Center for Intelligent Spatial Computing, George Mason University
Douglas Nebert
Federal Geographic Data Committee (FGDC)
Contents
1. Geospatial Metadata and Clearinghouse
2. GEOSS Clearinghouse
3. GEOSS Clearinghouse Administration and
Functions
4. Metadata Transformation in Harvesting
5. Metadata Creating
6. Metadata Search
1. Geospatial Metadata and
Clearinghouse
Metadata: “Data about Data”
Geospatial metadata: used to describe how geospatial
& attribute data was collected & processed into its final
form
Allows agencies to:
Organize and maintain an organization's data
investment
Share data set information through data catalogs and
clearinghouses
Provide access for download or display and analysis
through web services
Allows users to:
Discover,
evaluate, and
access geodata resources.
Standards
Allows communication between peers;
Minimizes data integration time … costs
Permits data operations under known conditions
Numerous Geographic Metadata standards exist:
Dublin Core,
FGDC CSDGM,
ISO19115 (UML)
ISO19139 (XML)
OGC WxS Capability Document
Dublin Core
Year 1995: NCSA/OCLC Metadata workshop, Dublin,
Ohio
15 elements in Simple Dublin Core Metadata: all
optional, all repeatable,
Title
Contributor
Source
Subject
Date
Language
Description
Type
Relation
Creator
Format
Coverage
Publisher
Identifier
Rights
Dublin Core
FGDC CSDGM
CSDGM is the current US Federal Metadata standard.
Standard Organized into 10 sections
Section 1: Identification Information
Section 2: Data Quality Information
Section 3: Spatial Data Organization Information
Section 4: Spatial Reference Information
Section 5: Entity and Attribute Information
Section 6: Distribution Information
Section 7: Metadata Reference Information
Section 8: Citation Information
Section 9: Time Period Information
Section 10: Contact Information
FGDC CSDGM Example
ISO 19115
International consensus being built on structured metadata
within a broader Geomatics Standard under ISO Technical
Committee 211 (TC 211)
UML abstract model
ISO 19115 reflects FGDC, TC 287, ANZLIC and other inputs
Specifies content guidance (field vocabulary and structures),
like FGDC
ANSI adoption 12/24/2003
ISO 19115
Organization
Metadata Contact
Data Identification Info
Content Information
Distribution Info
Spatial Representation
Metadata
Online Source
Data Quality Info
Reference System
Language
Character Set
Hierarchy
Standard Name,
Version
Date
Responsible Party
Citation
Metadata Maintenance
Metadata Constraints
Application Schema Info
Portrayal Catalog Info
Extension Information
Required
Conditional
or Optional
Core ISO Metadata
Title
Abstract
Reference Date
Format
Responsible Party
Extent
Geographic
Representation Type
location
Language
Characterset
Topic Category
Scale
Reference System
Lineage
On-line Resource
ISO 19139
XML Schema implementation of ISO 19115
Metadata Storage and Access
Storage:
File system
Database
Access: browse, request
CSW
SRU
HtmlParser
GetCapabilities
Local Filesystem
Clearinghouse
2. GEOSS Clearinghouse
The GEOSS Clearinghouse is a common search facility
for the Intergovernmental Group on Earth Observation
(GEO) based the GeoNetwork Opensource.
Through the harvesting mechanism, GEOSS
clearinghouse collects all types of metadata of EO data,
services and other resources
Through distributed search of harvested metadata or
registered metadata catalogues, EO data, services, and
related resources can be discovered and accessed.
The GEOSS Clearinghouse supports a variety of
geospatial standards:
Dublin Core, FGDC CSDGM, ISO 19115/19139 metadata,
CSW, RSS, and SRU for search,
WMS related OGC standards for data access and
visualization, as linked from the metadata.
GEOSS Clearinghouse Main Page
Supports Multiple languages
Architecture
GEOSS clearinghouse uses Web application with
service based architecture (Jeeves) which is a Java
Easy Engine for Very Effective Systems. Publishing
of the simple systems on the Internet by Jeeves is
possible without Java language knowledge and with
little effort.
All the interfaces are expressed through XML and
XSL files
Jeeves Architecture
Business Logic
3. GEOSS Clearinghouse
Administration and Functions
Administration
User Management
Search
Harvesting
Metadata Edit and Management
Visualization
Administration
User Management
Harvesting
Metadata Edit and Management
Editing
Metadata Visualization
4. Metadata Transformation in Harvesting
Harvesting is the process of collecting
remote metadata and storing them locally
for a faster access.
Currently, about 26 000 metadata were
harvested to the GEOSS clearinghouse.
Where GEOSS Harvest From?
The GEOSS Clearinghouse is harvesting from the following protocols:
Z39.50 "GeoProfile" (manually-initiated, under testing)
CSW 2.0.2 baseline, AP ISO, ebRIM with no extensions
WebDAV, sitemaps, and Web Accessible Folders (WAF)
OGC GetCapabilities (WMS, WFS, WCS) endpoints
Local file access for batch ingest of packaged, static metadata
Also available to the system, are the following protocols:
THREDDS Catalog
OAI-PMH
ISO 23950 "SRU"
GeoNetwork "native"
Harvest From Example: CSR
Harvest From Example: USGS WAF
Metadata of the following three standards can be harvested
directly.
Dublin-core
FGDC CSDGM
ISO19115, ISO19139
Other formats must be converted to ISO19139
Metadata Formats Transformation
As for formats, the following "Full" metadata formats are
recognized and parsed into ISO 19139 metadata for ingest:
ISO 19115, 19119 XML (no transform, per 19139)
ebRIM common information model XML, including the
CSR form
FGDC CSDGM Version 2 XML (no transform)
CSW csw:Record XML (no transform)
OGC GetCapabilities XML and other XML
Stylesheets will be used for the Metadata Formats
Transformation
Stylesheet(
*.xsl)
Original
Metadata
ISO Metadata
Stytlesheet example:
<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="1.0"
xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
<xsl:template match="metadata">
<uuid><xsl:value-of select="mdFileID"/></uuid>
</xsl:template>
</xsl:stylesheet>
Metadata Transformation in CSR Harvesting
CSR: which is short for the GEOSS Component and Service
Registry.
ebRIMcomponent2ISO19139.xsl will be used for component’s
translating
<gmd:fileIdentifier>
<gco:CharacterString>
<xsl:value-of
select="csw:GetRecordByIdResponse/rim:Organization/@*[name()='id']"/>
</gco:CharacterString>
</gmd:fileIdentifier>
Also, ebRIMservice2ISO19139.xsl will be used for service’s
translating
<gmd:fileIdentifier>
<gco:CharacterString>
<xsl:value-of
select="csw:GetRecordByIdResponse/rim: Service /@*[name()='id']"/>
</gco:CharacterString>
</gmd:fileIdentifier>
Example: Original Rim Metadata
Example: The Same Record in
GEOSS Clearinghouse
Demo: Harvsting
Demo: Harvsting
Demo: Harvsting
Demo: Harvsting
Demo: Harvsting
5. Metadata Creating
In the process of creating metadata, a template can be
chosen. Some templates have been developed like
below:
FGDC template
Dublin Core Template
6. Metadata Search
Local search
Using Lucene to index metadata
Remote search
CSW Search
http://clearinghouse.cisc.gmu.edu/srv/en/csw
RSS Search
http://clearinghouse.cisc.gmu.edu/srv/en/rss.search
SRU Search
http://clearinghouse.cisc.gmu.edu/srv/en/portal.sru
Advanced Search
ISO Query-ables
Name
Definition
Data Type
Property Mapping to Information Model
Subject
The topic of the
content of the
resource
CharacterString
MD_Metadata.identificationInfo.AbstractMD_Identification.de
scriptiveKeywords.MD_Keywords.keyword
plus
MD_Metadata.identificationInfo.MD_DataIdentification.topicC
ategory
Title
A name given to the
resource
CharacterString
MD_Metadata.identificationInfo.AbstractMD_Identification.cit
ation.CI_Citation.title
Abstract
A summary of the
content of the
resource
CharacterString
MD_Metadata.identificationInfo.AbstractMD_Identification.ab
stract
AnyText
A target for full-text
search of
character data
types in a
catagloue
CharacterString
Whole resource text.
Format
The physical or
digital
manifestation of
the resource
CharacterString
MD_Metadata.distributionInfo.MD_Distribution.distributionFor
mat.MD_Format.name
Identifier
A unique reference to
the record within
the catalogue
Identifier
MD_Metadata.fileIdentifier
ISO Query-ables (contd.)
Modified
Data on which the
record was created or
updated within the
catalogue
Date-8601, example:
2000-08-21
MD_Metadata.dateStamp.Date
Type
The nature or genre of
the content of the
resource. Type can
include general
categories, generes or
aggregation levels of
content
Codelist: dataset,
datasetcollection,
service, application
MD_Metadata.hierarchyLevel.MD_ScopeCode/@codeListValu
e. If MD_Metadata .hierarchyLevel is missing, ‘Type’ will be
considered as “Dataset” (default).
Bounding
Box
A bounding box for
identifying a
geographic area of
interest
BoundingBox,
BoundingBox
WestBoundLongitude
SouthBoundLatitude
EastBoundLongitude
NorthBoundLatitude
CRS
Geographic
Coordinate Reference
System (Authority
and ID) for the
BoundingBox
Identifier
CRS
Authority:
MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.refe
renceSystemIdentifier.RS_Identifier.codeSpace
ID:
MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.refe
renceSystemIdentifier.RS_Identifier.code
Version:
MD_Metadata.referenceSystemInfo.MD_ReferenceSystem.refe
renceSystemIdentifier.RS_Identifier.version
Example 1: CSW Post Request
<?xml version="1.0" encoding="UTF-8"?>
<csw:GetRecords
xmlns="http://www.opengis.net/cat/csw/2.0.2"
xmlns:csw="http://www.opengis.net/cat/csw/2.0.2"
xmlns:gmd="http://www.isotc211.org/2005/gmd"
xmlns:ogc="http://www.opengis.net/ogc"
xmlns:gml="http://www.opengis.net/gml"
xmlns:rim="urn:oasis:names:tc:ebxml-regrep:xsd:rim:3.0"
service="CSW"
version="2.0.2"
outputFormat="application/xml"
outputSchema="http://www.isotc211.org/2005/gmd"
resultType="results"
startPosition="1"
maxRecords="5">
<csw:Query typeNames="csw:Record">
<csw:ElementSetName>full</csw:ElementSetName>
<csw:Constraint version="1.1.0">
<ogc:Filter>
<ogc:PropertyIsLike escapeChar="\" singleChar="?" wildCard="*">
<ogc:PropertyName>Type</ogc:PropertyName>
<ogc:Literal>dataset</ogc:Literal>
</ogc:PropertyIsLike>
</ogc:Filter>
</csw:Constraint>
</csw:Query>
</csw:GetRecords>
Example 2: RSS Request
Example 3: SRU Request
GEOSS Clearinghouse Demonstration
http://clearinghouse.cisc.gmu.edu/geonetwork/
Acknowledgements & Further Info
Website: The clearinghouse is operating at
http://clearinghouse.cisc.gmu.edu/geonetwork
Contactor:
Chaowei Phil Yang: [email protected]
Douglas Nebert: [email protected]
The project is supported by FGDC grant, ESIP Products &
Services and Air Quality Committees, and NASA Geospatial
Interoperability grant.
Kai Liu, Chaowei Phil Yang, Huayi Wu, Wenwen Li,
Zhenlong Li, Tim Miao, Qunying Huang, Douglas Nebert
Center for Intelligent Spatial Computing
George Mason University & FGDC
http://cisc.gmu.edu/
Let’s get
Started
Multilanguage
Metadata
Let’s Check the metadata
Let’s check the metadata standards
Let’s check the metadata standards
Let’s check the metadata in XML
Categories
Recent Changes
Advanced Search
Create New Metadata
Create New Metadata(1)
Create New Metadata(2)
Create New Metadata(3)
Create New Metadata(4)
Create New Metadata(5)
Create New Metadata(5)
Create New Metadata(6)