World Meteorological Organizations (WMO) Observing and Information Systems Department WMO Information System (WIS) Interoperability Standards at Levels of Syntax and Semantics Presented by Eliot Christian at.
Download ReportTranscript World Meteorological Organizations (WMO) Observing and Information Systems Department WMO Information System (WIS) Interoperability Standards at Levels of Syntax and Semantics Presented by Eliot Christian at.
World Meteorological Organizations (WMO) Observing and Information Systems Department WMO Information System (WIS) Interoperability Standards at Levels of Syntax and Semantics Presented by Eliot Christian <[email protected]> at the First Meeting of WMO / CBS / ISS / ET-ADRS (Expert Team On Assessment Of Data Representation Systems) in Silver Spring, MD, USA on 23 - 25 April 2008 Syntax and Semantics Semantics deals with the meaning of a symbol in some language Syntax deals with handling of symbols independent of their meaning, e.g., ASN.1 (Abstract Syntax Notation) XML (eXtensible Markup Language) Interoperability Standards at Levels of Syntax and Semantics 2 Standards-based Interoperability at the Level of Syntax Structure Types, e.g., sub-element is part of element record is part of file table has three dimensions Data Types, e.g., integers or decimal numbers ... dates, times, coordinates ... text, images, sound ... Indicating structure in an instance, e.g., start-position, length mark-up with start-end tagging Interoperability Standards at Levels of Syntax and Semantics 3 ASN.1 Snippet PDU ::= CHOICE{ searchRequest [22] IMPLICIT SearchRequest, searchResponse [23] IMPLICIT SearchResponse, SearchRequest ::= SEQUENCE{ resultSetName [17] IMPLICIT InternationalString, databaseNames [18] IMPLICIT SEQUENCE OF DatabaseName, query [21] Query SearchResponse ::= SEQUENCE{ resultCount numberOfRecordsReturned nextResultSetPosition searchStatus resultSetStatus records [23] IMPLICIT INTEGER, [24] IMPLICIT INTEGER, [25] IMPLICIT INTEGER, [22] IMPLICIT BOOLEAN, [26] IMPLICIT INTEGER, Records OPTIONAL, Interoperability Standards at Levels of Syntax and Semantics 4 Comparing ASN.1 and XML Syntactic Mechanisms syntax description ASN.1 XML specification schema structure mechanism start-position and length mark-up with start-end tags atomic element character bit encoding mechanism encoding rules Unicode filter (basic, packed...) (ISO 8859-1, UTF-8 ... Interoperability Standards at Levels of Syntax and Semantics 5 ASN.1 and Encoding Rules In ASN.1, concrete syntax ("on-the-wire encoding") is handled separately from the abstract syntax: ASN.1 "Encoding Rules" are applied to make actual messages in compliance with an ASN.1 specification Basic Encoding Rules (BER): represent each byte present, as defined in the specification Packed Encoding Rules (PER): group repeated bytes Distinguished Encoding Rules (DER): represent only bytes that change, on a message to message basis XML Encoding Rules (XER): translate between ASN.1 elements and their XML equivalents Interoperability Standards at Levels of Syntax and Semantics 6 ASN.1 and XML Interoperability using XML Encoding Rules An XML Encoding Rules (XER) tool can generate, automatically and losslessly: an XML schema from an ASN.1 specification OR an ASN.1 specification from an XML schema In addition, XER can be applied at the instance level rather than the schema level, i.e., XER converters inserted at both ends of a communications link will enhance transmission efficiency "on-the-fly" Interoperability Standards at Levels of Syntax and Semantics 7 Example: Common Alerting Protocol Uses XER to Bridge ASN.1 and XML Common Alerting Protocol (CAP) standardizes all-hazards, all-media public warning messages Message formats in the CAP standard were originally defined using XML Schema XER was used to create an ASN.1 specification for CAP messages, required for ITU adoption International Telecommunication Union (ITU) adopted CAP as ITU Recommendation X.1303 Interoperability Standards at Levels of Syntax and Semantics 8 Standards-based Interoperability at the Level of Semantics Interoperability Standards at Levels of Syntax and Semantics 9 Standards-based Interoperability at the Level of Semantics Semantic interoperability is often constrained in practice by a lack of available data definitions It is risky to integrate or transform any data or information element without its explicit definition ISO/IEC 11179, Metadata Registries, provides useful guidance for documenting the meanings of data or information elements The objective is to have good definitions for data and information elements, not to fully automate inferencing (sometimes called "Semantic Web") Interoperability Standards at Levels of Syntax and Semantics 10 Example: CAP Data Dictionary Element Name Context. Class. Attribute. Representation Definition and (Optionality) Notes or Value Domain alert cap. alert. group The container for all component parts of the alert message (REQUIRED) (1) Surrounds CAP alert message sub-elements. (2) MUST include the xmlns attribute referencing the CAP URI as the namespace, e.g.: <cap:alert xmlns:cap="urn:oasis:names:tc:emergency:cap:1.1"> [sub-elements] </cap: alert> (3) In addition to the specified sub-elements, MAY contain one or more <info> blocks. identifier cap. alert. identifier The identifier of the alert message (REQUIRED) (1) A number or string uniquely identifying this message, assigned by the sender (2) MUST NOT include spaces, commas, or restricted characters (< and &) sender cap. alert. sender. identifier The identifier of the sender of the alert message (REQUIRED) (1) Identifies the originator of this alert. Guaranteed by assigner to be unique globally; e. g., may be based on an Internet domain name (2) MUST NOT include spaces, commas, or restricted characters (< and &) sent cap. alert. sent. time The time and date of the origination of the alert message (REQUIRED) The date and time is represented in [dateTime] format (e.g., "2002-05-24T16:49:0007:00" for 24 May 2002 at 16: 49 PDT). status cap. alert. status. code The code denoting the appropriate handling of the alert message (REQUIRED) Code Values: “Actual" Actionable by all targeted recipients “Exercise" Actionable only by designated exercise participants; exercise identifier should appear in <note> “System" For messages that support alert network internal functions. “Test" Technical testing only, all recipients disregard Interoperability Standards at Levels of Syntax and Semantics 11 Actions Proposed Syntactic interoperability should be achieved using automated transformations, based on standardized syntax description languages Syntactic interoperability cannot assure that information is meaningful--precise definitions of data and information elements are necessary ISO/IEC 11179, Metadata Registries, provides useful guidance for defining elements, within and among data dictionaries A policy promoting use of ISO/IEC 11179 would complement broader use of ISO 191xx series of standards, especially ISO 19115 Interoperability Standards at Levels of Syntax and Semantics 12 References: GEOSS (Global Earth Observation System of Systems) 10 Year Implementation Plan and Reference Document http://earthobservations.org/ ISO/IEC 11179 Metadata Registries http://standards.iso.org/ittf/PubliclyAvailableStandards/ c035343_ISO_IEC_11179-1_2004(E).zip Questions? Interoperability Standards at Levels of Syntax and Semantics 13 Using Registries to Manage Data / Metadata Syntax and Semantics Interoperability Standards at Levels of Syntax and Semantics 14 Registries are Tools for Data Management, Information Management, Knowledge Management Several types of “registries” commonly used The types of registries have related content, but extended and utilized in different ways The registries vary according to: intended purpose granularity of contents the level of semantics management Interoperability Standards at Levels of Syntax and Semantics 15 Types of Registries ISO 11179 Registries Database Catalogs UDDI Registries Common Content Common Content Common Content OASIS/ebXML Registries CASE Tool Repositories Common Content Common Content Software Component Registries Common Content Ontological Registries Common Content Interoperability Standards at Levels of Syntax and Semantics 16 Example using "Country Name" as Common Content across Registry Types Database Catalogs ISO 11179 Registries Common Content Common Content Common Content OASIS/ebXML Registries Common Content Software Component Registries UDDI Registries Country Name CASE Tool Repositories Common Content Ontological Registries Common Content Common Content Interoperability Standards at Levels of Syntax and Semantics 17 "Country Name" in a 11179 Registry Data Element Concept Name: Country Identifiers Context: Definition: Unique ID: 5769 Conceptual Domain: Maintenance Org.: Steward: Classification: Registration Authority: Others Algeria Belgium China Denmark Egypt France ... Zimbabwe Data Elements ISO 3166 French Name ISO 3166 2-Alpha Code ISO 3166 3-Alpha Code ISO 3166 3-Numeric Code Algeria L`Algérie DZ DZA 012 Belgium Belgique BE BEL 056 China Chine CN CHN 156 Denmark Danemark DK DNK 208 Egypt Egypte EG EGY 818 France La France FR FRA 250 ... ... ... ... ... Zimbabwe Zimbabwe ZW ZWE 716 ISO 3166 English Name Name: Context: Definition: Unique ID: 4572 Value Domain: Maintenance Org. Steward: Classification: Registration Authority: Others Interoperability Standards at Levels of Syntax and Semantics 18 Registries Address Different Needs Example using "Country Name" as Common Content Database Catalogs Common Content ISO 11179 Registries Data Element Common Content Table Column UDDI Registries Common Content Business Specification OASIS/ebXML Registries XML Tag Country Name Attribute Common Content Software Component Registries Common Content CASE Tool Repositories Common Content Term Hierarchy Business Object Ontological Registries Common Content Interoperability Standards at Levels of Syntax and Semantics 19 Types of Registries: Database Catalogs/Data Dictionaries/ Repositories Database Catalogs Common Content Registers schema, integrity & operational information Includes all elements of database schemas: data elements, relations, integrity constraints Emphasis on information required to make database systems work for queries, etc., not semantic management Interoperability Standards at Levels of Syntax and Semantics 20 Database Catalog Interoperability Standards at Levels of Syntax and Semantics 21 Types of Registries: Universal Description, Discovery, and Integration (UDDI) Registries Registers Web-based Business Services A White page directory of the business activities of firms Emphasis on interface specifications UDDI Registries Common Content Interoperability Standards at Levels of Syntax and Semantics 22 UDDI Registry Business-to-Business <?xml version="1.0"?> <shipTo> <name>Alice Wilson</name> <street>161 North Street</street> <city>Happy Valley</city> <state>MO</state> <zip>63105</zip> <country code>USA</country code> </shipTo> Interoperability Standards at Levels of Syntax and Semantics 23 Types of Registries: Case Tool (Encyclopedias/Repositories) Registers data models and application program logic Includes information needed to create a database and potentially the program code for a system, including database schemas CASE Tool Repositories Common Content Emphasis not on semantic management Interoperability Standards at Levels of Syntax and Semantics 24 Case Tool Repository Interoperability Standards at Levels of Syntax and Semantics 25 Types of Registries: Ontological Registries Registers Concept Structures Includes relations among concepts (subsumption, inheritance,…) and axioms for inference among concepts, e.g., temporal/spatial reasoning, etc. Emphasis is on semantics Ontological Registries Common Content Interoperability Standards at Levels of Syntax and Semantics 26 Ontological Registry Concept Concept Concept Concept Geographic Area Geographic Sub-Area Country Country Identifier Country Name Short Name Mailing Address Country Name Long Name Distributor Country Name Country Code ISO 3166 2-Character Code ISO 3166 3-Numeric Code Interoperability Standards at Levels of Syntax and Semantics ISO 3166 3- Character Code FIPS Code 27 Types of Registries: Software Component Registries Registers reusable software components built on specific vendor platforms such as EJB, COM, CORBA IDL etc. Includes basic common elementary objects and object patterns Emphasis on reusability of software component to build flexible and standardized business systems. Software Component Registries Common Content Interoperability Standards at Levels of Syntax and Semantics 28 An Example of Software Component Registry Interoperability Standards at Levels of Syntax and Semantics 29 Types of Registries: OASIS/ebXML XML Registries Registers XML Artifacts OASIS/ebXML Registries Common Content Includes schemas and DTDs (XML schemata typically relate to a paper form, EDI document, or reporting requirement) Emphasis on syntactic (structure) information Interoperability Standards at Levels of Syntax and Semantics 30 OASIS/ebXML XML Registry Repository Dollars X12 UnitPrice EDIFACT ListPrice Currency VI304 Collaboration Partner #1 Schema or Template <?xml version="1.0"?> <shipTo > <name>Alice Wilson</name> <street>161 North Street</street> <city>Happy Valley</city> <state>MO</state> <zip>63105</zip> <country code>USA</country code> </shipTo> Collaboration Partner #2 Schema or Template <Rep href= “http://www.disa.mil”>DISA</Rep> <Rep href= “http://www.disa.mil”>DISA</Rep> <ELEMENT rep= ‘DISA’ name=‘Country’ Biz =‘VI304’ > <ELEMENT rep= ‘DISA’ name=‘CntryCode’ Biz =‘VI304’ > XML Instance XML Instance <Country>JPN</Country> <CntryCode>JPN</CntryCode> Interoperability Standards at Levels of Syntax and Semantics 31 Types of Registries: ISO 11179 Metadata Registries ISO 11179 Registries Common Content Registers Data Elements, components of data elements and groups of data elements (e.g., country codes for customer place of residence) Includes data element concepts, data elements (including representation), value domains, and (multiple) taxonomies Emphasis on semantic information such as definitions of data elements and value meanings, and stewardship responsibilities Interoperability Standards at Levels of Syntax and Semantics 32 ISO 11179 Registry Data Element List – Address Group 33c Name Street Address City, State Postal Code Country <?xml version="1.0"?> <shipTo > <name>Alice Wilson</name> <street>161 North Street</street> <city>Happy Valley</city> <state>MO</state> <zip>63105</zip> <country code>USA</country code> </shipTo> Interoperability Standards at Levels of Syntax and Semantics 33