United Nations Economic Commission for Europe Statistical Division Part B of CMF: Metadata, Standards Concepts and Models Jana Meliskova UNECE Work Session on Statistical Metadata Geneva, 10-12

Download Report

Transcript United Nations Economic Commission for Europe Statistical Division Part B of CMF: Metadata, Standards Concepts and Models Jana Meliskova UNECE Work Session on Statistical Metadata Geneva, 10-12

United Nations Economic Commission for Europe
Statistical Division
Part B of CMF:
Metadata, Standards Concepts
and Models
Jana Meliskova
UNECE Work Session on Statistical Metadata
Geneva, 10-12 March 2010
Common Metadata Framework
(CMF)

CMF work under the umbrella of Conference
of European Statisticians

CMF aim:
to assist SOs in adoption, modeling, usage
and implementation of SMS across all phases
of their Statistical Business Process

Collective input of NSOs and IOs

Published online via Metis Wiki
www.unece.org/stats/cmf
Jana Melíšková
Slide 2
CMF composition




Part A - Statistical Metadata in a Corporate Context- A guide for
managers
Focused on corporate management of metadata projects.
(Finalized, publication available)
Part B - Metadata , Standards Concepts and Models
Provides information about international standards concepts and
models relevant to statistical metadata.
(Under preparation, presented version available on the Metis Wiki)
Part C - Metadata and the Statistical Business Process
Presents a Generic Business Process Model as a common process
model for statistical surveys.
(Model finalized, available on Metis Wiki)
Part D - Implementation
Focuses on the experiences of national and international statistical
organizations that have recently implemented or re-engineered their
statistical metadata systems.
(Case studies available on Metis Wiki)
Jana Melíšková
Slide 3
Standards for metadata are
indispensable
Use of standards is a challenge
There is a common understanding in SOs
that the use of common standards related
to statistics and metadata is indispensable.
The number and diversity of existing
standards, however, makes it a challenge
for statistical experts to understand them
and to incorporate them efficiently in the
SMS global architecture.
Jana Melíšková
Slide 4
CMF Part B - a draft version
Ad-hoc Task Force convened for Part B in early 2009
 Presented document is not a final version of CMF
Part B.

It is a draft based on:
• National and international experiences reported
at the UNECE/Eurostat/OECD WSs on Metis,
• Consultancies with experts,
• Know-how available in writing a draft.
Feedback and assistance in finalizing and further
maintenance of Part B is highly welcomed.
Jana Melíšková
Slide 5
The aim of CMF Part B
To offer SMS designers an overview of existing
resources, which are likely to be applicable when
designing and implementing SMS.



Resources:
standards, concepts, models, best practices and
other methodological materials.
Focus on internationally recognized resource.
Part B is designed primarily as an Internet
publication, so that it can be kept as up to date as
possible.
Jana Melíšková
Slide 6
CMF Part B – a unique source

Part B of the CMF aims to be a unique
source of information on existing statistical
metadata standards.

It provides a single point of reference,
giving SMS designers and other potential
SMS users a basic information about
standards related to statistical metadata.

It provides for each standard links to more
detailed materials and resources.
Jana Melíšková
Slide 7
CMF Part B – groups of standards


Standards, related to statistical metadata are
presented in the following groups:
1. Statistical concepts
2. Technical standards;
3. Models and statistical practices;
4. Methodological guidelines and
recommendations.
Grouping aims to assist SMS designers decide in
which areas of SIS metadata standards should
be implemented.
Jana Melíšková
Slide 8
Template for source description












Name and version
Alternative name
Valid
Description
Intended use
Maintenance organization
ISO Standard Number
References
Relationship to other standards
Format
Language
Template last update
Jana Melíšková
Slide 9
Statistical concepts





Refer to the content of statistics. The group
encompasses internationally accepted
standards and/or recommendations:
Concepts and definitions for compiling,
disseminating and exchanging statistics;
Statistical classifications;
Statistical units;
Statistical subject matter domains;
Other standards related to statistical content.
Jana Melíšková
Slide 10
Technical standards
provide technical specifications for:






exchange, storage, documentation and retrieval of
statistical data and metadata,
other ICT supported activities, dealing with the
use of metadata for the production of statistics,
ISO international standards on Statistical Data
and Metadata Exchange (SDMX),
metadata registries,
Data Documentation Initiative (DDI),
Geographical information system (GIS)
Jana Melíšková
Slide 11
Models and statistical practices







Internationally developed models related to
statistical metadata;
Nationally developed models, recognized and
applicable internationally;
Neuchâtel Model on Statistical Classifications
and Variables;
Corporate Metadata Repository ;
the IMF Data Quality Assurance Framework
ESS standards for quality reports
others
Jana Melíšková
Slide 12
Methodological guidelines and
recommendations
A lot of methodological materials and recommendations
related to statistical metadata have been developed in the
framework of international cooperation organized by the
UNECE together with OECD, Eurostat and other
international organizations.




Guidelines for the Modeling Statistical Data and Metadata,
Guidelines for Statistical Metadata on the Internet,
Recommendations on Formats Relevant to the
Downloading of Data from the Internet,
Best Practices in Designing Websites for Dissemination of
Statistics.
Jana Melíšková
Slide 13
METADATA STANDARDS, CONCEPTS AND MODELS
1
Statistical
Concepts
2
Technical
Standards
3
Models and
Statistical Practices
4
Methodological
Guidelines and
Recommendations
4.1
Guidelines for the
Modeling Statistical
Data and Metadata
(UNECE 1995)
1.1
Statistical
classifications
2.1
Dublin Core
(ISO 15836)
3.1
Neuchatel Model
(Classifications,
Variables)
1.2
Statistical units
2.2
Data Documentation
Initiative (DDI)
3.2
CRISTAL Model
1.3
Statistical Variables
/Characteristics
1.4
Statistical Subjectmatter Domains
(UNECE)
1.5
SDMX - Metadata
Common
Vocabulary
1.6
SDMX - Crossdomain Concepts
1.7
SDMX - Crossdomain Code-lists
2.3
Metadata Registries
(ISO/IEC 11179)
2.4
Statistical Data and
Metadata Exchange
(SDMX)
(ISO/TS 17369)
3.3
Generic Statistical
Business Process
Model (GSBPM)
3.4
Corporate Metadata
Repository (CMR)
2.5
Common Warehouse
Metamodel
(ISO/IEC 19504)
3.5
Data Quality Assurance
Framework (DQAF)/
Special Data
Dissemination
Standards (SDDS)
2.6
eXtensible Business
Reporting Language
(XBRL)
3.6
ESS Standard for
Quality Reports
(ESQR)
2.7
GIS
(ISO 19115)
3.7
Nordic Metamodel
4.2
Guidelines for
Statistical Metadata
on Internet
(UNECE 2000)
4.3
Recommendations on
Formats Relevant to
the Downloading of
Data from the Internet
(UNECE 2001)
4.4
Best Practices in
Designing Websites
for Dissemination of
Statistics
(UNECE 2001)
CMF Part B – relationships between
the resources


Work has also started on defining the nature of the
relationships between individual resources. The
chart on the next slide shows the progress so far.
Several types of relationships have been identified:
•
classifies
• codifies
• compliesWith
• conformsTo
• contains
•
generalises
• instantiates
• isPartOf
• isSimilarTo
• measures
Jana Melíšková
Slide 15
CMF Part B
Relationships between resources
07 November 2015
Jana Melíšková
Slide 16
CMF Part B – some issues to be
considered
Completeness
There are many other resources that could be
described in Part B. For example:

standards indirectly related to statistical
metadata;

national standards and models.
Are all the relevant resources described?
Should any of those currently included be
removed?
Jana Melíšková
Slide 17
CMF Part B – some issues to be
considered
A second level of resource descriptions.

Some of the resources described (for example
statistical classifications and statistical units)
refer just to a family of resources.

Is there a value in developing a second tier of
resource descriptions to describe the
elements in these resource families?
An example could be resource descriptions of
the main international classifications (ISIC,
ISCO, ISCED etc.).
Jana Melíšková
Slide 18
CMF Part B – some issues to be
considered
Identification of links between resources
Each resource could have links to all
other resources.
However, some links are stronger than
others.
At present only the stronger links are
described. Is it sufficient for practical
purposes?
Jana Melíšková
Slide 19
CMF Part B – some issues to be
considered
Maintenance and update

A regular maintenance of the
resources described in the CMF Part
B is an important pre requisite for its
value to SMS designers.

To identify “owners”, responsible for
maintaining each resource description
seems to be indispensable.
Jana Melíšková
Slide 20
Conclusion





The work on the CMF Part B has progressed
significantly during the last year. It can now be
opened to the wider METIS group for comment.
Since the CMF Part B is a living document, it will never
really be finished. It should remain open for further
development and maintenance.
The important goal at present is to reach the stage
where the METIS community will be confident that, the
CMF Part B will be useful for statistical practice.
It is planned to promote CMF Part B to the wider
statistical community by the end of 2010.
The input of the 2010 METIS Work Session will help
the Task Force to reach this goal.
Jana Melíšková
Slide 21
Task Force
Sergio Bacelar (INE, Portugal)
Max Booleman (CBS, Netherland)
Alice Born (Statistics Canada)
Dan Gillman (US Bureau of Labor Statistics)
Hamish James (Statistics New Zealand)
Jana Meliskova (Consultant)
Marco Pelegrino (Eurostat)
Steven Vale (UNECE)
Jana Melíšková
Slide 22