Metadata to Support the Survey Life Cycle Alice Born, Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (METIS) Geneva, April 3-5, 2006


Metadata to Support
the Survey Life Cycle
Alice Born, Statistics Canada
Joint UNECE/Eurostat/OECD
Work Session on Statistical Metadata
(METIS)
Geneva, April 3-5, 2006
Outline
• Description of STC’s Integrated
Metadatabase (IMDB)
• Common metadata set for a survey life
cycle
• Tools for entering metadata
• Time travel – versioning rules
• Complete model
Corporate metadata at Statistics Canada
• Integrated Metadatabase (IMDB)
– Collection of information about each of
Statistics Canada’s 560+ current surveys
– Aimed at helping users interpret statistical
data
• Survey description
• Survey instrument
• Methodology
• Data accuracy
• Variables, classifications
What is the IMDB based on?
• ISO 11179 Specification and Standardization of
Data Elements
• Corporate Metadata Repository (CMR) – USBC
(D. Gillman)
• Extension of ANSI X3.285 for the management
of statistical information (American National
Standards Institute metamodel)
Surveys - definition
• Metadata in the IMDB is organized around the
survey entity
• Refers to collection, compilation and publication
of data measuring characteristics of a population
• Three types of surveys:
• Direct
• Administrative
• Derived
Statistical Activities
• Group of surveys that share a common
feature and common explanatory text
• E.g., System of National Accounts:
The Canadian System of National Accounts (CSNA) provides a
conceptually integrated framework of statistics and analysis for
studying the state and behaviour of the Canadian economy. The
accounts are centered on the measurement of activities associated
with production of goods and services, the sales of goods and
services in final markets, the supporting financial transactions and
the resulting wealth positions.
[Diagram of the IMDB metadata model. Entity labels, in the order they appear:
Regions, Statistical Activity, Organization, Survey, Stewardship, Contact,
Universe, Documentation, Frame, Identification, Survey instance, Time Frame,
Instrument, Keyword, Question, Identification, Classification, Theme,
Data file, Methodology, Data Element, Instrument design, Sampling,
Data source, Error detection, Imputation, Estimation, Quality evaluation,
Disclosure control, Revisions and seasonal adjustment, Data accuracy,
Data Element Concept, Object Class, Property, Formula, Conceptual Domain,
Value Domain]
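The data element entities named in the model (Data Element Concept, Object Class, Property, Conceptual Domain, Value Domain) follow the ISO 11179 pattern, in which a data element pairs a data element concept (an object class plus a property) with a value domain. A minimal Python sketch of those relationships is given here. The class layout, the field names, and the example values ("Person", "Age", the age bands) are illustrative assumptions and are not taken from the IMDB schema.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ConceptualDomain:
    """Set of value meanings, independent of how they are coded."""
    name: str


@dataclass(frozen=True)
class ValueDomain:
    """Concrete permitted values that represent a conceptual domain."""
    name: str
    conceptual_domain: ConceptualDomain
    permitted_values: Tuple[str, ...]


@dataclass(frozen=True)
class DataElementConcept:
    """Object class plus property, e.g. Person / Age."""
    object_class: str
    property_name: str
    conceptual_domain: ConceptualDomain

    def label(self) -> str:
        return f"{self.object_class} / {self.property_name}"


@dataclass(frozen=True)
class DataElement:
    """A data element concept paired with one value domain."""
    concept: DataElementConcept
    value_domain: ValueDomain

    def is_consistent(self) -> bool:
        # Concept and value domain must refer to the same conceptual domain.
        return self.concept.conceptual_domain == self.value_domain.conceptual_domain


# Hypothetical example values, for illustration only.
age_groups = ConceptualDomain("Age groups")
concept = DataElementConcept("Person", "Age", age_groups)
five_year_bands = ValueDomain("Five-year age bands", age_groups, ("0-4", "5-9", "10-14"))
element = DataElement(concept, five_year_bands)
print(element.concept.label(), element.is_consistent())
```

Keeping the concept separate from the value domain lets the same concept be represented by several codings without redefining it.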
Common metadata set for survey life cycle
Statistical activity
Survey (direct, administrative, derived)
Target population (population, statistical unit)
Survey instance (each survey process)
Collection instrument
Methodology
Data accuracy
Documentation
Data file
(Data elements, value domains)
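One way to picture the common metadata set is as a small record structure. The Python sketch below is only an illustration under stated assumptions: the class and field names are hypothetical, the choice of which items hang off the survey and which off the survey instance is an assumption, and the example survey is invented.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class SurveyType(Enum):
    # The three survey types named in the slides.
    DIRECT = "direct"
    ADMINISTRATIVE = "administrative"
    DERIVED = "derived"


@dataclass
class SurveyInstance:
    """One run of the survey process for a reference period."""
    reference_period: str
    collection_instrument: str = ""
    methodology: str = ""
    data_accuracy: str = ""
    documentation: List[str] = field(default_factory=list)
    data_files: List[str] = field(default_factory=list)


@dataclass
class Survey:
    """Survey-level metadata; instances are attached per reference period."""
    name: str
    survey_type: SurveyType
    statistical_activity: str
    target_population: str
    instances: List[SurveyInstance] = field(default_factory=list)


# Invented example, not an actual Statistics Canada survey record.
survey = Survey(
    name="Example Household Survey",
    survey_type=SurveyType.DIRECT,
    statistical_activity="Example statistical activity",
    target_population="Private households",
)
survey.instances.append(SurveyInstance(reference_period="2006-03"))
print(survey.survey_type.value, len(survey.instances))
```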
Common metadata set for survey life cycle
Methodology
Instrument design
Sampling
Collection method
Error detection
Imputation
Estimation
Quality evaluation
Disclosure control
Revisions and seasonal adjustment
Common metadata set for survey life cycle
Survey
Survey Instance
- questionnaires
- variables (DE)
- methodology
- data accuracy
Common metadata set for survey life cycle
Survey
Instruments
Common metadata set for survey life cycle
Data elements
Common metadata set for survey life cycle
Methodology
Target population
Instrument design
Tools for loading metadata into IMDB
Statistical Activity - Identification Tab
Statistical Activity and Survey
- Description Tab
Survey Instance (cycle) – Time Frames
Data sources – Description
Versioning (time-travel)
• Metadata change over time – each survey
instance, survey or statistical activity
• Rules for revisions and versioning of
administered items
• Three functions:
– Create
– Update
– Version
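The three functions can be sketched as operations on an administered item that keeps its version history. This is a hedged illustration, not the IMDB implementation: the slides do not define the functions, so the sketch assumes that Update revises the current version in place and that Version appends a new version while retaining earlier ones. The class name `AdministeredItem` and its methods are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class ItemVersion:
    version: int
    attributes: Dict[str, str]


class AdministeredItem:
    """Hypothetical administered item with a retained version history."""

    def __init__(self, item_id: str, attributes: Dict[str, str]) -> None:
        # Create: the first version of a new item.
        self.item_id = item_id
        self.history: List[ItemVersion] = [ItemVersion(1, dict(attributes))]

    @property
    def current(self) -> ItemVersion:
        return self.history[-1]

    def update(self, **changes: str) -> None:
        # Update (assumed meaning): revise the current version, same number.
        cur = self.current
        self.history[-1] = ItemVersion(cur.version, {**cur.attributes, **changes})

    def new_version(self, **changes: str) -> None:
        # Version (assumed meaning): add a new version, keep the old one.
        cur = self.current
        self.history.append(ItemVersion(cur.version + 1, {**cur.attributes, **changes}))

    def as_of(self, version: int) -> ItemVersion:
        # "Time travel": look up the item as it stood at a given version.
        for item_version in self.history:
            if item_version.version == version:
                return item_version
        raise KeyError(f"{self.item_id} has no version {version}")


item = AdministeredItem("survey-example", {"title": "Example survey"})
item.update(title="Example survey (corrected)")
item.new_version(title="Example survey, redesigned")
print(item.current.version, item.as_of(1).attributes["title"])
```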
Versioning (time-travel)
Survey:
• Changes to mandate or subject of survey – new survey
(new IMDB record and new SDDS number)
• Changes to characteristics of surveys – new version of
survey
Survey instance:
• Each reference period – new version of the instance
– Currently coincides with the release of data in The Daily
– Demand for the new instance version to coincide with collection
start dates
– Central link to versioning of other administered items
(instrument, methodology and data file)
Versioning (time-travel)
Target population:
• Changes result in a new version of the survey
and target population
Statistical activity:
• Changes to program mandate or structure
(addition or removal of surveys) result in a new
version of the statistical activity
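The versioning rules on these slides amount to a lookup from the kind of change to the action it triggers. A minimal Python sketch of that lookup follows; the enum members, the action strings, and the function name `actions_for` are hypothetical labels chosen for illustration, and only the rule content comes from the slides.

```python
from enum import Enum
from typing import Dict, Tuple


class Change(Enum):
    SURVEY_MANDATE_OR_SUBJECT = "survey mandate or subject"
    SURVEY_CHARACTERISTICS = "survey characteristics"
    NEW_REFERENCE_PERIOD = "new reference period"
    TARGET_POPULATION = "target population"
    ACTIVITY_MANDATE_OR_STRUCTURE = "statistical activity mandate or structure"


# Each kind of change maps to the action(s) the slides describe.
RULES: Dict[Change, Tuple[str, ...]] = {
    # New survey: new IMDB record and new SDDS number.
    Change.SURVEY_MANDATE_OR_SUBJECT: ("create new survey record",),
    Change.SURVEY_CHARACTERISTICS: ("version survey",),
    Change.NEW_REFERENCE_PERIOD: ("version survey instance",),
    Change.TARGET_POPULATION: ("version survey", "version target population"),
    Change.ACTIVITY_MANDATE_OR_STRUCTURE: ("version statistical activity",),
}


def actions_for(change: Change) -> Tuple[str, ...]:
    """Return the actions triggered by a given kind of change."""
    return RULES[change]


for change in Change:
    print(change.value, "->", ", ".join(actions_for(change)))
```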
Applications/Software
Statistical Activity
Target population
Survey
Frame and Sample
Methodology
Survey Instance
Products (COR)
Instrument
Data File
Data elements