Compilation of Metadata, Statistics Canada

Download Report

Transcript Compilation of Metadata, Statistics Canada

Compilation of Meta Data
Presentation to OG6
Canberra, Australia
May 2011
What is meta data?
 Information used to describe other data
 Everything you need to know about a particular
set of data in order to understand and use it
 Information about concepts, definitions,
collection, processing, methodology, quality, etc.
2
What is meta data used for?
 To help the user:
•
•
•
•
To interpret, understand, analyse the data
To judge the quality of the data & the “fitness for use”
To transform statistical data into information
To facilitate comparability of data
 To support data producers:
• To retain and transfer knowledge
• To promote harmonization between data sets
• To improve collection
3
Meta data is an integral part of
quality assurance
 Elements of data quality:
 Relevance
 Accuracy
 Timeliness
 Accessibility
 Coherence
 Interpretability
4
General principles for documentation
 Provide users with the information necessary to
understand both strengths and weaknesses
 Allow users to determine whether the data meet
their needs
 Should be clear, organized, accessible
 Should be integrated wherever necessary to
support the user’s understanding
 Should be standardized, mandatory, updated as
required
5
Defining meta data content
 See IRES chapter 9 for a template
 Handout: Excerpt of the Statistics Canada
“Policy on informing users of data quality and
methodology”
 Handout: Example of meta data documentation
for Canada’s “Industrial Consumption of Energy”
survey
 What are the minimum requirements?
6
Proposed meta data content (1)
 Survey/Product name
 Objectives of survey:
• Why are the data collected?
• Who are the intended users?
 Timeframe
• Frequency of collection?
• Reference period?
• Collection period?
7
Proposed meta data content (2)
 Concepts and definitions
 Target population
• Survey universe/sampling frame
• Classifications used
 Collection method
• Direct survey (sample/census; mandatory/voluntary)
• Administrative data sources
8
Proposed meta data content (3)
 For sample surveys:
• Sample size, sampling error
• Response rates
• Imputation rates
 For administrative data:
•
•
•
•
9
Sources
Purpose of original collection
Merits/shortcomings of data (coverage, conceptual)
Processing, correction, reliability, caveats
Proposed meta data content (4)
 Error detection
• Missing data, entry errors, validity problems, edits,
reconciliation
 Imputation of missing data
 Disclosure control
• Rules of confidentiality, confidentiality analysis
 Revisions
• Policy, explanation of changes
10
Proposed meta data content (5)
 Description of analytical methods used
• Seasonal adjustment, rounding
 Other explanatory notes
• Breaks in time series
 Other supporting documents
• Questionnaires, reporting guides, procedures manuals
11
Concluding comments
 Documentation has often been the last work done and
the first work to be dropped
 But it is important on many levels
 Needs to be maintained & updated; standards and
templates help
 In the future, new surveys or changes may be meta data
driven – a growing role and importance
• To support planning, development
• To encourage harmonization, integration
12
For more information…
Andy Kohut
Director, Manufacturing & Energy Division
Statistics Canada
11th Floor, Jean Talon Building, section B-8
Ottawa, Ontario CANADA
K1A 0T6
613-951-5858
[email protected]
13