Data Archiving
Download
Report
Transcript Data Archiving
Multiple Indicator Cluster Surveys
Data Interpretation, Further Analysis and
Dissemination Workshop
Data Archiving
Look familiar?
Today’s storage medium
is no longer file cabinets
and storage rooms...
What is archiving and why do it?
Data archiving: rationale
Collecting data is expensive. Data should be used
beyond producing basic report.
Survey microdata are valuable resources for
government departments and academic researchers.
Survey microdata constitute valuable and
irreplaceable assets which should be managed in a
way that encourages their widest possible use and
re-use.
At the same time, data collectors main focus should
be protecting respondents while making microdata
assessable.
Role of MICS data archiving
Preservation
Documentation
Anonymization
Dissemination of microdata
Why preserve data?
Availability for future analysis
Examples:
Able to examine trends if you have a series
of datasets over time
Allows in-depth analysis of important
subject areas
Why document data?
Difficult to process without documentation
Impossible to analyze correctly
Data are useless without
documentation
Also allows reuse of documents for future
surveys
Why anonymize data?
Protecting the confidentiality of survey respondents
- not only because of legal and ethical mandates, but
also because of the overall public perception that
trust is an important contributor to data quality and
response rates.
Protecting confidentiality necessitates some sort of
data anonymization so that individual respondents
can not be identified.
Finding balance between protecting the data and
compromising its quality.
Why disseminate data?
Allows wider use of data
Allows others to re-analyze data
may bring to light issues that may not have been picked up
in the original processing.
Permits cross-national comparison of data
Permits further analysis of data (to be discussed later)
Increases acceptability of data
Through transparency
Demonstrates confidence in survey
Who do we disseminate to?
The real value of statistical data is in their use
Potential users of the data are:
•National Policy-makers
•International organizations
•NGO’s
•Academic community
•Private enterprises
•Others?
What do users expect?
Well documented data
Comprehensive
Clear, consistent, easy to use data
Information to be able to
Fully understand the survey, especially
• Sample design, selection and weighting
• Field procedures
• Data processing
• Datasets
Accurately analyze and use data
When to archive?
Start archiving when you start the survey
Typically, datasets are documented
after completion of the survey
MICS Documents to Archive
Survey plan and budget
Questionnaires
Pre-test report
Interviewer & field manuals
Sample selection, listing information
Excel files, other supporting documents
Explanations of any event(s) that impacted the sample
selection, survey dates, etc
Steering/technical committee ToRs and minutes
Any other documentation that will help people understand what
happened in the design and implementation of the survey
MICS Programmes to Archive
Software Programmes & Files:
• CSPro dictionaries
• CSPro data entry & menu systems
• CSPro secondary editing & menu systems
• CSPro GPS Entry program
• CSPro export & SPSS recode
• SPSS tabulation programmes
• Any country-specific programmes
MICS Datasets to Archive
• SPSS Files:
‒ Household (HH)
‒ Household listing (HL)
‒ Insecticide-Treated Nets (TN, if included in survey)
‒ Women (WM)
‒ Birth History (BH if included in survey)
‒ FGM (FG, if included in survey)
‒ Children age 0-4 years (CH)
‒ Men (MN, if included in survey)
‒ Maternal Mortality (MM, if included in survey)
• CSPro Files:
‒ Final (merged, not individual clusters) HH data file
‒ GPS data (if included in survey)
MICS Reports and Dissemination Materials
to Archive
Key Findings Report
Final Report
Presentations
Factsheets
Press releases
Videos
And everything else!
Archives for internal and external audiences
If some survey information is considered
sensitive or for internal use only then at
the end of the survey process it is
advisable to create two separate
versions of the MICS archive.
Restricted MICS
Archive
(complete archive but
only accessible
internally)
Public MICS
Archive
(with some
documents and
information deleted)
How do users access data?
Types of data
Indicators
DevInfo, other databases, …
Tables, Graphs, Maps
Reports, excel files, DevInfo
gallery, …
Datasets, etc.
Formats
Print
CD-Rom/DVD
Web
Etc.
But how to get the data?
Online
Write to data producers
Conditions on access?
•
Freely available or restricted?
How to Archive?
Creating the archive?
Recommend to use an archiving tool:
International Household Survey Network’s
(IHSN) Microdata Management Toolkit
http://www.ihsn.org/toolkit
Toolkit components
Metadata Editor (also known as Nesstar Publisher)
Documents survey data in accordance with
international standards
CD-Rom Builder
Generates CD-Rom and web output
Nesstar Explorer
For viewing metadata and re-exporting data to
various formats
Step 1: The Metadata Editor
All data and documents are first archived in
this software.
Template-driven specialized editor
Imports and exports data to the most
common statistical formats
Integrated interface, multi-lingual support
Compiles metadata and data in single file
Installation
\Archiving\IHSN Toolkit Full Package\Nesstar
Publisher v4.0.9
Install by running:
NesstarPublisherInstaller_v4.0.9.exe
Nesstar Publisher – first time
Open the Nesstar Publisher
Select Documentation, Template
(or T from the menu buttons)
Import Archiving\IHSN Toolkit Full Package\IHSN
Metadata Templates
• Select IHSN_1.6_en.NesstarTemplate the Study
Templates
Click Use to use this templates
Select Documentation, Template
Click Resource Description Templates
Import Archiving\IHSN Toolkit Full Package\IHSN
Metadata Templates
Select IHSN_1.5_en.NesstarRDTemplate from the
Resource Description Templates
Click Use to use this templates
Modifying the example for your survey
Select My Projects
File, Add New Study (Ctrl-N)
Document description
Study description
Datasets
Variable groups
External resources
Import datasets
Click on Datasets
Add datasets (Ctrl-I)
•
•
•
•
HH
HL
WM
MN
•
•
•
•
CH
BH
MM
FG…
Add LN variable to HL file (needed for relations)
Insert variable (before HL1)
Name: LN, Label: Line number, Width: 2
Click on Data Entry
Select Data, and uncheck ‘Write protected’
Copy all cells of HL1 to LN
Select Data, and check ‘Write protected’
Completing datasets
For each dataset, check key variables and
relations
Add LN to HL files list of key variables
For each dataset, click variables, resequence
Validate dataset relations
Import descriptions from generic MICS
example
Documentation, Import, from study
Check all boxes, except
•
•
•
•
variable information
variable definitions
categories
keys and relations
Select Generic MICS 5 v1.0
This imports all of the generic document
description, study description, variable groups
Does not import external resources
Save As
..\My survey MICS Izmir WS v1.0
Update for your survey
Document description
Study description
Dataset description
Key variables and relations
Questionnaire wording
Variable groups
External resources
Thank You
30