Document 7670679

Download Report

Transcript Document 7670679

EPUNet Training Course
2005
Day 2
Tutors:
Olaf Jürgens and Christian Schmitt
Berlin, April 11th to April 15th
2005
Overview
 Cross-Sectional Matching
 Transcription Routines (Raw Data into
Statistics-Package-Format)
 Exploring Data-Sets
EPUNet 2005
Training Course
ECHP Overview - Data Sets
Contents of the ECHP UDB
 Personal File
 Household File
 Register File
 Relational File
Above: One file per wave
 Country File
 Link File
ECHP Overview - Data Sets
Contents of the ECHP UDB
Personal File
 All person related information in the ECHP
 Only adult respondents
 Similar variable structure across countries
 Similar variable structure across waves
ECHP Overview - Data Sets
Contents of the ECHP UDB
Household File
 Unit of analysis: Household
 General information that is applicable for all
household members
 Similar variable structure across countries
 Similar variable structure across waves
ECHP Overview - Data Sets
Contents of the ECHP UDB
Register File
 Unit of analysis: persons
 General information
 Regardless of age or participation
 Similar variable structure across countries
 Similar variable structure across waves
ECHP Overview - Data Sets
Contents of the ECHP UDB
Relational File
 Unit of analysis: persons (caution: repeated
observations of persons!)
 Display of the relation-matrix of persons
within a household
 Kinship, parent-child relationships,
sibblings, etc.
 Similar variable structure across countries
 Similar variable structure across waves
ECHP Overview - Data Sets
Contents of the ECHP UDB
Country File
 Unit of analysis: country
 Display of general country specific
information (PPP, Exchange rates,
Population)
 Single file
 One set of variables per wave
ECHP Overview - Data Sets
Contents of the ECHP UDB
Link File








“Heart” of the ECHP
Unit of analysis: persons
Regardless of age or participation
General information for cross-sectional and
longitudinal matches
Household membership in a given wave
Sampling information
Weighting information
Single file
Cross-Sectional Matching Procedures
Cross Sectional Matches
 Person-level Matching
 Household-level Matching
 Relational Matching
Central identifiers for all of the above:
 country
 pid/hid
Base for all matches: ECHP Link File
EPUNet 2005
Training Course
Cross-Sectional Matching Procedures
Matching: Logical order
 First:
country variable (country)
 Second: household identifier (HID)
 Third:
personal identifier (PID)
Always use this logical order!
‘sort by country hid pid’
(hid may be left out if no household based
information is included in the data generation)
Base for all matches: ECHP Link File
EPUNet 2005
Training Course
Cross-Sectional Matching Procedures
Person-level matching I - Same Individual
 country
 pid
Examples of matches
 Matching information of one Person across
files
o Using personal information from the Register File
and the Personal File
o Adding information from the Personal File to the
Link File
Cross-Sectional Matching Procedures
Person-level matching - Across Individuals
Examples of matches
 Matching information of a child to the mother - Unit
of analysis: Mother; additional child related
information
 Matching information of a husband to his wife Unit of analysis: Wife; additional information of the
husbands income, e.g.
Central information stored within the
Relation-File
Cross-Sectional Matching Procedures
Identifiers within Files:
Unit of analysis
Basic info / File structure
Personal File
(Register File)
Country
PID
(HID)
Link File
Country
HID[waveN]
PID
Linking across waves
Person Level Information
Country File
Country
General country specific info
Household File
Country
HID
Household Level Information
Relationship File
Country
PID
(HID)
Linking across individuals
Lab Session Day 2
Transcription Routines
Transformation from PDB to UDB
Exploring Data Sets
Cross-sectional matching procedures
Transcription Routines (Raw Data
into Statistics-Package-Format)
From PDB to UDB
 ‘Raw’ ECHP data comes in comma
separated ASCII format.
 ‘Raw’ ECHP data comes without any
labels!
 Transcription Routines for SPSS
In SPSS: open syntax file:
EPUNet 2005
Training Course
Transcription Routines (Raw Data
into Statistics-Package-Format)
From PDB to UDB
 ‘Raw’ ECHP ASCII format without
 Transcription Routines for SPSS:
In SPSS: open syntax file:
‘UDB_readin.SPS’
‘UDB_label.SPS’
Adjust pathnames to fit your file structure
EPUNet 2005
Run
Training Course
Transcription Routines (Raw Data
into Statistics-Package-Format)
From PDB to UDB - Result:
 ECHP UDB Files in SPSS’ *.sav - format
 Link File (1 file ‘ulink’)
 Personal File (‘pfile[n]’ wave 1 to 8)
 Household File (‘hfile[n] wave 1 to 8)
 Register File (‘rfile[n] wave 1 to 8)
 Relationship File (‘relat[n] wave 1 to 8)
EPUNet 2005
 Country File (1 file ‘ctryvars’)
Training Course
Exploring Data Sets See Doc-Pan 166
Personal File
 Open Pfile
Get file = ‘X:\path1\a_w8p.sav’.
Descriptives variables PE001.
For self defined employment status
 Continue with ‘a_w7p.sav’, ‘a_w8h.sav’,
‘ulink.sav’, etc.
EPUNet 2005
Training Course
Exploring Data Sets
Personal File - contents








EPUNet 2005
Training Course
Demographic information
Employment and activity
Calendar of activities
Income
Educational attainment
Current education and training
Health/Care
Migration
 Satisfaction
Exploring Data Sets
Household File - contents






EPUNet 2005
Training Course
Demographic information
Household income
Household related benefits
Accommodation and housing situation
Durables
Persons in household.
Exploring Data Sets
Register File - contents
 Panel specific information (personal identifier
[PID], household identifier, weights, etc.
 Demographic information (age, sex, etc.)
EPUNet 2005
Training Course
Exploring Data Sets
Relationship File - contents
 Always lists two persons per case!
 Central: relation between person one and person
two (pid1 relation pid2)
EPUNet 2005
Training Course
Exploring Data Sets
Country File - contents
EPUNet 2005
Training Course









One record for each country/panel
One block of variables for each wave
RATE: Exchange rates in Euro
PPP: Purchasing power parities
POPTOT: Total population in private
households
POP16P: Number of persons aged 16+ living
in private households
POPHHD: Number of private households
Exploring Data Sets
Link File - contents
 General structural information for linking
households and individuals within and across
waves
 All panel household members (regardless of age
or panel participation)
 Basic demographic information
 Cross sectional and longitudinal weights
 Sample status
 Wave specific household identifiers
EPUNet 2005
Training Course
Exploring Data Sets
General structure:
 Identical naming of variables across waves
 First letter of variable describes file (‘P’ for
Personal File)
 Second letter of variable describes module (‘PM
for Personal File, module migration)
 Subsequent numbers describe exact information
(PM001 for Personal File, module migration 001
for migration trajectory)
EPUNet 2005
Training Course
Exploring Data Sets
Getting information:
Central tool: Codebook (Doc-Pan 166) containing:
 List and description of all ECHP UDB variables
and
information on availability and comparability of
variables
 across countries and
 across waves
EPUNet 2005
Training Course
Files to use






Personal File
Household File
Register File
Relationship File
Country File
Link File
 UDB_readin.sps
 UDB_label.sps