Sampling Overview
Download
Report
Transcript Sampling Overview
Multiple Indicator Cluster Surveys
Survey Design Workshop
Sampling Overview
MICS4 Survey Design Workshop
Introduction
• MICS is multi-national survey program
– Consistency and comparability between surveys is
important
• MICS based on national stratified multi-stage sample
• Workshop session covers highlights of MICS
sampling, including:
– Identifying effective sampling frame and sample design
– Determinants and calculation of sample size
MICS4 Survey Design Workshop
Importance of correct sampling
procedures
• Sample design will affect inferences of survey
results
• Coverage of frame and appropriate
probability sampling are critical
• Accuracy of survey results depends on both
sampling and non-sampling errors
MICS4 Survey Design Workshop
Key concepts in proper sampling
• Use accepted probability techniques at every
stage of selection
• Use of simple sampling methodology
• Use the latest census as the sampling frame
when available
• Sampling in two stages
• Use of clusters of moderate size
• Use of an adequate sample size
MICS4 Survey Design Workshop
MICS4 prototype sample design
• Manual presents different sampling options
• All options involve two or three-stage cluster
sampling
– Primary sampling units (PSUs) – census enumeration areas
– Listing in sample clusters or use of compact clusters
– Sample households selected from listing
• Option for each country depends on type of frame
and availability of existing samples
MICS4 Survey Design Workshop
Sampling option 1 – use existing
sample
• Most cost-effective approach
• Determine whether existing sample is
appropriate
– Probability sample
• Known probabilities of selection at each stage
– Nationally representative
– Sufficient number of sample PSUs for MICS
– Example – 2008 Mozambique MICS3
MICS4 Survey Design Workshop
Sampling option 1 – continued
• Determine when listing of households in sample EAs
was conducted
– Listing can be used if less than 1 year old
– Generally select new sample of households from listing
– Possible to use selected households from other survey
• Consider response burden, analytical needs for both surveys
• Conduct new listing in existing sample or subsample
of EAs
• Availability of master sampling frame in some
countries
MICS4 Survey Design Workshop
Sampling option 2 –
standard segment design
• Based on standard segments of about 500 persons or 100
households
• Effective procedure when some EAs are too large for listing
• Three-stage sample design
• Measure of size for each EA – calculated number of segments
• EAs selected systematically with PPS
• Larger sample EAs with 2 or more segments subdivided
• One segment selected in each EA for listing, enumeration
MICS4 Survey Design Workshop
Sampling option 3 –
modified segment design
• Three-stage sample design
• Small compact clusters with about 20-30 households
• Measure of size of EAs – calculated number of
compact clusters
• EAs selected systematically with PPS
• Mapping and segmentation in all sample EAs
• One compact cluster selected in each sample EA for
listing, enumeration
MICS4 Survey Design Workshop
Relationship of sampling with other
aspects of survey planning
• Survey budget – may determine maximum
sample size
• Sampling plan should also take into account
required timing of MICS results
• Estimate costs and time related to PSUs –
transportation, listing
• Costs related to sample households – cost of
interviewing (days of enumeration)
MICS4 Survey Design Workshop
Sampling and fieldwork organization
• Effective balance of number of sample PSUs
and cluster size
• Dispersion of sample affects transportation
costs, travel time
• Size of team of enumerators and supervisor
• Interviewer workload
MICS4 Survey Design Workshop
Factors that affect sample size
•
•
•
•
•
Budget and resource constraints
Timing and logistical considerations
Select key indicators
Required level of precision
Need for sub-national estimates
– Regions, provinces, urban/rural domains
MICS4 Survey Design Workshop
Selecting key indicators
• Person variables (rates)
• Important indicator that will yield the largest
sample size
• Estimated rates between 15 and 20%
• Target population that comprises a small
proportion of the total population
– Children aged 12 to 23 months
MICS4 Survey Design Workshop
Sample size determinants –
sampling parameters
• Level of precision – standard error, coefficient of
variation (CV)
• Acceptable level of error - margin of error, relative
margin of error
• Level of confidence – 95%
• Design effect (DEFF) – depends mostly on level of
clustering in the sample
• Average household size
• Proportion of base population in total population
• Adjustment for non-response
MICS4 Survey Design Workshop
Use of previous MICS or DHS results
• Review sample design of previous survey
• Examine sample size and allocation to
geographic domains
• Study level of precision for key indicators
– Design effects
– 95% confidence intervals
• Results can be used to determine sample size
and allocation for new MICS
MICS4 Survey Design Workshop
Formula for sample size calculation
• Two similar spreadsheet templates for MICS4
– Entry cell for each sampling parameter
– Formula for calculating required number of
sample households
• Presentation and discussion of spreadsheets
MICS4 Survey Design Workshop
Sample Size Formula
n
4r (1 r )( deff )(1.1)
_
2
(. 12 r ) ( p)( n)
where
– n is the required sample size, expressed as number of households, for the KEY
indicator
– 4 is factor to achieve 95 percent level of confidence,
– r is anticipated prevalence rate for key indicator,
– 1.1 is factor to raise sample size by 10 percent for potential nonresponse,
– deff is shortened symbol for design effect,
– 0.12r is margin of error to be tolerated, defined as 12 percent of r (12 percent
thus represents the relative margin of error),
– p_ is proportion of total population that smallest group comprises, and
– n is average household size.
MICS4 Survey Design Workshop
Sample size reference table
• Table 4.3 in Chapter 4 of MICS manual
• Can be used for specific set of parameters:
–
–
–
–
–
Target population is 2.5% of total population
DEFF = 1.5
Nonresponse rate is 10%
Relative margin of error is 12%
95% level of confidence
MICS4 Survey Design Workshop
Sample Size (Households) to Estimate Coverage
Rates for Smallest Target Population
Average
Household Size
(number of
persons)
4.0
4.5
5.0
5.5
6.0
estimated rate,
r = 0.25
estimated rate,
r = 0.30
estimated rate,
r = 0.35
estimated rate,
r = 0.40
13,750
12,222
11,000
10,000
9,167
10,694
9,506
8,556
7,778
7,130
8,512
7,566
6,810
6,191
5,675
6,875
6,111
5,500
5,000
4,583
Use this table when your
1. Target population is 2.5% of total population; this is generally children 12-23 months
old
2. Sample design effect, deff, is assumed to be 1.5 and nonresponse is expected to
be 10 percent
3. Relative margin of error is set at 12 percent of estimate of coverage rate, r
MICS4 Survey Design Workshop
Number of PSUs and cluster size
• More PSUs and smaller clusters
– Statistically more efficient – greater dispersion of
sample
– More costly – transportation, listing
• Minimum number of PSUs - 300 to 400 at
national level
– Minimum of 30 PSUs for subnational domains
• Number of sample households per cluster
generally in range of 15 to 30 households
MICS4 Survey Design Workshop