Transcript Document
Census 2000:
Public Use Microdata Samples
and Metro Area Definitions
Presented by
Andrew Ruppenstein
September 25th, 2003
HSUG-West, Berkeley, 9/25/03
1
Introduction:
Public Use Microdata Sample (PUMS)
Files
American Community Survey (ACS)
Microdata Files
Metropolitan Areas – Old and New
2
HSUG-West, Berkeley, 9/25/03
Where Does Census Housing
Data Come From?
Decennial Census data comes from both
the short-form (100%) and the long-form
questionnaires (~1-6 households sample).
With regard to housing, the short form
yields only tenure and occupancy data.
The long form has 21 (some are multi-part)
questions concerning housing.
3
HSUG-West, Berkeley, 9/25/03
Number of Housing Tables in
Census 2000 Summary Files:
Summary Summary
Summary Summary
File 2
File 3
File 4
File 1
17
9
146
93
4
HSUG-West, Berkeley, 9/25/03
What is PUMS Data?
Public Use Microdata Sample files contain
a sample of individual housing unit and
person records from the census, along with
sample weights.
There are two released PUMS samples –
1% and 5%. The 1% file has more detailed
characteristics. The 5% offers more
geographic detail.
5
HSUG-West, Berkeley, 9/25/03
What Housing Data does
PUMS contain?
Tenure
Units in Structure
Occupancy Status
Housing Value
Gross Rent
Rooms, Bedrooms
Age of Structure
Plumbing, Kitchen
Facilities, Vehicular, and
Telephone Availability
Year Householder
Moved into Unit
Property Taxes Paid
Cost of Utilities
Cost of Fuels
Acreage
Rent
Mortgage Status
Second Mortgage Status
Property Insurance
6
HSUG-West, Berkeley, 9/25/03
Why Use PUMS?
Summary vs. detail information
Predefined tables vs. custom tabulations
Restricted variables vs. full range of
available responses
Greatest possible detail while still ensuring
confidentiality
7
HSUG-West, Berkeley, 9/25/03
Census Public-Use Microdata
Samples History
1960 Census - 1% sample created as a
prototype
1970 Census - 1% sample introduced as a
standard product
1980 Census – 1% and 5% samples
1990 Census - 1% and 5% samples
2000 Census – 1% and 5% samples
8
HSUG-West, Berkeley, 9/25/03
PUMS Geography
PUMS data is available only for a specially
delineated set of geographies, PUMAs (5%) and
super-PUMAs (1%).
PUMAs have a minimum of 100,000 population,
super-PUMAs 400,000.
Super-PUMA and PUMA boundaries do not
cross state lines.
PUMAs fit within super-PUMAs.
Boundaries are drawn with some input from the
states and local area.
9
HSUG-West, Berkeley, 9/25/03
California’s
2000 PUMAs
California has 62
super-PUMAs and
235 PUMAs.
10
HSUG-West, Berkeley, 9/25/03
PUMS California Geography
In California, PUMA boundaries were
constructed to follow city lines, where possible.
Example: PUMA 07000 = Fullerton City in
Orange County.
This holds less true for San Diego County and
the San Francisco Bay Area Counties, as local
input often chose to use tract boundaries.
Super-PUMA/PUMA boundaries and county
boundaries frequently coincide.
11
HSUG-West, Berkeley, 9/25/03
S.F. Bay Area
PUMAs
We are in PUMA
02403
12
HSUG-West, Berkeley, 9/25/03
Microsample Areas are
Designated With a Code:
Super-PUMAs (1%) have a 5 digit code, with the
first 2 digits as the state FIPS code. Example:
06050 (Solano and Napa Counties) (California’s
FIPS Code = ’06’). Range is 06010 – 06705.
PUMAs (5%) also have a 5-digit code. In
California , PUMA (5%) codes are lowest in the
north, and highest in the southern part of the
state. Range is 00100 – 08116. PUMA codes are
unique within the state, but may be repeated in
other states.
13
HSUG-West, Berkeley, 9/25/03
Where do I get PUMS data?
PUMS data is available free of charge in
downloadable form from the US Census Bureau:
http://www.census.gov/main/www/pums.html
A DVD of the data will be available from the
Census Bureau in Fall, 2003. In addition to the
data it will include the Beyond 20/20 software
for accessing the data.
DataFerret also provides online access to PUMS
data:
http://dataferrett.census.gov/TheDataWeb/index.html
14
HSUG-West, Berkeley, 9/25/03
The American Community Survey
(ACS) Microdata
Microdata sample files are also currently available for
the 1996-1998 and 2000-2001 ACS data. The
technical documentation, record layout, geographies,
and content will not be same! But the same principles
for generating custom tabulations will hold true.
ACS sample sizes are much smaller. Expect a
tradeoff in accuracy vs. timeliness when deciding
whether to use decennial or ACS PUMS data.
When using PUMS, please be sure to cite both the
census source (decennial or ACS) and the data year.
15
HSUG-West, Berkeley, 9/25/03
ACS Microdata for California
ACS coverage has slowly expanded over time.
The first tabulations available are for 1996. 2005
should be the first year for which ACS has
complete coverage of the US.
The first California counties covered by the
program were San Francisco and Tulare in 1999.
2000 has tabulations for 21 California counties.
2001 has tabulations for 24. More counties were
sampled than tabulated, however.
The 2000 and 2001 ACS microdata files for
California have no county-level detail.
For more information, please see:
http://www.census.gov/acs/www/Products/PUMS/index.htm
16
HSUG-West, Berkeley, 9/25/03
Metropolitan Statistical Areas – What
They Were, What They Are
Metropolitan Statistical Area: “A
geographic entity, defined by the Federal
OMB for use by Federal statistical
agencies, based on the concept of a core
area with a large population nucleus, plus
adjacent communities having a high degree
of economic and social integration with
that core.”
17
HSUG-West, Berkeley, 9/25/03
Changes
The standards have changed 6 times since 1950,
and so the definitions (lists of areas) for
presenting metropolitan area statistics have
changed 10 times since 1950.
The June 30, 1999 definition is used for Census
2000 products.
There is a new definition as of June 6, 2003,
which will be applied to future Census Bureau
products.
The changes over time in areas defined as
“Metropolitan” mostly reflect population growth,
not changes in the defined concept.
HSUG-West, Berkeley, 9/25/03
18
The Way Things
Were – 1999
Definitions:
3 CMSAs
12 PMSAs
12 MSAs
19
HSUG-West, Berkeley, 9/25/03
Core Based Statistical Area (CBSA) Classification
A CBSA consists of the county or counties
associated with at least one core of 10,000 or greater
population, plus adjacent counties having a high
degree of social and economic integration with the
core(s) as measured by commuting ties.
CoreBased
BasedStatistical
StatisticalAreas
Areas
Core
MetropolitanStatistical
StatisticalAreas
Areas
Metropolitan
MicropolitanStatistical
StatisticalAreas
Areas
Micropolitan
PopulationininaaCore
Core
Population
50,000or
ormore
more
50,000
10,000toto49,999
49,999
10,000
Territory not included in a CBSA is designated as Outside
Core Based
Based Statistical
Statistical Areas.
Core
Areas.
20
HSUG-West, Berkeley, 9/25/03
CBSA Building Blocks
Counties and equivalent entities throughout the United States
and Puerto Rico
Why?
Single, consistent geographic unit nationwide
Familiar geographic units
Stable boundaries
Availability of a wide range of statistically reliable data
21
HSUG-West, Berkeley, 9/25/03
CBSA Cores
Census Bureau-defined urbanized areas (UAs)
Census Bureau-defined urban clusters (UCs)—a new
geographic entity for Census 2000
Cores identify central counties of CBSAs
Central counties are those counties that:
• have at least 50% of their population in urban areas (UAs or UCs)
of at least 10,000 population; or
• have within their boundaries a population of at least 5,000 that
is located in a single urban area of at least 10,000 population
22
HSUG-West, Berkeley, 9/25/03
Aggregating Counties
Commuting data are used to qualify outlying counties.
A county qualifies as an outlying county if:
• at least 25 percent of the employed residents of the
county work in the CBSA’s central county or counties, or
• at least 25 percent of the jobs in the county are accounted
for by workers residing in the CBSA’s central county or
counties
Measures of settlement structure, such as population density and
the percentage of population that is urban, are not used to qualify
outlying counties.
23
HSUG-West, Berkeley, 9/25/03
Divisions of Metropolitan Statistical Areas
Metropolitan Statistical Areas containing at least one core with
a population of 2.5 million or greater may be subdivided to form
Metropolitan Divisions
A county will be identified as a main county of a Metropolitan
Division if:
65 percent or more of its employed residents work within the
county, and
the ratio of the number of jobs located within that county to its
number of employed residents is at least 0.75.
A main county automatically serves as the basis for forming a
Metropolitan Division.
24
HSUG-West, Berkeley, 9/25/03
2003 Metro
Standards
33 CBSAs, of which:
7 Micropolitan Areas
and
26 Metropolitan
Areas, of which:
6 are multi-county.
25
HSUG-West, Berkeley, 9/25/03
Learn More About PUMS
CENSUS
2000 Main Page:
http://www.census.gov/main/www.cen2000.html
CENSUS
2000 Support Pages:
http://www.census.gov/support/cen2000.html
CENSUS
Electronic Products Support Pages:
http://www.census.gov/tech/techtalk.html
26
HSUG-West, Berkeley, 9/25/03
Learn More About Metropolitan
Areas
CENSUS
Metropolitan Areas Intro Page:
http://www.census.gov/population/www/estimates/metro
area.htm.
27
HSUG-West, Berkeley, 9/25/03
And Finally…
For more information contact me via e-mail or
phone:
– [email protected]
– (916) 327-0103, x2526
28
HSUG-West, Berkeley, 9/25/03
Credits:
1) Slides 5 and 8 were adapted with permission from
“Tabulating Data from the PUMS Files Using Beyond
20/20 Software” by Bill Savino, Electronic Products
Development Branch, US Census Bureau
2) Slides 21-24 were borrowed with permission from
“Metropolitan and Micropolitan Statistical Area Standards”
by Colleen Joyce, Geography Division, US Census Bureau
3) The maps on slides 10, 12 and 25 were created by Cynthia
Singer, Demographic Research Unit, California State
Department of Finance
29
HSUG-West, Berkeley, 9/25/03