Using administrative data to compile agricultural statistics
Download
Report
Transcript Using administrative data to compile agricultural statistics
Using administrative data to compile
agricultural statistics
Experiences from the Census of Agriculture 2010
Fiona O’Callaghan, CSO
29th September 2011
Outline
•
•
•
•
•
•
•
Background
Available data
Analysis/aggregation
Merging of data
Lessons learnt
Future developments
Summary
Background
▫ CSO Statement of Strategy includes as a high level
goals
Minimise response burden, and extend the statistical
use of administrative records
Improve the scope, quality & timeliness of our
statistics
Achieve greater efficiencies using best practices
▫ SPAR Report - Statistical Potential of
Administrative Records
Background
▫ Up until 2010, CSO conducted 2 annual farm
surveys (June & Dec) with sample sizes ranging
from 15,000 to 20,000 farms. Farm Structure
Survey approx 50,000 farms.
June survey: average response burden approx. 30 mins.
December survey: average response burden approx. 18
mins.
COA 2010: average response burden 26 mins.
Available Data
• CSO identified six major data holdings within
DAFF which contained relevant information
Animal Identification and Movement System (AIM)
Single Payment System (SPS)
Organic Database
REPS database
Animal Health Computer System (AHCS)
Corporate Client System (CCS)
Available Data
▫ Three of these databases were used for COA 2010
CCS was used in developing the register
AIM was used for cattle
SPS was used for crops/cereals
Analysis/aggregation
▫ Corporate Client System
Contains name, address, DOB, herd number etc.
This was merged with the existing CSO Agriculture
Register to form a new Register for COA 2010
Issues involving the “unique” identifier, Herd Number,
resulting in duplicates
Result – 153,904 Census forms issued
Analysis/aggregation
▫ AIM – involves the use of electronic means to capture
data on animal movements through computer links
established at livestock markets, meat plants, and
export points
Data available since 2002
At an aggregate level, the AIM figures for the bovine
population have been consistently higher than the
CSO estimates for the corresponding date (on
average approx. 5% higher)
Preliminary comparative analysis performed by CSO
in 2008 and 2009
Eliminated 11 cattle questions from Census form
Analysis/aggregation
▫ AIM data consists of the following variables
Tag number
Herd number
DOB
Gender
Breed
Breed Type (Beef/Dairy)
Animal Class (Cow, bull, etc.)
Date of calving event
Analysis/aggregation
▫ Need to convert this information into totals for the
following categories
Breeding Cattle
Other Cattle
Dairy Cows
Other Cows
Male: 2 years and over
Female: 2 years and over
Dairy Heifers*
Other Heifers
Male: 1-2 years
Female: 1-2 years
Bulls
Male: under 1 year
Female: under 1 year
* Heifers in calf intended for the dairy herd
Analysis/aggregation
• Information on heifers-in-calf not available in
AIM database, but a proxy can be estimated.
• Other categories can be derived directly using
gender, DOB, animal class.
• Eurostat requires further breakdown of
categories into animals for slaughter – currently
developing a methodology to model this.
Analysis/aggregation
▫ SPS
Information on every eligible parcel of land
XY coordinates, Herd Number, Area, Use
Preliminary analysis performed in 2009
Eliminated 14 crops/cereals questions from Census
form
Analysis/aggregation
▫ SPS
XY coordinates used to assign NUTS Region codes at
farm level
Approx. 45% farms are spread over >1 DED - in these
instances the DED containing the largest area owned
was assigned
Merging of data
▫ Three separate data files
Census returns
AIM data
SPS data
▫ “Unique” identifier – Herd Number – but many
instances where one farmer could be associated
with more than one Herd Number
Merging of data
▫ Labour intensive task of matching by name &
address
Issues with Father & Son with same name & address
Different versions of names on different databases –
Seamus/James, Sean/John etc.
Non-unique addresses
▫ Farms that returned to CSO as Retired/Dead/Not
a Farm etc. but active on admin. data
Lessons Learnt
• More collaboration between DAFF & CSO
• Confidentiality – one-way transfer of
information
• Parallel pilot run in 2009
Future Developments
• Beyond the SPAR Initiative – cross sector
efficiencies
Piggy-backing on SFP online applications to
collect remaining survey items on June Survey
• Exploiting geo-coordinates
▫ To produce interactive maps
▫ To link with other databases
▫ Create area frame designs
Summary
• Positive development for Farmers & CSO
▫ Reduction in response burden – 25 questions
dropped
▫ Reduction in editing & processing of data
• Result - a high quality register of agricultural
holdings, and high quality data
Questions?