Transcript Microsoft PowerPoint Presentation, 10Mb
Slide 1
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 2
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 3
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 4
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 5
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 6
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 7
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 8
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 9
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 10
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 11
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 12
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 13
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 14
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 15
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 16
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 17
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 18
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 19
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 20
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 21
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 22
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 23
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 24
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 25
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 26
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 27
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 28
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 29
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 30
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 31
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 32
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 33
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 34
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 35
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 36
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 37
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 38
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 39
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 40
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 41
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 42
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 43
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 44
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 45
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 46
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 47
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 48
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 49
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 50
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 51
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 52
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 53
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 54
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 55
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 56
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 57
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 58
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 59
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 60
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 61
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 62
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 63
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 64
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 65
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 66
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 67
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 68
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 69
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 70
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 2
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 3
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 4
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 5
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 6
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 7
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 8
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 9
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 10
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 11
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 12
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 13
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 14
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 15
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 16
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 17
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 18
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 19
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 20
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 21
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 22
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 23
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 24
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 25
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 26
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 27
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 28
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 29
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 30
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 31
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 32
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 33
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 34
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 35
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 36
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 37
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 38
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 39
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 40
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 41
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 42
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 43
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 44
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 45
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 46
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 47
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 48
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 49
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 50
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 51
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 52
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 53
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 54
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 55
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 56
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 57
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 58
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 59
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 60
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 61
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 62
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 63
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 64
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 65
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 66
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 67
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 68
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 69
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time
Slide 70
GIS and Health
Martin Charlton1
Ronan Foley2
Dennis Pringle2
1National
Centre for Geocomputation
2Department of Geography
NUI Maynooth
Outline
• Preliminary observations on GIS
and health
• Examples of GIS in Health:
research and practice
• Mapping people not places
Patterns in Health Data
• Three main areas:
– Mapping of disease incidence
• Relative risk estimation
– Identification of disease clustering
• Surveillance
– Ecological analysis
• Relationship between incidence and
covariates
(Lawson et al, 2003)
Health Data
• Typically patient data is available
in two forms
• Individual – perhaps anonymised
to postcode/zip code level
• Aggregate – counts are available
for a set of zones
• The nature of the data affects the
nature of the analysis
Geographic Information Systems
• GIS is often confused with mapping
• A range of activities which includes
inter alia mapping and analysis
• Will typically involve several datasets
• May also involve several different
pieces of software
Mapping: EpiINFO
• This is a rather simple analysis and
mapping program
• It can be downloaded from the internet
• It doesn’t provide much in the way of
spatial data manipulation…
• … nor are its analytical capabilities very
advanced
• It is easy to use
Spatial Data Manipulation
• GIS isn’t merely concerned with
data display or query
• It’s useful in bringing together
different spatial datasets in the
first stages of an analysis
• This may well require a range of
different geometrical and
database operations
Analysis with Spatial Data
• Statistical analysis usually involves
some separate software.
• Spatial data have some properties
which mean that SPSS isn’t always
appropriate.
• There is now a range of specialist
software – much of it available for
download
GIS may be ideal for data manipulation, display and query,
but it’s not always ideal for analysis – in epidemiology a
variety of analytical software is available … WinBUGS,
GeoBUGS (MRC Biostatistics Unit, Cambridge)
Software to implement
particular methods – for
example scan statistics - can be
downloaded quickly and easily
from the Internet. Kulldorf’s
SaTScan system implements a
range of different scan
statistical models
GIS in Health in Practice
• Facility location – dialysis units
• Facility location – obs & gyn
• Data integration – schizophrenia
in Cavan
• Data manipulation and display –
GMS card holders in Maynooth
1: Provision of Dialysis Units
• There are 5 dialysis units for renal
patients in a region with a population
of about 3,000,000
• Major population centre has two units
about 5 miles apart
• Where would you locate an extra one?
• What would happen if one was
closed?
Location-allocation
• Location-allocation problems deal
with optimum location of facilities
• Attempt to minimise total
weighted travel time/distance
from users to facilities
• What data is needed?
Data requirements
• Digitised road network with classification
of road types
• Mean travel speeds on each type for
urban/non-urban roads
• Extents of urban areas
• Anonymised postcoded patient records
• Postcode->grid reference lookup
• Locations of hospitals in region (both with
and without dialysis units)
GIS processing
• Data integration
• Determine which segments of road network
are urban
• Determine travel times using road type
urban/non-urban speeds
• Tally numbers of patients at each node in
the road network
• Find nearest nodes in network to each
hospital
• Code hospitals as ‘existing’ or ‘potential’
units
Modelling/Analysis
• Carried out using standard GIS
functions
• Convenient ‘front end’ written
using system’s macro language
(save a lot of typing)
• Explore a range of alternatives
Hospital with
dialysis unit
Study region showing hospital locations and urban areas
Patients requiring dialysis (tallied to road nodes)
Classified road network – includes roads outside study area
•
Some alternative
scenarios:
(a) Optimal allocation
of patients
(b) Provide 1 extra
unit
(c) Provide 2 extra
units
(d) Close 1 unit in
city and reallocate
its patients
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
integrate data
process data
model and analyse
generate alternative scenarios
display the results
2: Facility Location
• Two hospitals both provided facilities
for obstetrics, gynaecology, paediatrics
and a special care baby unit.
• Was there a case for rationalisation and
improving the facilities at one hospital
only?
• Would closure disadvantage any
patients?
Data
• Locations of hospitals in study region
• Anonymised postcoded finished
consultant episodes for each of the
four specialities at each hospital
• Classified road network
• Speeds associated with each road
type
Data preparation
• Postcoded records allocated grid
references using Central Postcode
Directory (POSTZON)
• Vector data converted to raster
• Patient/specialty/hospital counts
produced for each raster
• Passage times computed for each
raster cell based on road
classification and speed estimates
Roads, urban areas, patient locations, hospitals in study area
Processing
• Travel time surfaces computed for
each hospital
• Each raster cell contains time
required to reach hospital in minutes
• Assumed walking speed in ‘non road’
areas
Travel time surface for hospital S – darker = shorter time
Travel time surface for hospital T
Time differences
• If we subtract the travel time surfaces areas
nearer one hospital will have positive
values for the difference, and areas nearer
the other will have negative ones
• Boundary between positive and negative
values is boundary of catchment for each
hospital.
• If you’re on the boundary (difference is
zero), it doesn’t matter which hospital you
choose
Roads, patient locations, hospital catchments
Indifference
• If we identify those rasters where the time
difference (ignoring the sign) is, say, 5 –
people in those areas are in the 5 minute
‘indifference’ zone.
• Patients might not be worried about a 5
minute increase in their travel time, they
might be worried about 30 minutes
• Indifference zones can be computed quite
easily
15-minute indifference zone between the two hospitals
Mean access time
• For each cell in the raster we can
compute the product of the number
of patients and the time required to
reach each hospital
• Dividing this by the number of
patients gives a patient weighted
average time for each hospital
Average travel times for the various specialties. There
isn’t much to chose between specialities with the
exception of paediatric episodes.
Role of GIS
• Used
• Used
• Used
• Used
• Used
to
to
to
to
to
gather data
integrate data
process data
model data
display results
• All carried out using raster
operations
3: Schizophrenia in Cavan
• In 1996 Pringle, Waddington and
Youssef reported on evidence for the
causes of schizophrenia
• Ecological analysis of variation among
EDs
– Cases varied from 0-11 per ED
– Populations varied from 250-2750
• Poisson probabilities suggested that
one area had an unusually high number
of cases – these can be mapped
Male cases
superimposed on plot
of prevalence by ED
Male cases
superimposed on
rectified Landsat
image with ED
boundaries
4: GMS card holders: Maynooth
•Data on GMS – Medical Card holders
drawn under strict data protection
protocols from DOHC
• Anonymised address lists data matched
with GeoDirectory
•Address matching rate of around 77% in
Maynooth
• Interesting range of results around
average for the whole ED
•ED – single value 14.6 % (national average around 30%)
•Small Area – Range from 2% to 70% - noticeable internal variation and
good indicator of elderly or poor populations
•Strategic value in comparing datasets
•Important in policy analysis
Mapping invites comparison
Role of GIS
• Data integration
– Tally of cases in small areas requires
data on locations of cases and small
area boundaries
– ‘spatial join’ places small area codes
on the case records
– Counting is a standard database
operation
• Data display
Finally
• GIS encourages you to think about
space
• In a ‘conventional’ map of Ireland
areas (EDs, Counties…) are shown in
proportion to their physical size
• There are alternative ways of
displaying maps…
In a cartogram the areas are drawn
proportional to their populations –
this is a “density-equalised” map
projection
A map of the boundaries of counties and
electoral divisions – the areas are drawn
proportional to their physical size
By comparison, the more deprived
western area of Dublin, northern area
of Cork and south east of Limerick are
clearly apparent in the cartogram.
Haase’s deprivation index suggests that the
most deprived populations are in peripheral
rural areas in the north west.
Redrawing the data on the cartogram
reveals wide variation in the SMR, and also
uncovers the urban dimension which is
hidden in the conventional display
Patient data matched to Electoral Divisions –
hospital admissions for psychiatric problems – the
patchwork effect does not suggest any pattern
The cartogram reveals the urban nature
of the problem – those areas which are
more deprived are also those with
higher levels of illness/disability
Residents aged 15+ who are unable to work
through illness or disability: the map pattern
suggests that might be a largely rural
phenomenon
The roads, rail, and urban areas data have
been transformed into the same space as
the cartogram. Dublin and Cork seem
poorly provided with roads…
Other spatial information can be transformed
to the space of the density equalised
projection. Here is the familiar map of main
roads and towns.
Using a cartogram we see that the
incidence of lung cancer is randomly
spatially distributed
Lung cancer in males in New York State [Gastner & Newman: 2004]
Plotting locations of lung cancer cases, we
obtain a map which follows the underlying
population distribution – there appear to be
“clusters”
What about Ireland?
• Potentially hampered by challenge of
geocoding patient records
• Lack of small area population data
– EDs have an average population of 1150
– Wide variation in size (50-28000)
– Socially inhomogenous
Rural addresses
• A major challenge is non-uniqueness
of addresses in rural areas.
• For instance in Moy ED, 11 properties
share the address
‘Lackamore, Lahinch, Co. Clare’
What’s needed
• Postcoding should provide a
mechanism for geocoding patient data
with the granularity required for
planning the delivery of health care –
this has been the case in the UK for a
for long time
• Census data should be made available
for areas with small and similar
populations (cf: UK Output Areas)
Finally…
• GIS has an important role to play
in health studies
• Health data analysis is a rapidly
developing field
• However, in Ireland we are
hampered by data deficiencies –
now is the time for change
Maynooth: image in Google Earth
Central Dublin: image in Google Earth
Taoiseach’s Office: image in Google Earth
Downing Street, London: image in Google Earth
Maynooth: OSi orthophotograph added locally uisng KML
Sellafield: image in Google Earth
NCU – On Line Atlas of Cancer Mortality
The Philippine Institute
for Development
Studies makes an online atlas of socioeconomic indicators
available – contains
some health measures
The future
• Patient empowerment…
• GIS used to merge information from
regulatory agencies on 19196 landfill sites
• Sites buffered to 2km and postcodes
within buffer identified
• Postcoded live births (8.2x106), stillbirths
(43471) and congenital anomalies
(124,597) between 1982 and 1997
• Poisson regression modelling undertaken
• Evidence found for excess risk of
congenital anomaly and low birthweight
• Distance from each
census ward to
nearest hospital
computed
• Map shows number
of patients waiting
longer than 6
months for elective
inpatient care per
available and
unoccupied bed
within 60 minutes
travel time