Analysis and Modeling in GIS 11/7/2015 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals.
Download ReportTranscript Analysis and Modeling in GIS 11/7/2015 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals.
Analysis and Modeling in GIS
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 1
GIS and the Levels of Science
Description: Using GIS to create
descriptive models
of the world --representations of reality as it exists.
Analysis: Using GIS to answer a question or test an hypothesis.
Often involves creating a new conceptual
output
layer, (or table or chart), the values of which are some descriptive
input
layer.
transformation
of the values in the --e.g. buffer or slope or aspect layers Prediction: Using GIS capabilities to create a
predictive model
of a real world process, that is, a model capable of reproducing processes and/or making predictions or projections as to how the world might appear.
--e.g. flood models, fire spread models, urban growth models 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 2
The Analysis Challenge
•
Recognizing which generic GIS analytic capability (or combination) can be used to solve your problem:
–
meet an operational need
–
answer a question posed by your boss or your board
–
address a scientific issue and/or test a hypothesis
Send mailings to property owners potentially affected by a proposed change in zoning Determine if a crime occurred within a school’s “drug free zone” Determine the acreage of agricultural, residential, commercial and industrial land which will be lost by construction of new highway corridor Determine the proportion of a region covered by igneous extrusions Do Magnitude 4 or greater sub-oceanic earthquakes occur closer to the Pacific coast of South America than of North America?
Are gas stations or fast food joints closer to freeways?
3 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
• • •
Availability of Capabilities in GIS Software
Descriptive Focus: Basic Desktop GIS packages
– Data editing, description and basic analysis – ArcView – Mapinfo – Geomedia
Analytic Focus: Advanced Professional GIS systems
– More sophisticated data editing plus more advanced analysis – ARC/INFO, MapInfo Pro, etc.
Provided through extra cost Extensions or professional versions of desktop packages
Prediction: Specialized modeling and simulation
– via scripting/programming within GIS » VB and ArcObjects in ArcGIS » Avenue scripts in ArcView 3.2
» AMLs in Workstation ARC/INFO (v. 7) Write your own or download from ESRI Web site – via specialized packages and/or GISs » 3-D Scientific Visualization packages » transportation planning packages e.g TransCAD » ERDAS, ER Mapper or similar package for raster Capabilities move ‘down the chain’ over time.
In earlier generation GIS systems, use of advanced applications often required learning another package with a different user interface and operating system (usually UNIX).
Description and Basic Analysis
(Table of Contents)
•
Spatial Operations
Vector – spatial measurement – Centrographic statistics – buffer analysis – spatial aggregation » redistricting » regionalization » classification – Spatial overlays and joins Raster – neighborhood analysis/spatial filtering – Raster modeling
•
Attribute Operations
– record selection » tabular via SQL » ‘information clicking’ with cursor – variable recoding – record aggregation – general statistical analysis – table relates and joins 5 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Spatial operations:
Spatial Measurement
• • • • • •
Spatial measurements: distance measures
– between points – from point or raster to polygon or zone boundary – between polygon centroids
polygon area polygon perimeter polygon shape volume calculation
– e.g. for earth moving, reservoirs
direction determination
– e.g. for smoke plumes ArcGIS geodatabases contain automatic • • •
Comments: Cartesian distance via Pythagorus
d ij
where:
(
X i
X j
) 2 (
Y i
d = arc distance possible distance metrics:
– straight line/airline – – – city block/manhattan metric distance thru network time/friction thru network
Y j
) 2 Used for projected data by ArcMap measure tools
Spherical distance via spherical coordinates Cos d = (sin a sin b) + (cos a cos b cos P) a = Latitude of A b = Latitude of B P = degrees of long. A to B
Used for unprojected data by ArcMap measure tools variables: •
shape often measured by:
shape.length:
line length or polygon perimeter perimeter area x 3.54
= 1.0 for circle = 1.13 for square Large for complex shape
shape.area:
polygon area Automatically updated after editing.
For shapefiles, these must be calculated •
Projection affects values!!!
Distances depend on
e.g. by opening attribute table and applying
Calculate Geometry
to a column (AV 9.2)
projection. Perimeter to area ratio differs
6 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
SHAPE Polygon Polygon Polygon Polygon Polygon Polygon Polygon Polygon Polygon Polygon Polygon Polygon
Spatial operations:
Spatial Measurement
AREA 0.265
0.368
0.209
0.072
0.233
0.233
0.299
PERIMETER CNTY_ 2.729
2.564
2.171
2605 2545 2680 2.642
1.941
1.941
2.278
2899 2335 2103 2870 CNTY_ID NAME 2605 Anderson 2545 Andrews 2680 Angelina 2899 Aransas 2335 Archer 2103 Armstrong 2870 Atascosa FIPS 48001 48003 48005 48007 48009 48011 48013 Shape Index 1.50
1.19
1.34
2.78
1.14
1.14
1.18
0.224
0.222
0.368
0.072
1.900
1.889
2.580
1.421
2471 2481 2106 2386 2471 Dallas 2481 Dawson 2106 Deaf Smith 2386 Delta 48113 48115 48117 48119 1.13
1.13
1.20
1.50
Area and Perimeter measures are automatically maintained in the attributes table for a Geodatabase or coverage. For a shapefile, you need to apply
Calculate Geometry
to an appropriate column in the attribute table (or convert to a geodatabase) . The
shape index
can be calculated from the area and perimeter measurements. (Note: shapefile and shape index are unrelated) 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 7
Spatial Measurement:
Calculating the Area of a Polygon
Area=(2 x 4)/2=4 4,7 Area=(3 x 4)=12 7,7 A = B
-
C
=
-
7,3 2,3 0 5 0 5 10 0 5 10 6,2 Area=(5 x 1)/2=2.5
0 5 The area of the above 10 polygon is 18.5, based on dividing it into rectangles and triangles. However, this is not practical for a complex polygon.
Area of triangle =
The actual algorithm used obtains the area of
A
by calculating the areas of
B
and
C
, and then subtracting.
The actual formulae used is as follows:
i
1 2 3 4 5 i n
X
2 4 7 7 6 2 1 (X 2 X 1 ) ( Y 1 Y 2 )/2 Its implementation in Excel is shown below.
Y
3 7 7 3 2 3
X 2 -X 1
2 3 0 -1 -4
(base x height)/2
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
(Y 1 +Y 2 )/2
5 7 5 2.5
2.5
product
10 21 0 -2.5
-10
sum
10 31 31 28.5
18.5
8 10
• • • •
Spatial Operations:
Centrographic Statistics
Basic descriptors for spatial point distributions Two dimensional (spatial) equivalents of standard descriptive statistics (mean, standard deviation) for a single-variable distribution
Measures of Centrality (equivalent to mean) – Mean Center and Centroid Measures of Dispersion (equivalent to standard deviation or variance) – Standard Distance – Standard Deviational Ellipse
Can be applied to polygons by first obtaining the centroid of each polygon Best used in a
comparative context
to compare one distribution (say in 1990, or for males) with another (say in 2000, or for females)
9 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Centroid and Mean Center
• • •
balancing point for a spatial distribution
– analogous to the mean – single point representation for a polygon (centroid)
X
– single point summary for a point distribution (mean center)
i n
1
X i n
,
Y
i n
1
n Y i
– can be weighted by ‘magnitude’ at each point (analogous to weighted mean) – minimizes squared distances to other points, thus ‘distant’ points have bigger influence than close points ( Oregon births more impact than Kansas births!) – is not the point of “minimum aggregate travel”--this would minimize distances (not their square) and can only be identified by approximation.
useful for
– summarizing change over time in a distribution (e.g US pop. centroid every 10 years) – placing labels for polygons
for weird-shaped polygons, centroid may not lie within polygon
centroid outside polygon
Note: many ArcView applications calculate only a “psuedo” centroid: the coordinates of the Can be implemented via: bounding box (the extent) of the polygon ArcToolbox>Spatial Statistics Tools>Measuring Geographic Distributions>Mean Center 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 10
2,3 4,7 6,2 7,7 7,3 Calculating the centroid of a polygon or the mean center of a set of points.
1 2 3 4 5 sum Centroid/MC 2 4 7 7 6 26
5.2
3 7 7 3 2 22
4.4
(same example data as for area of polygon)
X
i n
1
X i
,
Y n
n
i
1
Y i n
0 0 2,3 4,7 5 5 6,2 7,7 7,3 10 10
i
1 2 3 4 5 sum w MC Calculating the weighted mean center. Note how it is pulled toward the high weight point.
X
2 4 7 7 6 26
Y
3 7 7 3 2 22
weight
3,000 500 400 100 300 4,300
wX wY
6,000 2,000 2,800 700 1,800 13,300
3.09
9,000 3,500 2,800 300 600
X
i n
1
w i X i w i
,
Y
i n
1
w i Y i w i
16,200
3.77
11 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Source:
US Statistical Abstract 2003
Median Center:
Intersection of a north/south and an east/west line drawn so half of population lives above and half below the e/w line, and half lives to the left and half to the right of the n/s line.
Same as “
point of minimum aggregate travel
” the location that would minimize travel distance if we brought all US residents straight to one location.
Mean Center:
Balancing point of a weightless map, if equal weights placed on it at the residence of every person on census day
.
Note: minimizes squared distances. The point is considerable west of the median center because of the impact of “squared distance” to “distant” populations on west coast For a fascinating discussion of the effect of population projection see: E. Aboufadel & D. Austin,
A new method for calculating the mean center of population center of the US
Professional Geographer, February 2006, pp. 65-69
Standard Distance Deviation
single unit measure of the spread or dispersion of a distribution.
• Is the spatial equivalent of standard deviation for a single variable • Equivalent to the standard deviation of the distance of each point from the mean center • Given by:
i n
1 (
X i
X c
) 2
i n
1 (
Y i
Y c
) 2
N
which by Pythagoras reduces to:
i n
1
N d iC
2 ---the square root of the average squared distance ---essentially the average distance of points from the center We can also weight each point and calculate weighted standard distance (analogous to weighted mean center.) 13 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Standard Distance Deviation
Example
4,7 Circle with radii=SDD=2.9
7,7 7,3 2,3
i
1 2 3 4 5 sum Centroid
X
2 4 7 7 6 26 5.2
Y
3 7 7 3 2 22 4.4
(X - Xc) 2 (Y - Yc) 2
10.2
1.4
3.2
3.2
0.6
2.0
6.8
6.8
2.0
5.8
18.8
sum divide N sq rt 23.2
42.00
8.40
2.90
0 6,2
i
1 2 3 4 5 sum Centroid
X
2 4 7 7 6 26 5.2
5
Y
3 7 7 3 2 22 4.4
(X - Xc) 2
10.2
1.4
3.2
3.2
0.6
18.8
sum of sums divide N sq rt
(Y - Yc) 2
2.0
6.8
6.8
2.0
5.8
23.2
42 8.4
2.90
10
sdd
i n
1 (
X i
X c
) 2
i n
1 (
Y i
Y c
) 2
N
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 14
Standard Deviational Ellipse:
concept
• • •
Standard distance deviation is a good single measure of the dispersion of the incidents around the mean center, but it does not capture any
directional bias
– doesn’t capture the shape of the distribution.
The standard deviation ellipse gives dispersion in two dimensions Defined by 3 parameters
– Angle of rotation – Dispersion along major axis – Dispersion along minor axis The major axis defines the direction of maximum spread of the distribution The minor axis is perpendicular to it and defines the minimum spread
Standard Deviational Ellipse:
example
There appears to be no major difference between the location of the software and telecommunications industry in North Texas. For formulae for its calculation, see Lee and Wong
Statistical Analysis with ArcView GIS
pp. 48-49 (1 st ed.), pp 203-205 (2 nd ed.) 16 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Spatial Operations:
buffer zones
• region within ‘x’ distance units • buffer any object: point, line or polygon • use multiple buffers at progressively greater distances to show gradation • may define a ‘friction’ or ‘cost’ layer so that spread is not linear with distance • Implement in Arcview 3.2 with
Theme/Create buffers
in ArcGIS 8 with
ArcToolbox>Analysis Tools>Buffer
point buffers line buffer polygon buffer
Examples
• 200 foot buffer around property where zoning change requested • 100 ft buffer from stream center line limiting development • 3 mile zone beyond city boundary showing ETJ (extra territorial jurisdiction) • use to define (or exclude) areas as options (e.g for retail site) or for further analysis • in conjunction with ‘friction layer’, simulate spread of fire
Note: only one layer is involved, but the buffer can be output as a new layer
Spatial Operations:
• • •
spatial aggregation
districting/redistricting
– grouping
contiguous
polygons into
districts
– original polygons preserved
Regionalization (or dissolving)
– grouping polygons into contiguous regions – original polygon boundaries dissolved
classification
– grouping polygons into non contiguous regions – original boundaries usually dissolved – usually ‘formal’ groupings Implement in ArcView 9 thru ArcToolbox>Generalization>Dissolve
Grouping/combining polygons—is applied to one polygon layer only.
Criteria may be:
– formal (based on in situ characteristics) e.g. city neighborhoods – functional (based on flows or links): e.g. commuting zones
Groupings may be:
– contiguous – non-contiguous
Boundaries for original polygons:
– may be preserved – may be removed (called dissolving) • • • • • •
Examples:
elementary school zones to high school attendance zones (
functional districting
) election precincts (or city blocks) into legislative districts (
formal districting
) creating police precincts (
funct. reg
.) creating city neighborhood map (
form. reg.
) grouping census tracts into market segments--yuppies, nerds, etc (
class.
) creating soils or zoning map (
class
)
Districting
: elementary school attendance zones grouped to form junior high zones.
Regionalization
: census tracts grouped into neighborhoods
Classification
: cities categorized as central city or suburbs soils classified as igneous, sedimentary, metamorphic 19 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Spatial Operations:
Spatial Matching:
Spatial Joins and Overlays
•
combine two (or more) layers to: Examples
– – select features in one layer, &/or create a new layer • assign environmental samples (points) to census tracts to estimate •
used to integrate data having different spatial properties (point v. polygon), or different boundaries (e.g. zip codes and census tracts)
exposure per capita (point in polygon) • identify tracts traversed by freeway for study of neighborhood blight •
can overlay polygons on:
(polygon on lines) – points (
point in polygon)
– lines (
line on polygon
) – other polygons
(polygon on polygon)
– many different Boolean logic combinations possible » Union (A or B) » Intersection (A and B) • integrate census data by block with sales data by zip code (polygon on polygon) • Clip US roads coverage to just cover Texas (polygon on line) • » A and not B ; not (A and B)
can overlay points on:
• Join
capital city
layer to layer to calculate distance to nearest state capital
all city
– Points, which finds & calculates distance to nearest point in other theme (point on point) – Lines, which calculates distance to nearest line 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 20
Example: Spatial Matching:
Clipping and Erasing
(sometimes referred to as
spatial extraction
) •
CLIP
- extracts those features from an input coverage that overlap with a clip coverage. This is the most frequently used polygon overlay command to extract a portion of a coverage to create a new coverage.
•
ERASE
- erases the input coverage features that overlap with the erase coverage polygons.
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 21
Example: Spatial Matching via
Polygon-on-Polygon Overlay: Union
Land Use Drainage Basins Atlantic Gulf Combined layer aG a.
aA b.
G.
bA bG A.
c.
cA cG The two layers (land use & drainage basins) do not have common boundaries. GIS creates combined layer with all possible combinations, permitting calculation of land use by drainage basin. Note: the definition of Union in GIS is a little different from that in mathematical set theory. In set theory, the union contains everything that belongs to any input set, but original set membership is lost. In a GIS union, all original set memberships are explicitly retained.
Another example
In set theory terms, the outcome of the above would simply be: 1 2 3 GIS Union Set Theory Union
Implementing Spatial Matching in ArcGIS 9
• • •
Available in three places
via
Selection/Select by Location
– this selects features of one layer(s) which relate in some
specified spatial manner
to the features in another layer – if desired, selected features may be saved later to a new theme via
Data/Export Data
– Individual features are not themselves modified via
Spatial Join
(right click layer in T of C, select
Join/Joins and Relates
, then click down arrow in first line of
Join Data
window---see
Joining Data
in
Help
for details) –
Use for: points in polygon lines in polygon points on lines (to calculate distance to nearest line) points on points (to calculate distance to “nearest neighbor” point)
– operate on tables and normally creates a new table with additional variables, but again does not modify spatial features themselves via
ArcToolbox
– Generally these tools modify geographic feature, thus they create a new layer (e.g. shape file) – Tools are organized into multiple categories • • • •
ArcToolbox Examples
Dissolve features based on an attribute
– Combine contiguous polygons and remove common border –
ArcToolbox>Generalization>Dissolve
Clip one layer based on another
– –
ArcToolbox>Analysis Tools>Extract>Clip Use one theme to limit features in another theme
(e.g. limit a Texas road theme to Dallas county only)
Intersect two layers (extent limited to common area)
– –
ArcToolbox>Analysis Tools>Overlay>Intersect Use for polygon on polygon overlay
Union two layers (covers full extent of both layers)
– –
ArcToolbox>Analysis Tools>Overlay>Intersect Use for polygon on polygon overlay
Spatial Operations
:
•
neighborhood analysis/spatial filtering
spatial convolution or filter
– applied to one raster layer – value of each cell replaced by some function of the values of itself and the cells (or polygons) surrounding it – can use ‘neighborhood’ or ‘window’ of any size » 3x3 cells (8-connected) » 5x5, 7x7, etc.
– differentially weight the cells to produce different effects –
kernel
for 3x3 mean filter: 1/9 1/9 1/9 1/9 1/9 1/9 1/9 1/9 1/9
weights must sum to 1.0
•
low frequency ( low pass) filter:
mean filter
– cell replaced by the mean for neighborhood – equivalent to weighting (mutiplying) each cell by 1/9 = .11 (in 3x3 case) – smooths the data – use larger window for greater smoothing
median filter
– use median (middle value) instead of mean – smoothing, especially if data has extreme value outliers 24 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Spatial Operations:
spatial filtering -- high pass filter
high frequency (high pass) filter
negative weight filter
– exagerates rather than smooths local detail – used for edge detection cell values ( v i ) on each side of edge
2 5
filtered values for highlighted pixel 1(5)(9)+5(5)(-1)+3(2)(-1) = 14
standard deviation filter (texture transform)
– calculate
standard deviation
of neighborhood raster values – high SD=high texture/variability – low SD=low texture/variability – again used for edge detection – neighorhoods spanning border have large SD ‘cos of variability 1(2)(9)+5(2)(-1)+3(5)(-1) = -7 –
kernel for example (
w i
)
1(2)(9)+8(2)(-1) = 2
-1 -1 -1
1(5)(9)+8(5)(-1) = 5
-1 9 -1
f i
.
v
.
i w i
-1 -1 -1
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 25
• • •
Spatial Operations:
raster–based modelling
Relating multiple rasters
•
Suitability modeling
Processes may be:
– Local: one cell only – Neighborhood: cells relating to each other in a defined manner – Zonal: cells in a given section
•
– Global: all cells
ArcGIS implementation:
– All raster analyses require either the Spatial Analyst or 3-D Analyst extensions
•
– Base ArcView can do no more than display an image (raster) data set 1 0 soil 0 0 1 slope 0 1 1 1 for 1 sale 1 0
Diffusion Modeling
Incidence matrix
Connectivity Modeling
Initial State Probability mask Connectivity matrix 2 Site 0 3 options 2 System at time t+1 Resultant State 26 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Attribute Operations:
record selection or extraction
--features selected on the map are identified in the table (and visa versa)
• • •
Select by Attribute (tabular)
Independent selection by clicking table rows: – Open Attribute Table & click on grey
selection box
start of row (hold
ctrl
for multiple rows) Create SQL query – use
Selection/Select by Attribute
use table
Relates /Joins
to select specific data at • •
Outputs may be:
Simultaneously highlighted records in table, and features on map New tables and/or map layers • • • •
Select by Graphic
Manually, one point at a time –
use Select Features tool
within a rectangle or an irregular polygon –
use Selection/Select by Graphic
within a radius (circle) around a point or points –
use Selection/Select by Location (are wthin distance) Select by Location
By using another layer –
Use Selection/Select by Location
(same as
Spatial Matching
discussed previously) •
Hot Link
Click on map to ‘hot link’ to pictures, graphs, or other maps • • •
Examples
Use SQL query to select all zip codes with median incomes above $50,000 (tabular) identify zip codes within 5 mile radius of several potential store sites and sum household income (graphic) show houses for sale on map, and click to obtain picture and additional data on a selected house (hot link) 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 27
• • •
Attribute Operations:
statistical analysis on one or more columns in table
univariate (one variable or column)
– central tendency: mean, median, mode – dispersion: standard deviation, min, max – To obtain these statistics in ArcGIS: » Right click in T of C and select
Open attribute table
» Right click on column heading and select
Statistics
bivariate (relating two variables or columns)
– interval and nominal scale variables: sum or mean by category » average crop yield by silt-sand-clay soil types » To implement in ArcGIS, proceed as above but use
Summarize
– two interval scale variables: correlation coefficients » income by education » ArcScripts are available for this on ESRI web site (or use Excel!)
multivariate (more than two variables)
– usually requires external statistical package such as SAS, SPSS, STATA or S-PLUS 28 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
• •
Attribute Operations:
variable recoding
establishing/modifying number of classes and/or their boundaries for continuous variable. Options for ArcGIS
– natural breaks (default) (finds inherent inherent groups via Jenks optimization which minimizes the variances within each of the classes). – quantile (classes contain equal number of records- or equal area under the frequency distribution) – equal interval (user selects # of classes)
Implement in ArcGIS via:
Right click in T of C, select
Properties
, then
Symbology
tab (equal width classes on variable) – Defined interval (user selects width of classes) (equal width classes on variable) – standard deviation 23% 25% 25% (assumes a Normal distribution) 23% Equal area % (categories based on 1,2, etc, SDs 14% 34% 34% 14% Equal interval % above/below mean) – Manual (user defined) -2 » whole numbers (e.g. 2,000) » meaningful to phenomena (e.g zero, 32 o ) -1 -.68
0 0 .68
1 2 Standard Deviation Equal interval score Equal area score
aggregating categories on a nominal (or ordinal) variable
– pine and fir into evergreen
No change in number of records (observations).
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 29
Attribute Operations:
record aggregation
• • • • combining two or more records into one, based on common values on a key variable the attribute equivalent of regionalization or classification equivalent of PROC SUMMARY in SAS interval scale variables can be aggregated using mean, sum, max, min, standard deviation, etc. as appropriate
Fips
48085 48113 48121 48139 48213 48231 48257 48397 48221 48251 48367 48439
PMSA_90 PMSA_93 Pop90
1920 1920 264036
Pop95_est Pop90-95% MedInc89 Suburb Name
346232 5.93
46020 1 Collin 1920 1920 1920 1920 1920 1920 1920 1852810 273525 85167 58543 1959281 334070 94223 64293 1.09
4.22
2.03
1.87
31605 36914 30553 20747 0 1 1 1 Dallas Denton Ellis Henderson 1920 64343 66972 0.78
25317 1 Hunt 1920 1920 2800 2800 2800 1920 1920 2800 2800 2800 2800 52220 25604 28981 97165 64785 1170103 60114 32725 33384 106181 73794 1278606 2.88
5.30
2.89
1.77
2.65
1.77
27280 42417 31627 30612 30592 32335 1 1 1 1 1 0 Kaufman Rockwall Hood Johnson Parker Tarrant Source: US Bureau of the Census MedInc=Median Household Income. Pop90 as of April 1. Pop95 as of July 1.
• ordinal and nominal require special consideration • example: aggregate county data to states, or county to CMSA
Fips PMSA_90 PMSA_93 Pop90
1920 2676248
Pop95_est Pop90-95% MedInc89 Suburb Name
2957910 2.00
32607 7 Dallas 2800 1361034 1491965 1.83
31292 3 Fort Worth
Record count decreases (e.g. from 12 to 2)
sum 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals sum re-calc.
Type of processing:
average of medians!
count 30
Attribute Operations:
Joining and Relating Tables
associating spatial layer to non-spatial table
Join: one to one, or one to many, relationship appends attributes Associate table of country capitals with country layer: only one capital for each country (one to one) Country Code
29 68 106 248 199 9
Country
France Saudi Arabia Chad Spain Venezuela UK 96 Philippines
Layer Attribute Table
Country Code
199 96 68 29 106 9
Capital
Caracas Manila Riyadh Paris N'Djamena London 248 Madrid
NonSpatial Table
Country Code
9 29 68 96 106 199 248
Country
UK France Saudi Arabia Philippines Chad Venezuela
Capital
London Paris Riyadh Manila N'Djamena Caracas Spain Madrid
Layer Attribute Table after Join
Associate country layer with type of government: one gov. type assigned to many countries--but each country has only one gov. type (one to many) Gov. Code
20 30 15 20 10 15
Country
France Vietnam UK Argentina Saidi Arabia Sweden 45 Portugal
Layer Attribute Table
Gov. Code
10 15 20 30 45
Type
Absolute Monarchy Const. Monarchy Republic Communist State Parliamentary Democracy
Gov. Code
20 30 15 20 10 15 45
Country
France Vietnam UK Argentina Saudi Arabia Sweden
Type
Republic Communist State Const. Monarchy Republic Absolute Monarchy Const. Monarchy Portugal Parliamentary Democracy
Layer Attribute Table after Join NonSpatial Table
31 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Single most common error in GIS Analysis
--intending a one to one join of attribute to spatial table --getting a one to many join of attributes to spatial table NAME Alabama Alaska …..
Georgia Hawaii Idaho …..
Wisconsin Wyoming FIPS 01 02 13 15 16 55 56 CODE AL AK GA HI ID WI WY POP2000 4,447,100 626,932 8,186,453 1,211,537 1,293,953 5,363,675 493,782 Spatial 51 states Total 282,421,906 FID 0 1 11 12 13 14 15 16 53 54 SHAPE Polygon Polygon Polygon Polygon Polygon Polygon Polygon Polygon Polygon Polygon NAME Alabama Alaska …..
Georgia Hawaii-Hawaii FIPS 01 02 13 15 Hawaii-Maui Hawaii-Oahu Hawaii-Kauai Idaho …..
Wisconsin Wyoming 15 15 15 16 55 56 CODE AL AK GA HI HI HI HI ID WI WY Total POP2000 4,447,100 626,932 8,186,453 1,211,537 1,211,537 1,211,537 1,211,537 1,293,953 5,363,675 493,782 286,056,517 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals After joining attribute to spatial data 32
Attribute Operations:
Joining and Relating Tables
associating spatial layer to non-spatial table (contd.)
Relate: many to one relationship, attributes not appended Associate country layer with its multiple cities (many to one) Country Code Country
29 France 68 Saudi Arabia 106 Chad 248 Spain 199 Venezuela 9 UK 96 Philippines
Layer Attribute Table
Country Code
129 129 29 29 29 60 248 248 248
City
Mombasa Nairobi Paris Lyon Marseille Katmandu Madrid Barcelona Valencia
NonSpatial Table
If
joined
Paris to France, for example, we lose Lyon and Marseille, therefore use
relate
Note: if we flip these tables we can do a join since there is only one country for each city (one to many)
• •
For both Joins and Relates: Association exists only in the map document Underlying files not changed unless export data
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 33
Analysis Options:
Advanced & Specialized
• • • • •
Advanced
(Table of Contents)
Specialized
Proximity/point pattern analysis
• Remote Sensing image processing and classification – nearest neighbor layer • raster modeling – distance matrix layer • 3-D surface modeling
surface analysis
– cross section creation • spatial statistics/statistical – visibility/viewshed modeling
network analysis
– routing • functionally specialized – transportation modeling » shortest path (2 points) » travelling salesman (n points) – land use modeling – hydrological modeling – time districting – etc.
– allocation
Convex Hull Thiessen Polygon creation
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 34
Advanced Applications:
Proximity Analysis
• •
Nearest Neighbor
location (distance) relative to nearest neighbor ( points or polygon centroids) location (distance) relative to nearest objects of selected other types (e.g. to line, or point in another layer, or polygon boundary)
Point Pattern Analysis
is pattern?
Random Clustered Dispersed
Requires only one output column –
altho generalizable to k th nearest neighbor
•
Full matrix
measure location of each object relative to every other object –
requires output matrix with as many columns as rows in input table
Requires the application of
Spatial Statistics
•Nearest neighbor statistic •Moran’s I such as which are based on proximity of points to each other
ArcToolbox>Spatial Statistics Tools
35 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Advanced Applications:
• •
Routing
Network Analysis
Network-based Districting
•
shortest path between two points
– direction instructions (locating hotel from airport)
travelling salesman: shortest expand from site along network until criteria (time, distance, cost, object count) is reached; then assign area to district
– creating market areas, attendance zones, etc – essentially network-based buffering
path connecting n points Network-based Allocation
– bus routing, delivery drivers •
assign locations to the nearest center
for water, sewage, etc.).
based upon travel thru network
In all cases, ‘distance’ may be measured in miles, time, cost or other ‘friction’ (e.g pipe diameter • – assign customers to pizza delivery outlets
draw boundaries (lines of equidistance between 2 centers) based on the above
Arc or node attributes (e.g one-way streets, no left turn) may also be critical.
– Network-based market area delimitation – Essentially, network-based polygon tesselation 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 36
Advanced applications:
Surface Analysis
• •
Slope Transform
fit a plane to the 3 by 3 neighborhood around every cell, or use a TIN output layer is the
slope
(first derivative) of the plane for each cell • • •
Aspect Transform
direction slope faces: (E-W oriented ridge has slopes with northern and southern aspects) aspect normally classified into eight 45 degree categories calculate as horizontal component of the vector perpendicular to the surface
Cross-section Drawings and Volumes
• elevation (or slope) values along a line • Volume & cut-and-fill calculation • Cross-section easy to produce for raster, more difficult for vector especially if uses contours lines
Viewshed/Visibility
• terrain visible from a specific point • • applications – visual impact of new construction – select scenic overlooks – Military
Contouring
– Lines joining points of equal (vertical) value – From raster, massed-points or breakline data 37 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Advanced Applications: Convex Hull
• Formally: the smallest convex polygon (no concave angles) able to contain a set of points • Informally: a rubber band wrapped around a set of points • Just as a centroid is a point representation
No!
for a polygon, the convex hull is the polygon representation for a set of points • Go to the following web site for a neat application showing how convex hull changes as you move points around
–
http://www.cs.princeton.edu/~ah/alg_anim/version1/ConvexHull.html
38 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Advanced Applications:
Thiessen (Dirichlet, Voronoi) Polgons
•
and Delaunay Triangles
polygons generated from a point layer such that any location within a polygon is closer to the enclosed point than to a point within any other polygon • • • • they divide the space between the points as ‘evenly’ as possible used for market area delimitation, rain gauge area assignment, contouring via Delaunay triangles (DTs), etc.
elevation, slope and aspect of triangle calculated from heights of its three corners DTs are as near equiangular as possible and longest side is as short as possible, thus minimizes distances for interpolation Thiessen neighbors of point A share a common boundary. Delauney triangles are formed by joining point to its Thiessen neighbors.
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals Thiessen Polygons (or proximal regions or proximity polygons) A Delaunay Triangles A 39
Specialized Applications
• • •
Remote Sensing/Digital Image Processing
reflectance value (usually 8 bit; 256 values) collected for each bands (wavelength area) in the electro magnetic spectrum – 1 band for grey scale (Black & white) – 3 for color – up to 200 or so for ‘hyperspectral’ – permits creation of image
‘
spectral signature’: set of reflectance values/ranges over available bands typifying a specific phenomena – provides basis for identification of phenomena
Location Science/Network Modeling Network based models for optimum location decisions for (e.g.)
– police beats – School attendance zones – Bus routes – Hazardous material routing – Fire station location • •
Raster Modeling: 2-D
use of direction and friction surfaces to develop models for: – spread of pollution – dispersion of forest fires
Surface Modeling: 3-D
– flood potential – ground water/reservoir studies – Viewshed/visibility analysis
Spatial Statistics/Econometrics
analyses on spatial data which explicitly incorporates relative location or
proximity
property of observations Global (applies to entire study area) – spatial autocorrelation – Regressions adjusted for spatial autocorrelation Local (separately calculated for local areas) – LISA (local indicators of spatial autocorrelation) – Geographically weighted regression
We offer one or more courses on each!
40 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Implementation of Advanced and Specialized Applications in ArcGIS 8/9
• • • • • •
Extensions support many of the Advanced and some Specialized Applications Spatial Analyst
extension provides 2-D modeling of GRID (raster) data (AV 3.2 and 8/9)
3-D Analyst
extension provides 3-D modeling (AV 3.2 and 8/9)
Geostatistical Analyst Network Analyst extension (3.2 only) and ArcLogistics Route (standalone)
and network analysis extension provides interpolation (ArcGis 8/9 only) for routing
Image Analyst
extension for remote sensing applications in AV 3.2
– Leica Image Analysis and Stereo Analyst for ArcGIS 8 (9 version not yet released-Fall ’04)
Spatial Statistics Tools
in ArcToolbox provide spatial statistics (centroid, etc..) •
ArcScripts support other Advanced Applications and Specialized Applications ArcScripts (in Visual Basic, C++, etc.)
are used to customize ArcGIS 8 – A variety of scripts available at
http://support.esri.com/
– Note: ArcScripts written in
Avenue
>downloads
work only in ArcView 3 and will not work in ArcGIS 8/9 – Many functions previously requiring Avenue scripts for AV 3.2 are built into ArcGIS 8/9 • •
Specialized Software Packages
Remote Sensing packages such as Leica GeoSystems Imagine (formerly ERDAS Imagine) For links to some of these packages go to: http://www.utdallas.edu/~briggs/other_gis.html
41 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Appendix
Implementing Spatial Analysis in ArcView 3.2/3.3
5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 42
• • •
Implementing Spatial Measurement in ArcView 3.2
Unlike in ArcGIS 8.1 where spatial measurements are provided automatically, in AV 3.2 spatial measurement often has to be implemented using Avenue code:
– in functional expressions – in scripts
Using functional expression for areas and lengths:
– Use Edit/Add field to add a new variable to atttributes of…table called
area
– Use Field/calculate and make this variable equal to:
[Shape].ReturnArea
– – Calculation is based on map units irrespective of defined distance units.
If map units are feet, to obtain square miles use: (or similar)
[Shape].ReturnArea/5280/5280
– If file is a polyline file (arcs), for length of arcs use:
[Shape].ReturnLength
Using a Script for areas, perimeters and lengths
In Project window, select Script and click new button to open script window Use
Script/load text file
to load code from an existing text file e.g.
arcview\samples\scripts\calcapl.ave will calculate areas, perimeters, lengths
Click the “check mark” icon to compile the code.
Open a View and be sure the theme you want processed is active.
Click on script window then click the Runner icon to run script.
variables measuring area and perimeter will be added to theme table
43 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Implementing Spatial Matching in ArcView 3.2
•
Available in two places (plus additional user extensions such as districting)
via
Theme/select by theme
– this selects features of the active theme which relate in some
specified spatial manner
to • • • • • • another theme – if desired, selected features may be saved later to a new theme via
Theme/convert to shape file
• via Geoprocessing Wizard Extension (use
Options in Geoprocessing Wizard
(use
File/Extensions
– Six options available for different types of matching to load) – this creates a new theme (shape file) & combines attribute tables from 2 or more input themes
View/Geoprocessing Wizard
to activate)
Dissolve features based on an attribute
–
Use for spatial aggregation/dissolving
Merge themes together
– –
Use for edge matching
Clip one theme based on another
Use one theme to limit features in another theme (e.g. limit a Texas road theme to Dallas county only)
•Scripts and extensions can provide additional capabilities •Download from ArcScripts at ESRI http://gis.esri.com/arcscripts/ scripts.cfm
•Place extensions (.avx) in your
Intersect two themes
–
Use for polygon on polygon overlay
folder
Arcview/ext32
•The extension
district.avx
is good
Union two themes
–
Use for polygon on polygon overlay
for doing spatial aggregation or “districting”
Assign data by location (Spatial Join
) –
Use for: points in polygon lines in polygon points on lines (to calculate distance to nearest line) points on points (to calculate distance to “nearest neighbor” point)
44 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
•
Using Extensions and Scripts in ArcView 3.2
Obtain copy of script or extension
– Write yourself with Avenue language – Supplied with ArcView in folder:
arcview/samples/scripts
or
arcview/samples/ext
» Go to ArcView
Help/Contents/Sample Scripts and Extensions
for documentation – Buy from ESRI and other companies – Supplied free by ESRI or users and available on ESRI web site at:
http://arcsripts.esri.com/ Select Avenue language
» or go to www.esri.com
and click Support » Be sure to print or download documentation/description •
To load and use an extension
– Place .avx file in
arcview/ext32
folder – Open ArcView, choose
File/extensions
, place tick next to name, click OK •
To load and use a script
In Project window, select Script and click new button to open script window Use
Script/load text file
to load code from existing text file containing avenue code (.ave)
e.g. \av_gis30\ arcview\samples\scripts\calcapl.ave will calculate areas, perimeters, lengths
Click the “check mark” icon to compile the code.
Take steps within ArcView as appropriate for specific script e.g. Open a View and be sure the theme you want processed is active.
Click on script window then click the “Runner" icon to run script.
e.g. variables measuring area and perimeter will be added to theme table 45 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals
Some Example Avenue Scripts for ArcView 3
•
Avenue scripts and extensions
for AV 3.2 can be downloaded from ESRI Web site to do many basic, advanced and specialized applications not available in standard products. Some examples are: –
Addxycoo.ave:
adds X,Y coordinates of points (e.g of geocoded addresses), or of centroid for polygons, to
attributes of …
file – –
Polycen.ave:
creates point theme containing polygon centroids
Dwizard.zip:
various districting applications » Use
avdist31b
which is an update – –
Line.zip:
enhanced buffering of lines
Nearestneighbor.zip:
nearest neighbor analysis For more scripts, go to: http://arcsripts.esri.com/ Select
Avenue
language 5/6/2020 Ron Briggs, UTDallas GISC 6381 GIS Fundamentals 46