Mechanistic models for macroecolgy: moving beyond correlation Nicholas J. Gotelli Department of Biology University of Vermont Burlington, VT 05405 ?? What causes geographic variation in species richness ?? Understanding species richness patterns •

Download Report

Transcript Mechanistic models for macroecolgy: moving beyond correlation Nicholas J. Gotelli Department of Biology University of Vermont Burlington, VT 05405 ?? What causes geographic variation in species richness ?? Understanding species richness patterns •

Slide 1

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 2

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 3

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 4

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 5

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 6

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 7

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 8

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 9

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 10

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 11

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 12

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 13

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 14

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 15

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 16

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 17

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 18

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 19

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 20

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 21

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 22

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 23

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 24

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 25

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 26

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 27

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 28

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 29

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 30

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 31

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 32

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 33

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 34

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 35

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 36

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 37

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 38

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 39

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 40

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 41

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 42

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 43

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 44

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 45

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 46

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 47

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 48

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 49

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 50

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 51

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 52

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 53

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 54

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 55

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 56

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 57

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 58

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 59

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 60

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 61

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 62

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 63

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 64

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 65

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 66

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 67

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 68

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 69

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 70

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 71

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 72

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 73

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 74

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 75

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 76

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 77

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 78

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 79

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 80

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 81

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 82

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 83

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 84

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 85

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 86

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 87

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 88

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 89

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 90

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 91

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 92

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 93

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 94

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 95

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 96

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 97

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 98

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 99

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 100

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 101

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 102

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 103

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 104

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes


Slide 105

Mechanistic models for
macroecolgy:
moving beyond correlation
Nicholas J. Gotelli
Department of Biology
University of Vermont
Burlington, VT 05405

??
What causes geographic variation
in species richness
??

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Gary Entsminger
Acquired Intelligence

Nicholas Gotelli,
University of Vermont

Gary Graves
Smithsonian

Rob Colwell
University of
Connecticut

Carsten Rahbek
University of Copenhagen

Thiago Rangel

Federal University of Goiás

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Data sources
• Gridded map of domain

Avifauna of South America
“There can be no question, I
think, that South America is
the most peculiar of all the
primary regions of the globe
as to its ornithology.”
P.L. Sclater (1858)

South American Avifauna
• 2891 breeding
species
• 2248 species
endemic to South
America and
associated landbridge islands

Minimum:
18 species

Maximum:
846 species

Minimum:
18 species

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells

Anas puna

Geographic Ranges
For Individual
Species

Phalacrocorax brasilianus

Myiodoorus cardonai

Geographic Ranges

Species Richness

Geographic Ranges

Species Richness

Data sources
• Gridded map of domain
• Species occurrence records within grid
cells
• Quantitative measures of potential
predictor variables within grid cells (NPP,
temperature, habitat diversity)

Climate, Habitat Variables
Measured at Grid Cell Scale

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

How are these macroecological
data typically analyzed?

800
600

Observed Species Richness

400
200
0
0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400
200
0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

800
600
400

OLS

200

LOESS

0

Observed Species Richness

Poisson

0

2

4

6

8

10

N et Primary Produc tiv ity (Tonnes /hec tare)

12

14

How are these macroecological
data typically analyzed?

Curve-fitting!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size

Criticisms of Curve-Fitting
• “Correlation does not equal causation”
Common to all of macroecology!
• Non-linearity & non-normal, spatially correlated errors
LOESS, Poisson, Spatial Regression (SAM)
• Choosing among correlated predictor variables
Model selection strategies, stepwise regression, AIC
• Sensitivity to spatial scale, taxonomic resolution,
geographic range size
Stratify analysis

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Conceptual Weakness of
Curve-Fitting Paradigm
Potential Predictor
Variables
(tonnes/ha, C°)

??
MECHANISM
??

minimize
residuals

Predicted
Species Richness
(S / grid cell)

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

How can we build explicit
simulation models for
macroecology?

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

One-dimensional geographic domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

Species
Number

One-dimensional geographic domain
Species geographic ranges randomly
placed line segments within domain

Peak of species richness in geographic
center of domain

domain

geographic range

domain

Pancakus spp.

der Pfankuchen
Guild

Reduced species richness
at margins of the domain

Mid-domain
peak of species richness
in the center of the domain

2-dimensional MDE Model
• Random point of origination
within continent (speciation)
• Random spread of geographic
range into contiguous
unoccupied cells
• Spreading dye model (Jetz & Rahbek
2001) predicts peak richness in
center of continent (r2 = 0.17)

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

Assumptions of MDE models
• Placement of ranges within domain is
random with respect to environmental
gradients
– Controversial, but logical for a null model for
climatic effects

• Geographic ranges are cohesive within the
domain
– Rarely discussed, but important as the basis
for a mechanistic model of species richness

Range Cohesion

Range Scatter

At the 1º x 1º scale, > 95% of species of
South American birds have contiguous
geographic ranges

Causes of Range Cohesion
• Extrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes

Causes of Range Cohesion
• Extrinsic Causes
– Coarse Spatial Scale
– Spatial Autocorrelation in Environments

• Intrinsic Causes
– Limited Dispersal
– Philopatry & Site Fidelity
– Metapopulation & Source/Sink Structure
– Fine-scale Genetic Structure & Local Adaptation
– Spatially Mediated Species Interactions

Strict Range Cohesion

Stepping Stone

* The mid-domain effect does not require strict
range cohesion. A mid-domain peak in species
richness will also arise from stepping stone
models with limited dispersal and from neutral
model dynamics (Rangel & Diniz-Filho 2005)

Homogenous
Environment

Heterogeneous
Environment

Almost all MDE models have assumed a
homogeneous environment:
grid cells are equiprobable

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT
Homogeneous
Enforced
RANGE
COHESION
Relaxed

Classic MDE

Statistical Null
(slope = 0)

Heterogeneous

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

ENVIRONMENT

Enforced
RANGE
COHESION
Relaxed

Homogeneous

Heterogeneous

Classic MDE

Range Cohesion
Models

Statistical Null
(slope = 0)

Range Scatter
Models

Range Cohesion Models are a hybrid that describes a
stochastic MDE model in a more realistic heterogeneous environment.
Range Scatter Models also incorporate environmental heterogeneity, but
do not place any constraints on species geographic ranges.

Alternative Strategy:
Mechanistic Simulation Models
Potential Predictor
Variables
(tonnes/ha, C°)

Predicted
Species Richness
(S / grid cell)

mechanism

Explicit
Simulation
Model

Observed
Species Richness
(S / grid cell)

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)

Modeling Strategy
• Establish simple algorithms that describe
P(occupancy) based on environmental
variables
• Simulate origin and placement of each
species geographic range in
heterogeneous landscape (with or without
range cohesion)
• Repeat simulation to estimate predicted
species richness per grid cell

Geographic Ranges

Species Richness

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
– Temperature Kinetics (Brown et al. 2004)

What determines
P(cell occurrence)?
• Simple environmental models
P(occurrence)  Measured Environmental
Variable (NPP, Temperature, etc.)

• Formal analytical models
– Species-Energy Model (Currie et al. 2004)
P(occurrence)  (NPP)(Grid-cell Area)
– Temperature Kinetics (Brown et al. 2004)
P(occurrence)  e-E/kT

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Model-Selection
in Curve-Fitting Analyses
• Simple tests against the null hypothesis
that b=0
• No consideration of what expected slope
should be with a specific mechanism
• Least-square and AIC criteria to try and
select a subset of variables that best
account for variation in S

800
600
400
200

H0: b = 0

0

Observed Species Richness

OLS

0

2

4

6

8

10

Net Primary Productivity (Tonnes/hectare)

12

14

Model Selection with
Mechanistic Simulation Models
• Models make quantitative predictions of
expected species richness
• Test slope of observed richness versus
predicted richness
• Hypothesis of an acceptable fit H1: b = 1.0
• Rank acceptable models according to
slope, intercept, and r2
• AIC criteria not appropriate

Observed b
Theoretical b = 1.0

Observed S

Predicted S

Understanding species
richness patterns
• Data sources
• A critique of current methods
• Range cohesion and the mid-domain
effect
• Mechanistic models for species richness
• Model selection
• Summary

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)

Summary
• Curve-fitting framework does not
incorporate explicit mechanisms
• Use mechanistic simulations to define the
placement of geographic ranges in a
gridded domain
• Specify rules for P(occurrence)=
f(environmental variables)
• Test model fit against expected slope = 1.0

Criticisms & Rejoinders

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?

Criticisms & Rejoinders
• “Each species has a unique and distinctive response to different
environmental variables. Species ranges should be modeled
independently, not with a single function for all species.”
If this is true, why are there widespread repeatable patterns of
species richness (e.g., latitude, elevation, area, productivity)?
Often not enough data to model each species individually. We need
a simple framework for analysing entire floras and faunas at a
biogeographic scale.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary.”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Criticisms & Rejoinders
• “1:1 scaling of environmental variables with P(occurrence) is
unrealistic and arbitrary”
Perhaps, but this is a parsimonious mechanistic model that relates
environmental variables to geographic range placement.
Linearity in P(occurrence) is not unreasonable over the empirical
ranges of environmental variables measured in South America.
(Linearity of P(occurrence) ≠ Linearity of (Species Richness))

Mechanistic models are scarce in this literature (n = 2)! We have to
begin somewhere!

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”

Criticisms & Rejoinders
• “Many environmental variables, but especially NPP, show non-linear
relationships with peaks in richness at intermediate levels. This is
not captured by linear models.”
At least at this spatial scale, no evidence for a diversity hump of
avian species richness when plotted with NPP or other variables

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”

Criticisms & Rejoinders
• “Using slopes comparisons will not successfully distinguish between
models with intercorrelated predictor variables.”
Not a problem for these analyses. From an initial set of ~ 100
candidate models (10 variables x 2 algorithms x 5 range size
quartiles), we reduced the set down to only 4 or 5 possible
contenders.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.

Criticisms & Rejoinders
• “The model is not truly mechanistic because it does not model the
sizes of the geographic ranges, only their placement.”
True! Our model takes range sizes as a given and then uses algorithms to
place them in a heterogeneous domain. A more realistic model would
describe the processes of speciation, dispersal, and extinction of an
evolving fauna.
But how can the parameters of such a model (e.g. speciation and dispersal
rates) ever be measured in the real world? Same problems have plagued
most empirical evaluations of the neutral model.
Our models are designed to analyze the data that macroecologists typically
have: gridded maps of environmental variables and species geographic
ranges.

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”

Criticisms & Rejoinders
• “The range cohesion and range scatter models don’t’ seem like they
would give predictions that are any different from just a regression
with the underlying variables themselves. What is the added value of
these simulation models?”
The predictions are not the same. For species with large geographic
ranges, the range cohesion models always fit the data better than
the range scatter models, regardless of which environmental
variable is considered.

Key Differences
Curve-Fitting

Mechanistic
Models

Unit of Study

Species Richness

Underlying geographic ranges

Predicted values

Minimization of residuals (data
dependent)

Algorithms for origin and spread
of geographic ranges (data
independent)

Model Selection Criteria

Smallest number of variables
that reduce residual sum of
squares

Quantitative fit to model
predictions

Statistical Tests

H0: (b = 0) tests for any effect
that is larger than 0

H0: (b = 1.0) tests for
quantitative match between
observed and predicted S

To Be Continued…

Carsten Rahbek.
Perception of Species Richness Patterns:
The Role of Range Sizes