Chapter 4 Randomized Blocks, Latin Squares, and Related
Download
Report
Transcript Chapter 4 Randomized Blocks, Latin Squares, and Related
Chapter 4 Randomized Blocks, Latin
Squares, and Related Designs
1
4.1 The Randomized Complete
Block Design
• Nuisance factor: a design factor that probably has
an effect on the response, but we are not interested
in that factor.
• If the nuisance variable is known and
controllable, we use blocking
• If the nuisance factor is known and
uncontrollable, sometimes we can use the
analysis of covariance (see Chapter 14) to
remove the effect of the nuisance factor from the
analysis
2
• If the nuisance factor is unknown and
uncontrollable (a “lurking” variable), we hope
that randomization balances out its impact across
the experiment
• Sometimes several sources of variability are
combined in a block, so the block becomes an
aggregate variable
3
• We wish to determine whether 4 different tips
produce different (mean) hardness reading on a
Rockwell hardness tester
• Assignment of the tips to an experimental unit;
that is, a test coupon
• Structure of a completely randomized experiment
• The test coupons are a source of nuisance
variability
• Alternatively, the experimenter may want to test
the tips across coupons of various hardness levels
• The need for blocking
4
• To conduct this experiment as a RCBD, assign all
4 tips to each coupon
• Each coupon is called a “block”; that is, it’s a
more homogenous experimental unit on which to
test the tips
• Variability between blocks can be large,
variability within a block should be relatively
small
• In general, a block is a specific level of the
nuisance factor
• A complete replicate of the basic experiment is
conducted in each block
• A block represents a restriction on
randomization
5
• All runs within a block are randomized
• Suppose that we use b = 4 blocks:
• Once again, we are interested in testing the
equality of treatment means, but now we have to
remove the variability associated with the
nuisance factor (the blocks)
6
Statistical Analysis of the RCBD
• Suppose that there are a treatments (factor levels)
and b blocks
• A statistical model (effects model) for the RCBD
is
i 1, 2,..., a
yij i j ij
j 1, 2,..., b
– is an overall mean, i is the effect of the ith
treatment, and j is the effect of the jth block
– ij ~ NID(0,2)
k
– a
i 0, j 0
i 1
j 1
7
• Means model for the RCBD
yij ij ij , ij i j
• The relevant (fixed effects) hypotheses are
H 0 : 1 2
a where i (1/ b) j 1 ( i j ) i
b
• An equivalent way for the above hypothesis
H0 :1 2 a 0
• Notations:
b
y i y ij , i 1,...,a
j 1
a
y j y ij , j 1,...,b
i 1
a
b
b
a
j 1
i 1
y y ij y j y i
i 1 j 1
y i y i / b, y j y j / a, y y / N
8
• ANOVA partitioning of total variability:
a
b
a
b
2
(
y
y
)
ij .. [( yi. y.. ) ( y. j y.. )
i 1 j 1
i 1 j 1
( yij yi. y. j y.. )]2
a
b
b ( yi. y.. ) a ( y. j y.. ) 2
2
i 1
a
j 1
b
( yij yi. y. j y.. ) 2
i 1 j 1
SST SSTreatments SS Blocks SS E
9
•
•
•
•
SST = SSTreatment + SSBlocks + SSE
Total N = ab observations, SST has N – 1 degrees
of freedom.
a treatments and b blocks, SSTreatment and SSBlocks
have a – 1 and b – 1 degrees of freedom.
SSE has ab – 1 – (a – 1) – (b – 1) = (a – 1)(b – 1)
degrees of freedom.
From Theorem 3.1, SSTreatment /2, SSBlocks / 2 and
SSE / 2 are independently chi-square
distributions.
10
• The expected values of mean squares:
a
E ( MSTreatment ) 2
b i2
i 1
a 1
b
E ( MS Blocks ) 2
a j2
j 1
b 1
E ( MS E ) 2
• For testing the equality of treatment means,
MSTreatments
F0
~ Fa 1,( a 1)(b1)
MS E
11
• The ANOVA table
• Another computing formulas:
2
2
a
y
y
1
SST y ij2 , SSTreatments y i2
N
b i 1
N
i 1 j 1
a
SSBlocks
b
1 b 2 y2
y j
, SSE SST SSTreatments SSBlocks
a j 1
N
12
• Example 4.1
4.1.2 Model Adequacy Checking
• Residual Analysis
• Residual: eij yij yˆ ij yij yi y j y
• Basic residual plots indicate that normality,
constant variance assumptions are satisfied
• No obvious problems with randomization
13
ESIGN-EXPERT Plot
ardness
Normal plot of residuals
99
95
Norm al % probability
90
80
70
50
30
20
10
5
1
-1
-0.375
0.25
0.875
1.5
Res idual
14
DESIGN-EXPERT Plot
Residuals vs. Run
Residuals vs. Predicted
Hardness
1.5
1.5
0.875
0.875
Res iduals
Res iduals
EXPERT Plot
2
0.25
0.25
-0.375
-0.375
2
-1
-1
-2.75
-0.31
2.13
Predicted
4.56
7.00
1
4
7
10
13
16
Run Num ber
15
• Can also plot residuals versus the type of tip
(residuals by factor) and versus the blocks. Also
plot residuals v.s. the fitted values. Figure 4.5 and
4.6 in Page 137
• These plots provide more information about the
constant variance assumption, possible outliers
4.1.3 Some Other Aspects of the Randomized
Complete Block Design
• The model for RCBD is complete additive.
i 1, 2,..., a
yij i j ij
j 1, 2,..., b
16
• Interactions?
• For example:
E( yij ) i j ln E( yij ) ln ln i ln j
• The treatments and blocks are random.
• Choice of sample size:
– Number of blocks , the number of replicates
and the number of error degrees of freedom
a
–
b i2
2 i 1 2
a
17
• Estimating miss values:
– Approximate analysis: estimate the missing
values and then do ANOVA.
– Assume the missing value is x. Minimize SSE to
find x
ay' by' y '
x
– Table 4.8
– Exact analysis
i
j
(a 1)(b 1)
18
4.1.4 Estimating Model Parameters and the General
Regression Significance Test
• The linear statistical model
i 1, 2,..., a
yij i j ij
j 1, 2,..., b
• The normal equations
abˆ
bˆ
bˆ1
bˆ1
bˆ
a
aˆ1
ˆ
1
ˆ1
aˆ
bˆa
bˆa
ˆa
ˆ1
ˆ
1
aˆb
ˆ
b
y
y1
aˆ1
ˆb
y a
y1
ˆa
aˆb
yb
19
• Under the constraints,
a
ˆ
the solution is
i
i 1
b
0, ˆ j 0
j 1
ˆ y ,ˆi yi y , ˆ j y j y
and the fitted values,
yˆ ij ˆ ˆi ˆ j yi y j y
• The sum of squares for fitting the full model:
2
2
2
b y
y
y
j
R( , , ) ˆy ˆi yi ˆ j y j i
ab
i 1
j 1
i 1 b
j 1 a
a
b
a
• The error sum of squares
2
SSE yij2 R( , , ) yij yi y j y
a
b
i 1 j 1
a
b
i 1 j 1
20
• The sum of squares due to treatments:
yi2 y2
R( | , )
ab
i 1 b
a
21
4.2 The Latin Square Design
• RCBD removes a known and controllable
nuisance variable.
• Example: the effects of five different formulations
of a rocket propellant used in aircrew escape
systems on the observed burning rate.
– Remove two nuisance factors: batches of raw
material and operators
• Latin square design: rows and columns are
orthogonal to treatments.
22
• The Latin square design is used to eliminate two
nuisance sources, and allows blocking in two
directions (rows and columns)
• Usually Latin Square is a p p squares, and each
cell contains one of the p letters that corresponds
to the treatments, and each letter occurs once and
only once in each row and column.
• See Page 145
23
• The statistical (effects) model is
i 1, 2,..., p
yijk i j k ijk j 1, 2,..., p
k 1, 2,..., p
– yijk is the observation in the ith row and kth
column for the jth treatment, is the overall
mean, i is the ith row effect, j is the jth
treatment effect, k is the kth column effect and
ijk is the random error.
– This model is completely additive.
– Only two of three subscripts are needed to
denote a particular observation.
24
• Sum of squares:
SST = SSRows + SSColumns + SSTreatments + SSE
• The degrees of freedom:
p2 – 1 = p – 1 + p – 1 + p – 1 + (p – 2)(p – 1)
• The appropriate statistic for testing for no
differences in treatment means is
MSTreatments
F0
~ Fp 1,( p 2)( p 1)
MS E
• ANOVA table (Table 4-10) (Page 146)
• Example 4.3
25
• The residuals
eijk yijk yˆ ijk yijk yi y j yk 2 y
• Table 4.13
• If one observation is missing,
p( yi' y' j y'k ) 2 y'
yijk
( p 2)( p 1)
• Replication of Latin Squares:
– Three different cases
– See Table 4.14, 4.15 and 4.16
• Crossover design: Pages 150 and 151
26
4.3 The Graeco-Latin Square Design
• Graeco-Latin square:
– Two Latin Squares
– One is Greek letter and the other is Latin letter.
– Two Latin Squares are orthogonal
– Table 4.18
– Block in three directions
– Four factors (row, column, Latin letter and
Greek letter)
– Each factor has p levels. Total p2 runs
27
• The statistical model:
yijkl i j k l ijkl , i, j, k , l 1,, p
– yijkl is the observation in the ith row and lth
column for Latin letter j, and Greek letter k
– is the overall mean, i is the ith row effect, j
is the effect of Latin letter treatment j , k is the
effect of Greek letter treatment k, l is the
effect of column l.
– ANOVA table (Table 4.19)
– Under H0, the testing statistic is Fp-1,(p-3)(p-1)
distribution.
• Example 4.4
28
4.4 Balance Incomplete Block
Designs
• May not run all the treatment combinations in
each block.
• Randomized incomplete block design (BIBD)
• Any two treatments appear together an equal
number of times.
• There are a treatments and each block can hold
exactly k (k < a) treatments.
• For example: A chemical process is a function of
the type of catalyst employed. See Table 4.22
29
4.4.1 Statistical Analysis of the BIBD
• a treatments and b blocks. Each block contains k
treatments, and each treatment occurs r times.
There are N = ar = bk total observations. The
number of times each pairs of treatments appears
in the same block is r (k 1)
(a 1)
• The statistical model for the BIBD is
yij i j ij
30
• The sum of squares
SST SSTreatments SSBlocks SSE
SST y ij2 y2 / N
i
SSBlocks
j
1 b 2
y j y2 / N
k j 1
a
SSTreatments ( adjusted )
k Qi2
i 1
a
1 b
, Qi y i nij y j
k j 1
SSE SST SSTreatments ( adjusted ) SSBlocks
31
• The degree of freedom:
– Treatments(adjusted): a – 1
– Error: N – a – b – 1
• The testing statistic for testing equality of the
treatment effects:
MSTreatments ( adjusted )
F0
MS E
• ANOVA table (see Table 4.23)
• Example 4.5
32
4.4.2 Least Squares Estimation of the Parameters
• The least squares normal equations:
a
b
i 1
j 1
: Nˆ r ˆi k ˆ j y
b
i : rˆ rˆi nij ˆ j y i
j 1
a
j : kˆ nijˆi kˆ j y j
i 1
• Under the constrains,
a
ˆ
we have ˆ y
i 1
i
b
0, ˆ j 0
j 1
33
• For the treatment effects,
r (k 1)ˆi
ˆi
a
ˆ
p 1, p i
p
kQi
kQi
, i 1,2,, a
a
34