RESPONSE SURFACE METHODOLOGY (R S M) Mariam MAHFOUZ

Download Report

Transcript RESPONSE SURFACE METHODOLOGY (R S M) Mariam MAHFOUZ

RESPONSE SURFACE
METHODOLOGY
(R S M)
Par
Mariam MAHFOUZ
Remember that:
General Planning


Part I
A - Introduction to the RSM method
B - Techniques of the RSM method
C - Terminology
D - A review of the method of least squares
Part II
A - Procedure to determine optimum
conditions – Steps of the RSM method
B – Illustration of the method with an example
2
3
A - Procedure to determine optimum conditions
steps of the method
This method permits to find the settings of the
input variables which produce the most desirable
response values.
The set of values of the input variables which
result in the most desirable response values is
called the set of optimum conditions.
4
Steps of the method
The strategy in developing an empirical model through
a sequential program of experimentation is as follows:
1.
The simplest polynomial model is fitted to a set of data
collected at the points of a first-order design.
2.
If the fitted first-order model is adequate, the information
provided by the fitted model is used to locate areas in the
experimental region, or outside the experimental region,
but within the boundaries of the operability region, where
more desirable values of the response are suspected to
be.
5
3.
In the new region, the cycle is repeated in that
the first-order model is fitted and testing for
adequacy of fit.
4.
If nonlinearity in the surface shape is detected
through the test for lack of fit of the first-order
model, the model is upgraded by adding crossproduct terms and / or pure quadratic terms to it.
The first-order design is likewise augmented with
points to support the fitting of the upgraded
model.
6
5.
If curvature of the surface is detected and a
fitted second-order model is found to be
appropriate, the second-order model is used
to map or describe the shape of the surface,
through a contour plot, in the experimental
region.
6.
If the optimal or most desirable response
values are found to be within the boundaries
of the experimental region, then locating the
best values as well as the settings of the input
variables that produce the best response
values.
7
7. Finally, in the region where the most desirable
response values are suspected to be found,
additional experiments are performed to
verify that this is so.
8
B- Illustration of the method with an
example
For simplicity of presentation we shall
assume that there is only one response
variable to be studied although in practice
there can be several response variables that
are under investigation simultaneously.
9
Experience
Two controlled
Factors
One response
Temperature (X1)
Chemical
reaction
percent yield
Time (X2)
An experimenter, interested in determining if an
increase in the percent yield is possible by varying
the levels of the two factors.
10
Two levels of temperature: 70° and 90°.
Two levels of time: 30 sec and 90 sec.
Four temperature-time settings
(factorial combinations)
Four
different
design
points
And two repetitions at each point
The total number of observations is N = 8
11
Detail
The response of interest is the percent
yield, which is a measure of the purity of the
end product.
The process currently operates in a
range of percent purity between 55 % and 75
%, but it is felt that a higher percent yield is
possible.
12
Design 1
Original variables
Coded variables
Percent yield
Temperature
X1 (C°)
Time
X2 (sec.)
x1
x2
Y
70
30
-1
-1
49.8
48.1
-1
57.3
52.3
1
65.7
69.4
1
73.1
77.8
90
70
90
30
90
90
1
-1
1
x1 and x2 are the coded variables which are defined as:
X 1  80
x1 
10
x2 
X 2  60
30
13
Representation of the first design
14
First-order model
Expressed in terms of the coded
variables, the observed percent yield values
are modeled as:
Y   0  1 x1   2 x2  
The remaining term, , represents random
error in the yield values.
The eight observed percent yield values,
when expressed as function of the levels of
the coded variables, in matrix notation, are:
Y=X+
15
Matrix form
Vector of response
values
 49.8
 48.1


 57.3


52
.
3


65.7 


69.4 
 73.1


 77.8
=
1
1

1

1
1

1
1

1
1
1
1
1
1
1
1
1
Vector of error terms
Matrix of the design
 1
 1
 1

 1
 1

 1
 1

 1
 0 
 
 1 +
  2 
 1 
 
 2
 3 
 
 4 
 5 
 
 6 
 
 7
 8 
Vector of unknown parameters
16
Estimations
The estimates of the coefficients in the firstorder model are found by solving the normal
equations:
X Xb  X Y
The estimates are:
61.6875
1

b   X X  X Y  
3
.
4375



 9.8125 

The fitted first-order model in the coded variables is:
Yˆ ( x)  61.6875  3.4375 x1  9.8125 x 2
17
ANOVA table – design 1
H 0 : 1   2  0
Source
Degrees of
freedom
d.f.
Sum of
squares
SS
Mean
square
F
Model
2
864.8125
432.4063
63.71
Residual
5
33.9363
6.7873
Test of
adequacy
Lack of fit
1
2.1013
2.1013
Pure error
4
31.8350
7.9588
0.264
18
Individual tests of parameters
To do that the Student-test is used.
For the test of: H 0 : 1  0 we have t 
And for H 0 :  2  0
we have
t
b1  0
 3.73
var( b1 )
b2  0
 10.65
var( b2 )
Each of the null hypotheses is rejected at the  =
0.05 level of significance owing to the calculated
values, 3.73 and 10.65, being greater in absolute
value than the tabled value,
T5;0.025 =2.571.
19
Conclusion of the first analysis
The first – order model is adequate.
That both temperature and time have an effect on
percent yield.
Since both b1 and b2 are positive, the effects are
positive.
Thus, by raising either the temperature or time of
reaction, this produced a significant increase in
percent yield.
20
Second stage of the sequential program
At this point, the experimenter quite naturally
might ask:
If additional experiments can be performed
At what settings of temperature and time should
the additional experiments be run?”
To answer this question, we enter the second
stage of our sequential program of
experimentation.
21
Contour plots
The fitted model: Yˆ ( x)  61.6875  3.4375 x1  9.8125 x 2
can now be used to map values of the estimated response
surface over the experimental region.
This response surface is a hyper-plane; their contour
plots are lines in the experimental region.
The contour lines are drawn by connecting two points
(coordinate settings of x1 and x2) in the experimental
region that produce the same value of Yˆ
22
In the figure above are shown the contour lines
of the estimated planar surface for percent yield
corresponding to values of Yˆ ( x ) = 55, 60, 65 and 70 %.
23
Performing experiments along the path of
steepest ascent
To describe the method of steepest ascent
mathematically, we begin by assuming the true
response surface can be approximated locally with an
equation of a hyper-plane
k
   0    i xi
i 1
Data are collected from the points of a first-order
design and the data are used to calculate the coefficient
estimates to obtain the fitted first-order model
k
Yˆ ( x)  b0   bi xi
i 1
Estimated response function
24
The next step is to move away from the center of the
design, a distance of r units, say, in the direction of the
maximum increase in the response.
By choosing the center of the design in the coded
variable to be denoted by O(0, 0, …, 0), then movement
away from the center r units is equivalent to find the
k
values of x1 , x 2 ,..., x k which maximize Yˆ ( x)  b0   bi xi
k
subject to the constraint
x
i 1
2
i
 r
2
i 1
Maximization of the response function is performed
by using Lagrange multipliers. Let
 k 2

Q( x1 , x 2 ,..., x k )  b0   bi xi     xi  r 2 
i 1
 i 1

k
where  is the Lagrange multiplier.
25
26
To maximize Yˆ ( x ) subject to the above-mentioned
constraint, first we set equal to zero the partial derivatives
Q( x1 , x 2 ,..., x k )
xi
i=1,…,k and
Q ( x1 , x 2 ,..., x k )

Setting the partial derivatives equal to zero produces:
Q( x1 , x2 ,..., xk )
 bi  2xi  0
xi
i =1,…,k, and
k
Q ( x1 , x 2 ,..., x k )
   xi2  r 2  0

i 1
The solutions are the values of xi satisfying
or
xi
 2
bi
xi 
bi
2
i = 1,…,k, where the value of  is yet to be
determined. Thus the proposed next value of xi is directly
proportional to the value of bi.
27
Let us the change in Xi be noted by i , and the
change in xi be noted by i. The coded variables is
obtained by these formulas x  X  X
where X i
i
i
i
si
(respectively si) is the mean (respectively the
standard deviation) of the two levels of Xi .
Thus
i 
xi   i 
i
si
( X i  i )  X i
si
or
, then
 i  si  i
28
Let us illustrate the procedure with the fitted firstorder model: Yˆ ( x)  61.6875  3.4375 x  9.8125 x
1
2
that was fitted early to the percent yield values in our
example.
To the change in X2, 2=45 sec. corresponds the
change in x2, 2=45/30=1.5 units.
In the relation
1
b1

2
b2
x1
x2

b1
b2
1
, thus
3.4375

, we can substitute i to xi:
1.5
9.8125
and 1 = 0.526, so
1=0.526*10=5.3°C .
29
Points along the path of steepest ascent and
observed percent yield values at the points
Temperature
X1 (°C)
Time
X2 (sec.)
Base
80.0
60
i
5.3
45
Base + i
85.3
105
74.3
Base + 1.5 i
87.95
127.5
78.6
Base + 2 I
90.6
150
83.2
Base + 3 I
95.9
195
84.7
Base + 4 i
101.2
240
80.1
Observed
percent yield
30
31
Sequence of experimental trials performed in
moving to a region of high percent yield values
Design two: For this design the coded variables are
Time( X 2 )  195
Temperature( X 1 )  95.9
defined as:
x

2
x 
1
30
10
x1
x2
X1
X2
% yield
-1
-1
85.9
165
82.9; 81.4
+1
-1
105.9
165
87.4; 89.5
-1
+1
85.9
225
74.6; 77.0
+1
+1
105.9
225
84.5; 83.1
0
0
95.9
195
84.7; 81.9
32
33
The fitted model corresponding to the group of experiments
of design two is: Yˆ ( x )  82.70  3.575 x1  2.750 x 2
The corresponding analysis of variance is:
Source
d.f.
SS
MS
F
Model
2
162.745
81.372
42.34
Residual
7
13.455
1.922
Lack of fit
2
2.345
1.173
Pure error
5
11.110
2.222
Total
(variations)
9
176.2
The model is jugged adequate
0.53
34
sequence of experimental trials that were
performed in the direction two:
Steps
x1
x2
X1
X2
% yield
1
Center + i
+1
- 0.77
105.9
171.9
89.0
2
Center + 2 i
+2
- 1.54
115.9
148.8
90.2
3
Center + 3 i
+3
- 2.31
125.9
125.7
87.4
4
Center + 4 i
+4
- 3.08
135.9
102.6
82.6
35
36
Retreat to center + 2 i and proceed in
direction three
Steps
x1
x2
X1
X2
% yield
5
Replicated 2
+2
- 1.54
115.9
148.8
91.0
6
+3
- 0.77
125.9
171.9
93.6
7
+4
0
135.9
195
96.2
8
+5
0.77
145.9
218.1
92.9
37
Set up design three using points of steps 6, 7,
and 8 along with the following two points:
Steps
x1
x2
X1
X2
% yield
9
+3
0.77
125.9
218.1
91.7
10
+5
- 0.77
145.9
171.9
92.5
11
Replicated 7
+4
0
135.9
195
97.0
38
39
Design three was set up using the point at step 7 as its
center. It includes steps 6 – 11. If we redefine the coded
variables:
Temperature( X 1 )  135.9
x1 
10
and
x1 
Time( X 2 )  195
23.1
then the fitted first-order model is:
Yˆ ( x)  93.983  0.025 x1  0.375 x 2
The corresponding analysis of variance table is:
40
ANOVA table
source
d.f.
SS
MS
F
Model
2
0.5650
0.2825
0.04
Residual
3
22.1833
7.3944
total
5
22.7483
It is obvious from the ANOVA table that the least
model does not explain a significant amount of the
overall variation in the percent yield values, and it is
necessary to fit a curved surface.
41
Fitting a second-order model
A second-order model in k variables is of the form:
k
k
i 1
i 1
Y   0    i xi    ii xi2    ij xi x j  
i j
The number of terms in the model above is
p=(k+1)(k+2)/2; for example, when k=2 then p=6.
Let us return to the chemical reaction example of the
previous section. To fit a second-order model (k=2), we
must perform some additional experiments.
42
Central composite rotatable design
Suppose that four additional experiments are
performed, one at each of the axial settings (x1,x2):
(
2 ,0); ( 
2 ,0); (0,
2 ); (0,
2)
These four design settings along with the four factorial
settings: (-1,-1); (-1,1); (1,-1); (1,1) and center point
comprise a central composite rotatable design.
The percent yield values and the corresponding nine
design settings are listed in the table below:
43
Central composite rotatable design
44
Percent yield values at the nine points of a
central composite rotatable design
45
The fitted second-order model, in the coded
variables, is:
Yˆ ( x)  96.6  0.03x1  0.31x2  1.98 x12  1.83x22  0.58 x1 x2
The analysis is detailed in this table, using
the RSREG procedure in the SAS software:
46
SAS output 1
Yˆ ( x)  96.6  0.03x1  0.31x2  1.98 x12  1.83x22  0.58 x1 x2
47
SAS output 2
48
Response surface and the contour plot
49
More explanations




The contours of the response surface, showing
above, represent predicted yield values of 95.0 to
96.5 percent in steps of 0.5 percent.
The contours are elliptical and centered at the point
(x1; x2)=(- 0.0048; - 0.0857)
or (X1; X2)=(135.85°C; 193.02 sec).
The coordinates of the centroid point are called the
coordinates of the stationary point.
From the contour plot we see that as one moves
away from the stationary point, by increasing or
decreasing the values of either temperature or time,
the predicted percent yield (response) value
decreases.
50
51
Determining the coordinates of the stationary point

A near stationary region is defined as a region where the
surface slopes (or gradients along the variables axes) are
small compared to the estimate of experimental error.

The stationary point of a near stationary region is the point
at which the slope of the response surface is zero when
taken in all direction.

The coordinates of the stationary point x0  ( x10 , x20 ,..., xk 0 )
are calculated by differentiating the estimated response
equation with respect to each xi, equating these derivatives
to zero, and solving the resulting k equations
simultaneously.
52
Remember that the fitted second-order model in k
variables is:
k
k
i 1
i 1
Yˆ ( x)  b0   bi xi   bii xi2   bij xi x j
i j
To obtain the coordinates of the stationary point,
let us write the above model using matrix notation,
as:
Yˆ ( x)  b0  xb  xBx
53
Yˆ ( x)  b0  xb  xBx
where
 x1 
x 
 2
 . 
x 
 . 
 . 
 
 xk 
 b1 
b 
 2
.
b 
.
.
 
bk 
and
b12

b11
2


b22

B


symetric



.
.
.
.
.
.
.
.
.
b1k 
2 
b2 k 

2 
. 
. 

. 
bkk 
54
Some details
The partial derivatives of Yˆ ( x ) with respect to x1, x2, …, xk
are :
k

Yˆ ( x )
 b1  2b11 x1   b1 j x j 
x1
j 2

k

Yˆ ( x )
 b2  2b22   b2 j x j 
x2
j2

.
  b  2 Bx

.

.

k 1

Yˆ ( x )
 bk  2bkk   bkj x j 
xk
j 1

55
More details
Setting each of the k derivatives equal to zero and
solving for the values of the xi, we find that the
coordinate of the stationary point are the values of the
elements of the kx1 vector x0 given by:
x0
B 1b

2
At the stationary point, the predicted response
value, denoted by Ŷ0 , is obtained by substituting x0
for x:
'
x
'
'
0b
ˆ
Y0  b0  x0b  x0 Bx 0  b0 
2
56
Return to our example
The fitted second-order model was:
Yˆ ( x)  96.6  0.03 x1  0.31x2  1.98 x12  1.83 x22  0.58 x1 x2
 1.98127
B
 0.28750
0.28750 
 1.83127
B
1
 0.51650

 0.08109
 0.08109
 0.5588 
so the stationary point is:
1
x0   B 1b 
2
 0.00486
  0.08568


In the original variables, temperature and time of the
chemical reaction example, the setting at the stationary
point are: temperature=135.85°C and time=193.02 sec.
And the predicted percent yield at the stationary point
is:
Yˆ  96.60  0.013  96.613
0
57
Moore details
Note that the elements of the vector x0 do not tell us
anything about the nature of the surface at the stationary
point.
This nature can be a minimum, a maximum or a
mini_max point.
For each of these cases, we are assuming that the
stationary point is located inside the experimental region.
When, on the other hand, the coordinates of the
stationary point are outside the experimental region, then
we might have encountered a rising ridge system or a
falling ridge system, or possibly a stationary ridge.
58
59
Nest Step
The next step is to turn our attention to
expressing the response system in
canonical form so as to be able to describe
in greater detail the nature of the response
system in the neighborhood of the
stationary point.
60
The canonical Equation of a Second-Order
Response System
The first step in developing the canonical equation
for a k-variable system is to translate the origine of the
system from the center of the design to the stationary
point, that is, to move from (x1,x2,…,xk)=(0,0,…,0) to x0.
This is done by defining the intermediate variables
(z1,z2,…,zk)=(x1-x10,x2-x20,…,xk-xk0) or z=x-x0.
Then the second-order response equation is
expressed in terms of the values of zi as:
Yˆ ( z )  b0  ( z  x0 )b  ( z  x0 ) B ( z  x0 )
 Yˆ0  z Bz
61
Now, to obtain the canonical form of the predicted
response equation, let us define a set of variables
w1,w2,…,wk such that W’=(w1,w2,…,wk) is given by
W  M z
where M is a kxk orthogonal matrix whose columns
are eigenvectors of the matrix B.
The matrix M has the effect of diagonalyzing B, that
is, where 1,2,…,k are the corresponding eigenvalues
of B.
M BM  diag (1 , 2 ,..., k )
The axes associated with the variables w1,w2,…,wk
are called the principal axes of the response system.
This transformation is a rotation of the zi axes to
form the wi axes.
62
63
So we obtain the canonical equation:
Yˆ (W )  Yˆ0 W  M  B MW
 Yˆ   w 2  ...   w 2
0
1
1
k
k
The eigenvalues i are real-valued (since the matrix B
is a real-valued, symmetric matrix) and represent the
coefficients of the terms in the canonical equation.
It is easy to see that if 1,2,…,k are:
1) All negative, then at x0 the surface is a maximum.
2) All positive, then at x0 the surface is a minimum.
3) Of mixed signs, that is, some are positive and the
others are negative, then x0 is a saddle point of
the fitted surface.
The canonical equation for the percent yield surface is:
Yˆ  96.613  1.6091w12  2.2034 w22
64
Moore details
The magnitude of the individual values of the i tell
how quickly the surface height changes along the Wi
axes as one moves away from x0.
Today there are computer software packages available
that perform the steps of locating the coordinates of the
stationary point, predict the response at the stationary
point, and compute the eigenvalues and the
eigenvectors.
65
For example, the solution for optimum response
generated from PROC RSREG of the Statistical Analysis
System (SAS) for the chemical reaction data, is in
following table:
66
Recapitulate
Process to
optimize
Contours and optimal direction
Input and output
variables
Experiments in the
Optimal direction
Experimental and
Operational regions
Locate a new
Experimental region
Series of experiments
New series of experiments
Yes
Fitting
First-order
model
Fitting
First-order
model
Yes
Model
Adequate ?
No
Fitting a
Second-order
model
Model
Adequate ?
No
67
Doing this:

Coordinates of the stationary point.

Description of the shape of the response surface
near the stationary point by contour plots.

Canonical analysis.

If needed, Ridge analysis (not detailed here).
68
Field of use of the method
In agriculture
 In food industry
 In pharmaceutical industry
 In all kinds of the light and heavy
industries
 In medical domain


Etc….
69
Bibliography

André KHURI and John CORNELL: “Response
Surfaces – Designs and Analyses” , Dekker, Inc., ASQC
Quality Press, New York.

Irwin GUTTMAN: “Linear Models: An Introduction”,
John Wiley & Sons, New York.

George BOX, William HUNTER & J. Stuart HUNTER:
“Statistics for experimenters: An Introduction to
Design, Data Analysis, and Model Building” , John
Wiley & Sons, New York.

George BOX & Norman DRAPPER: “Empirical ModelBuilding and Response Surfaces” , John Wiley & Sons,
New York.
70
71