Statistical Experiment Design
Download
Report
Transcript Statistical Experiment Design
Statistical Experimental
Design
A Primer
by
H. B. Oblad (Bruce)
Getting Answers Easier - Overview
The Old Method
The Better Method
Simple Statistics for the Lab
Let’s Try It Out!
The Old Method
Experiments one variable at a time in
sequence. Effect of Temperature on Yield
Yield, wt %
Pressure = 1000 psi
Time = 20 min
Temperature, °C
Next Set of Experiments
Yield, wt %
• Effect of Pressure
Temperature = 300 °C
Time = 20 min
Pressure, psi
More Experiments
Yield, wt %
• Effect of Time on Yield
Temperature = 300 °C
Pressure = 1000 psi
Time, min
What Have We Learned?
Pressure
• 13 Experiments in 3 Factors
Temp
• What combinations of conditions have we
covered? What’s still unknown?
• Do we know anything about the
repeatability of our lab technique?
• Are the responses straight or curved?
• Can we build a meaningful model that
leads to a mechanism?
• Could we have done less work and gotten
more information?
• Minor information about effects of factors.
• Know nothing about interactions.
Pressure
A Smarter Way
Temp
2-Level Factorial Design
• 8 Tests (XY X = levels, Y= factors)
• Now know what happens over a large
experimental volume.
• Now know the effects of factors at two
surfaces. Effect of factors tested 4 x each
• Some information about interactions
between factors.
• Repeatability is still unknown.
• Curvature?
Pressure
An Even
Smarter Way
Temp
2-Level Factorial Design
w/ Center Points
• 11 Tests (3 cntr pts), 13 Tests (5 cntr pts)
• Now know the effects of factors at two
surfaces and within the volume.
• More information about interactions
between factors.
• Repeatability is now estimated or known.
• Curvature can be estimated.
• Predictive model is easy to create.
Box-Behnken Design
3 Factor, 3 Level
A fractional factorial design
Spherical, so extrapolation is less risky. 15 tests (3 cp), 17 tests (5 cp)
Simple Statistics
• Bell Curve = Normal Dist. = Gaussian Dist.
• Total population or very large sample
• Errors in lab methodology are assumed random
and normally distributed except for time. Must
randomize order to bury effect of time into error.
• Repeated tests may be pooled to estimate std.
dev. and variance.
Bell Curve = Normal Dist.
68% of area is
<>+/-1 std. dev.
94% of area us
<>+/- 2 std. dev.
99% of area is
<>+/- 3 std. dev.
Means Testing
• If the means and standard deviations of
the measurements are equal, the things
being measured are of the same
population. Opposite is true also (null
hyp.) Use Student’s t-test.
Means Testing
• If the means are the same, the things are
of the same population. Use Welch’s t-test
Analysis of Variance
(ANOVA)
• Variance (standard deviation2) of means of
several sample groups is determined by Ftest. Probability criterion is used for
pass/fail or probability of F being equal is
given.
Factors, Responses
and Interactions
• Numeric Factors are variable inputs to a process
e.g. feed rate, temperature, pressure,
component concentration, knobs, levers
• Categorical Factors are discrete inputs e.g.
catalyst type, feed material, operator
• Responses are effects of changes in factors e.g.
Reaction rate increases w/ temp.
• Factors that affect each other are said to interact
e.g. drinking, driving, vs drunken driving
Rubber Band Experiment
•
•
•
•
•
•
•
What affects the distance traveled?
Factors? How many?
Numeric or categorical?
Which design to use?
Can we make a predictive model?
Any interaction of factors?
Can we understand the problem better?