COMMUNICATING DESIGN OF EXPERIMENTS (DOE) TO NON-STATISTICIANS

Steve Schmidt, PhD. - Air Academy Associates, LLC - (719) 531-0777
Ken Case, PhD. - Oklahoma State University - (405) 377-7586





Design of Experiments (DOE) has become one of the most popular statistical techniques of the 1990's. DOE originated in 1920 by a British scientist, Sir R. A. Fisher, as a method to maximize the knowledge gained from experimental data and it has evolved over the last 70 years. Unfortunately, most of the development of DOE was mathematically complex and thus, its use has been restricted to those well versed in mathematics. Recent widespread popularity of DOE is associated with the works of Taguchi [2], a Japanese engineer, who focused on the practical use versus mathematical perfection of the technique. In short, Taguchi's work began a revolution in the presentation of DOE material where mathematical theory is downplayed in order to enhance the clarity and practicality of the subject. Thus, scientists, engineers, technicians, and managers who are not mathematical experts are now becoming experimental design practitioners. However, close scrutiny of the Taguchi method [3] revealed limitations, which has led to the most recent evolution in DOE, a blended approach of Taguchi and classical techniques.

To understand DOE, consider a specific area of science, engineering, physics etc. as a process or activity within nature. Complete or perfect knowledge of any aspect of nature is known only by nature, while scientists, engineers, etc. typically have only some subset of that knowledge. See Figure 1.

The gap between the current level of knowledge and nature's perfect knowledge tends to widen as industry today seeks to understand more complex processes.

The typical engineering, scientific, or physics approach is to spend years of research trying to narrow the gap utilizing theoretical knowledge. DOE, on the other hand, will allow one to quickly narrow the gap through proper planning, designing, data collection, analysis, and confirmation.

 




Consider again nature's perfect knowledge of any process (activity or specified area of science etc.) where there is a true mathematical relationship that can describe process outputs as a function of all the process inputs, e.g. Newton's law of motion F = MA. If nature knows this relationship but we don't, how can we obtain the relationship quickly without getting tied up in years of theoretical research as Newton did.

A simple DOE for 2 inputs such as mass (M) and acceleration (A), each tested at 2 levels, can be set up as the four experimental runs (or trials) of low (Lo) and High (Hi) settings shown in Table 1.

Run

M

A

1

Lo

Lo

2

Lo

Hi

3

Hi

Lo

4

Hi

Hi

Table 1

The DOE set of test conditions is used to interrogate nature about how things perform under certain conditions. The specific set of conditions (or design matrix) is constructed in order to analyze the resulting data for the purpose of building a model to approximate the real model contained in nature (i.e., approximate nature's perfect knowledge). The example in Table 1 is a full factorial of 2 factors each at 2 levels. Several other experimental design types are available to meet your specific objective i.e., linear modeling, non-linear modeling, screening, etc.[1] Assume our experiment generated an average set of Force results for each experimental run shown in Table 2. The low and high settings of Mass (5, 10) and Acceleration (100, 200) are chosen based on ranges of interest to the experimenter.

 

  Input 

Input 

Avg. Output 

Run

M

A

F¯ 

1

5

100

500

2

5

200

1000

3

10

100

1000

4

10

200

2000

Table 2

To maximize the amount of knowledge gained, the modeling analysis of a DOE is conducted on standardized input values where low input settings are coded ( 1) and high input settings are coded (+1). These coded values will standardize the scale and the units of the input variables. The result is a new matrix of coded inputs shown in Table 3.

Run 

M

MxA

1

-1

-1

1

500

2

-1

1

-1

1000

3

1

-1

-1

1000

4

1

1

1

2000

Table 3

Notice the addition of an MxA column in Table 3 which is generated by the product of the coded M and A columns and will be used to analyze an interactive or combined effect of M and A. The columns M, A, and MxA represent the three effects to be evaluated, i.e. the linear effects of M and A and the interaction effect of M with A (MxA). The analysis is conducted as shown in Table 4.

Run 

A

MxA

1

-1

-1

 1

500

2

-1

 1

-1

1000

3

 1

-1

-1

1000

4

 1

  1

 1

2000

Avg (+1)

1500

1500

1250

1125

Avg (-1)

750

750

1000

 

Avg (+1) - Avg (-1) = Delta

750

750

250

Table 4

To obtain the numbers in red (last three rows), you have to do the following for each of the M, A and MxA columns: find the average of the output when the effect column values are 1, find the average of the output when the effect column values are +1, and then find the difference of the two effect averages (Delta). For example, when the M effect column is at 1 (runs 1 and 2) the output is 500 and 1000 which averages to 750.

The model generated from the DOE is built using least squares regression which can be simplified for 2 level designs as shown below:


 where

    F hat is the predicted average for Force
    Mc is the coded variable for mass
    Ac is the coded variable for acceleration
    Avg(Avg. F) is the grand experimental mean
    Delta(M) is the size of the linear effect for M, etc.

(2) Thus, our actual model becomes

(3) Remembering that our prediction model is for coded values (i.e. -1, +1 scale) of M and A, we can transform it into a model of actual Ma and Aa values using the following relationship.


where

    Ma = actual setting value of mass
    Mc = coded setting value of mass
    MH = experimental high setting of actual values
    ML = experimental low setting of actual values

For our example


 
or Ma = 7.5 + 2.5Mc


Thus,

 

The previously described calculations are a simple way to go back and forth from coded to actual setting values, as shown on the combined scale in Figure 2.

A similar transformation occurs for acceleration (A), as shown below.

 
or Aa = 150 + 50 Ac


Thus,

(4) Substituting the actual transformed variables into the equation in (2) will produce the following new model .

 



In our case, we obtained the exact same model as Newton because our F data did not contain noise. In reality, there exists noise or error in experimental data and thus the DOE models will only be approximations to nature's true relationship. But, imagine how quickly Newton could have closed the gap if he would have had DOE in his box of scientific tools!

Since DOE was not developed until the 1900's, Newton never had access to the tool. Newton, however, was not opposed to the use of new tools. For example, when he ran into a road block in his research he developed calculus as a new tool to assist him. It is, therefore, logical to speculate that Newton, if he was living today, would eagerly embrace new tools such as DOE to assist him in gaining knowledge from experimental data.

The previous example used data generated from an F = MA model instead of a physical experiment. In our next example, we will tackle a similar problem using actual experimental data. Suppose we go back in time to Germany during the early 1800 s. In 1826, German scientist George Simon Ohm determined the formula now known as Ohm s law. Ohm's law establishes the mathematical relationship between voltage (V), current (I), and resistance (R) as V = IR. Today, we know that this law applies to both direct and alternating currents, but the formula, as given in its original simple form, applied only to steady DC situations. It is known that Ohm conducted much careful experimental work. Unfortunately, he did not have access to DOE, or his work might have been shortened considerably.

What if we pretend to be ignorant of Ohm's law, and see how Ohm might have proceeded with even the most basic of today's DOE tools. Of course, nature knows those factors affecting voltage, so we will interrogate nature in a methodical way. First, however, we have to establish those experimental variables to include in a test using DOE. Specifically, we need to identify any factors we believe may affect the response of interest in this case "voltage." Suppose the factors in Table 5 are felt to be important; each is addressed as to how it will be incorporated into the DOE:


 

Factor

How factor will be addressed in DOE

Current (I) An experimental variable -use two levels of 0.005 and 0.020 amperes
Resistance (R) An experimental variable use two levels of 100 and 470 ohms.
Wattage of resistor(W) An experimental variable use two levels of 0.25 and 1.0 watts.
Room temperature and humidity Hold constant by running the entire experiment in one sitting. Randomize experimental runs to spread any unknown variation of these over the entire experiment.
Measurement equipment Hold constant by using the same calibrated laboratory-grade instruments, a voltmeter and an ammeter, throughout the experiment.
Technician Hold constant by employing a trained electronics technician to run the entire experiment.
Experimental technique Hold constant by closely following standard operating procedures agreed upon in advance.

Table 5 

We all know that wattage of the resistor (W) should have no effect, but remember that we are pretending to be ignorant of Ohm's law; we also know that room temperature and humidity should have a negligible or imperceptible effect under normal room conditions. The other factors are "real" considerations worthy of treatment as described above.

A full factorial is chosen for this experiment. In a full factorial, all possible combinations of experimental levels of all variables are run. Since there are only three experimental variables of interest, the number of runs in the full factorial equals 23 = 8 which is certainly feasible. This will permit us to evaluate all possible effects and interactions of the three experimental variables (I, R, and W) and to write a mathematical model including any and all contributing terms. The experimental array is shown in Table 6. Note that availability of resistors at particular wattages, safety through low voltages and currents, the need to be within the wattage ratings of all resistors, and instrumentation ranges and precision were major factors used in the selection of experimental variable settings.

Run

Current (I)

Resistance (R)

Wattage (W)

0.005

100

0.25

0.005

100

1.00

0.005

470

0.25

4  

0.005

470

1.00

5  

0.020

100

0.25

0.020

100

1.00

0.020

470

0.25

0.020

470

1.00

Table 6

Table 7 presents the array of coded input values, actual responses, and averages used in the modeling analysis of the DOE. The coded values of I, R, and W are set at 1 for low settings of each experimental variable and at +1 for high settings. The 1 and +1 values in the interaction columns do not reflect experimental settings, but are determined from the product of the coded values in their component columns and are used to analyze the strength of the interactive effect of the variables.

Multiple replications (reps) of the experiment provide five different voltage measurements. These voltages vary somewhat due to the "noise" always involved in experimental research. The "noise" is due to the continued re-establishment of the operating conditions at each of the experimental run settings (different resistors were used for each rep, and current was re-established for each rep). In this experiment, each of the runs was performed in a random order for the first rep. Then, each was performed again in a different random order for the second rep. This was continued throughout the five reps. The term represents the average of the five reps for each of the eight experimental runs. Appended to the bottom of Table 7 (in the shaded area) are the relevant average calculations, similar to those illustrated in Table 4.


 

Effects

Reps (of Voltage V)

 

Run

I

R

IR

W

IW

RW

IRW

Y1

Y2

Y3

Y4

Y5

Y bar

1

-1

-1

1

-1

1

1

-1

0.489

0.503

0.496

0.497

0.520

0.501

2

-1

-1

1

1

-1

-1

1

0.485

0.487

0.518

0.518

0.486

0.499

3

-1

1

-1

-1

1

-1

1

2.328

2.340

2.349

2.441

2.270

2.346

4

-1

1

-1

1

-1

1

-1

2.379

2.327

2.425

2.419

2.375

2.385

5

1

-1

-1

-1

-1

1

1

1.995

2.018

2.012

2.002

1.961

1.998

6

1

-1

-1

1

1

-1

-1

2.204

2.058

2.015

1.971

2.129

2.075

7

1

1

1

-1

-1

-1

-1

9.464 

9.103

9.352

9.481

9.582

9.396

8

1

1

1

1

1

 1

1

9.506

9.255 

9.486

9.382

9.567

9.439

Avg (+1)

5.727

5.892

4.959

3.600

3.590

3.581

3.570

 

 

 

 

 

3.580

Avg (-1)

1.433     

1.268

2.201

3.560

3.569

3.579

3.589

 

 

 

 

 

 

Delta

4.294

4.624

2.758

0.040

0.021

0.002

-0.019 

 

 

 

 

 

 

Table 7 

We can now proceed with the analysis and modeling. First, note the values of Delta in the last row of Table 7. The first three are considerably larger than the rest, indicating that their columns (I, R, and IR) are of substantial importance in modeling voltage, while the other column effects are due to experimental noise and are of negligible importance for the model ( the significance of effects is customarily determined using statistical criteria. In this experiment, the software package Q-Edge [4] was used for analysis and decisions regarding significant effects). Returning to the four step modeling illustration presented earlier for F = MA, the following equations result: 


 
where

   V hat is the predicted average for V
   Ic is the coded variable for current
   Rc is the coded variable for resistance
   Avg. (Avg. V) is the grand experimental mean
   Delta (I) is the size of the linear effect for I, etc.

(2) The actual model becomes

 

(3) Remembering that our prediction model is for coded values (i.e., -1, +1 scale) of I and R, we can transform it into a model using actual Ia and Ra values as follows.

 

 or Ia=0.0125 + 0.0075Ic Thus,




Likewise,

 
or Ra = 285 + 185Rc Thus,


(4) Substituting the actual transformed variables into the V hat equation in (2) will produce the new V hat model.




     
        = 3.580 + 286.267Ia - 3.578 + 0.012Ra - 3.562 - 0.994IaRa - 0.012Ra - 283.254Ia + 3.541

       = - 0.019 + 3.013Ia + 0.000Ra + 0.994IaRa

Under actual experimental conditions, we got a model very close to nature's V = IR. Note that the wattage of the resistor (W) dropped out as insignificant during the analysis and does not appear in the model. Of course, we knew this should happen, but it is always comforting to know that if we had not known that W is not an affecting factor, DOE would have told us! The coefficient for Ia of 3.013 appears large relative to the coefficient of IaRa. However, you must remember that the actual range of Ia is .005 to .020. Thus, the prediction of V is only changing from about .015 to .060 over the range of Ia. Therefore, the most likely reason for a non-zero coefficient for Ia is the low test values (.005 and .020) used in the DOE.

If we were to now use the model to predict the voltage across a resistor of 220 ohms when the current is 0.012 amps, the model would predict V hat = 2.641 volts. Nature (using Ohm's law) knows that the voltage would be V = IR = 2.640 volts. Even though the experimental model did not yield a perfect V = IR, knowing that the voltage is in large part determined by the interaction or combined effect of I and R might have reduced Ohm's research time significantly.

Obviously, not all models in nature are as simple as F = MA and V = IR, however the purpose here is to show how DOE can enable today's scientist, engineers, and physicists to close the gap quickly with a model from experimental data. For more information on evaluating non-linear relationships and/or large numbers of input variables, see a good text on DOE!


References

  1. S. R. Schmidt and R. G. Launsby, Understanding Industrial Designed Experiments, 4th ed., Air Academy Press, Colorado Springs, CO, 1994.
  2. G. Taguchi, System of Experimental Design, Kraus International Publications, White Plains, NY, 1987.
  3. J. J. Pignatiello, "An Overview of the Strategy and Tactics of Taguchi" IIE Transactions, vol. 20, 1988, pp. 247 - 254.
  4. Q-Edge (The Quality Edge), Statistical Software, Mike Bishop, Air Academy Press, Colorado Springs, CO.