Choosing the Right DOE Software

by H. Stith Bennett

Have you ever heard a quality colleague say something like "We did a DOE on it" or "Our people do DOEs all the time"? Every time I hear quality lingo used this way, I have contradictory feelings. I'm optimistic because experimentation seems to have achieved some measure of popular acceptance in the quality world. At the same time, this trendy use of a three-letter acronym promotes confused thinking regarding experiments.

Conventionally, DOE stands for design of experiments. So the phrases I mentioned previously become "We did a design of experiments on it" and "Our people do design of experiments all the time." I find such language confusing. The implication is that once an experiment is designed, it's done.

Can quality practitioners understand enough about the complex topic of designed experiments to use them effectively? Can computers help with experiments? Quality Digest's "DOE Software Buyers Guide" provides a way to begin sorting it all out. You will get the most out of it if you understand some of the frequently asked questions about experiments. The following are some basic questions to ponder.

What makes an experiment? Consider that all experiments have some rudiments in common. There are input variables, and there are outcome variables. The experimenter changes the input values and observes what, if anything, happens to the outcome values. All of us do this kind of thing every day. You change something and you see what happens. Usually, you change one thing at a time.

Although you can occasionally gain insights in this way, it is generally a poor way to gain knowledge. Nevertheless, in many business and industrial settings, this approach has historically been called "experimenting."

Designed experiments provide a much better way of going about things. In fact, you can think of experimental designs as ways to "fix up" everyday experiments so they succeed. When working with experimental designs, expect to pay in increased time and involvement to resolve your knowledge problems. But realize that the solution is designed experiments.

What kinds of experimental designs exist? There are more than can be counted or categorized conveniently. The names of some designs are derived from the settings where they were developed or extensively used. "Split plot" designs, for example, originated from agricultural research. Some design names refer to the people who invented or popularized them. "Plackett-Burman" and "Taguchi" are examples. Other design names refer to technical elements of the design itself. "Central composite" and "fractional factorial" are examples.

References to experimental designs are rooted in the ever-growing literature on experimentation. During the course of the last century, the universe of experimental design names has "just grown." Perhaps it comes as a disappointment that no clear naming convention exists to distinguish designs. For example, a Taguchi design may share some, but not all technical features with a composite design.

Furthermore, two names may be used to mean the same basic kind of design. For example, one person might refer to a design as a "second-order, three-level factorial," while another might refer to it as a "Box-Behnken" design.

What's the best design? Unless your grasp of matrix algebra is good and you have done extensive reading of the literature of experiments and linear models, don't pretend you know how to compare the merits of all designs. Instead, use your time wisely. Create a statement of your experimental motive. Then seek help from someone who is truly knowledgeable, use appropriate computer software to guide you to a design or use the simplest design you know how to use. Then forge ahead against all odds and do the experiment.

Why experiment? Instead of tackling experimentation by sorting through all the possible design types, start by figuring out your own experimental motive. Why do you want to experiment? What kind of knowledge do you want to have when the experiment ends?

Two examples of the many possible experimental motives are screening and empirical modeling. A screening motive asks this question: "Out of all these inputs, which ones affect the outcomes?" Screening motivates the first stages of inquiries to sift through many input variables of unknown importance. An empirical modeling motive asks another question: "If the inputs are set to certain values, what will the outcome values be?" Empirical modeling seeks input values that result in desirable outcome values.

Recent DOE converts often fail to understand that motives for experimenting can differ. Resist the temptation to identify emotionally with some favorite category of allegedly superior designs.

Understanding your motive for experimenting provides you with the information you'll need to select an appropriate design. You will come up with answers to questions like these: "How many input and outcome variables must be handled? How much time and money can you invest? What higher-order relationships need to be resolved? How are experiments performed?"

As DOE fever has spread in recent years, many quality practitioners have failed to do correctly the experiments they have spent so much time learning to design correctly. Botched data acquisition, nonexistent validation of required input-level values, screwed-up timing, inattention to changing environmental variables and imposed human bias can limit the possibility that even the best experimental design can advance understanding.

Why is it nonsense to say, "We did a DOE"? Somehow, the three letters that stand for the design of experiments have been confused with the doing of experiments. Despite the implications of popular lingo, designing an experiment is not doing one. Designing an experiment is just the first of many steps in gaining knowledge through experimentation. And the correct design is no more important than correct planning, correct execution and correct analysis of the experimental data.

Consider that the results of an experiment with the "wrong" design that is nevertheless well-executed have more value than the results of an experiment with the "right" design where the execution is a failure. Of course, you really want it all: good design, planning, execution and analysis.

What should DOE software offer? Consider your motives, and think of what you want to do. You will want to plan for the successful performance of experiments and actually perform experiments. You will want to acquire and analyze experimental data, arrive at appropriate findings based on the analysis and com- municate your findings in reports. And, most importantly, you will want to apply your findings to your process in a way that gets results.

Look for features that help you do what you want to do. Some DOE software merely helps you select a design. Most recent DOE software can also analyze experimental data resulting from the designs it provides. A new trend is to provide users with expert advice while interpreting any data analysis performed. However, you probably will be on your own when it comes to the nuts and bolts of planning and performing your experiment.

Usually, you also must initiate the creation of the data set that the software analyzes, although the software may provide a form with a set of database fields or a link to a third-party spreadsheet so it can locate the experimental data you enter for later analysis. Most DOE software has reporting capabilities or is linked to a database management system that makes reports. Don't expect a lot of help from DOE software in applying your findings.

My best advice to DOE software shoppers is to create a written set of requirements based on your motives for experimenting. Avoid listing software features. Rather, try to list what you need accomplished. If you have a hard time imagining everything, flowchart the process with which you're going to experiment. What are the possible input and output variables? What reports do you need? How will experimental results be implemented? What kinds of training and support do you need? Prioritize your requirements.

Be realistic. Recognize the gaps between what DOE software can do and what your experimental motive requires to be done. Figure out how you will fill those gaps before you make a purchase.

Visualize your experimental setting. Can you see exactly what tasks the software performs? Can you see the tasks you (or other people) perform? Is there anything wrong or missing from this picture? If this process brings up new questions, seek the answers.

About the author
H. Stith Bennett has taught at the University of Illinois, the University of Washington and the University of Missouri. In 1980, he began Colorado Observations, a general research and statistical consulting firm. In 1987, he co-founded SynchroStat Systems Corp., a software development company specializing in real-time statistical applications for industry and science.