
Mon, 09/10/2018 - 12:03
In “Data Snooping Part 1” (Quality Digest, Aug. 6, 2018) we discovered the basis for the first caveat of data snooping. Here we discover three additional caveats of data snooping.
Last month we discovered:
Here we will use the data set from Part…
Mon, 08/06/2018 - 12:03
Data mining is the foundation for the current fad of “big data.” Today’s software makes it possible to look for all kinds of relationships among the variables contained in a database. But owning a pick and shovel will not do you much good if you do…
Mon, 07/16/2018 - 12:03
The ultimate purpose for collecting data is to take action. In some cases the action taken will depend upon a description of what is at hand. In others the action taken will depend upon a prediction of what will be. The use of data in support of…
Mon, 06/04/2018 - 12:03
Some properties of a probability model are hard to describe in practical terms. The explanation for this rests upon the fact that most probability models will have both visible and invisible portions. Understanding how to work with these two…
Mon, 05/07/2018 - 12:03
In The Music Man, the con man Prof. Harold Hill sells band instruments and uniforms and then tells the kids that they can play music if they will “just think about the notes and then play them.” In many ways this “think system” is similar to what…
Mon, 04/02/2018 - 12:03
Last month we looked at what the empirical rule tells us about the data in a histogram. This month we will consider if there are any commonalities between different probability models that will allow us to make categorical statements without having…
Mon, 03/05/2018 - 12:03
How can we use descriptive statistics to characterize our data? When I was teaching at the University of Tennessee I found a curious statement in a textbook that offered a practical answer to this question. This statement was labeled as “the…
Mon, 02/05/2018 - 12:03
Whenever we make a measurement, we have to decide how many digits to record. Traditional answers for this question are often little more than guesswork glorified by time. And with digital readouts, are all the displayed digits real? This column…
Mon, 01/08/2018 - 12:03
The precision to tolerance ratio is commonly used to characterize the usefulness of a measurement system. While this ratio is appealingly simple, it overstates the damage due to measurement error. In this paper we show how to compute honest…
Mon, 12/04/2017 - 12:03
Capability ratios are widely used and sometimes misunderstood. The computer will gladly offer up values of each of the commonly used capability and performance indexes. Yet there is little appreciation of the inherent uncertainty contained in each…