Chauvenet criterion is a stronger statistical test that may be used to determine possible Outliers and decide whether or not a bad data point should be discarded. An outlier among residuals is one that is far greater than the rest and lies three or four standard deviations or further from the mean of all the residuals. An outlier indicates a data point which is not at all typical of the rest of the data. Then an outlier should be submitted to particularly careful examination.
Suppose to have a set of residuals:
where
is ith real value and
is the value estimated in the
Response Surface.
We can calculate the number of standard deviations by:
modeFRONTIERTM identifies as outliers the points that have:
where
depends on the total number of designs (n).
Some of the values for
used in modeFRONTIERTM are given in the following table.
| Number of Designs | ![]() | Number of Designs | ![]() |
|---|---|---|---|
| 5 | 1.65 | 30 | 2.39 |
| 6 | 1.73 | 40 | 2.49 |
| 7 | 1.81 | 50 | 2.57 |
| 8 | 1.86 | 60 | 2.64 |
| 9 | 1.91 | 80 | 2.74 |
| 10 | 1.96 | 100 | 2.81 |
| 12 | 2.04 | 150 | 2.93 |
| 14 | 2.10 | 200 | 3.02 |
| 16 | 2.15 | 300 | 3.14 |
| 18 | 2.20 | 400 | 3.23 |
| 20 | 2.24 | 500 | 3.29 |
| 25 | 2.33 | 1000 | 3.48 |
Note: The Chauvenet criterion can be repeatedly applied to clean up sets of data. This criterion must be applied with great care to avoid the throwing away of correctly designs with relevant, even surprising, information.