7.
Product and Process Comparisons
7.2.
|
Comparisons based on data from one process
|
|
| Questions answered in this section |
For a single process, the current state of the
process can be compared with a nominal or hypothesized state. This section
outlines techniques for answering the following questions from data gathered
from a single process:
-
Do the observations come from a particular distribution?
-
Chi-Square Goodness of Fit test for a continuous or
discrete distribution
-
Kolmogorov- Smirnov test for a continuous distribution
-
Anderson-Darling test for a continuous distribution
-
Are the data consistent with the assumed process mean?
-
Confidence interval approach
-
Sample sizes required
-
Are the data consistent with a nominal standard deviation?
-
Confidence interval approach
-
Sample sizes required
-
Does the proportion of defectives meet requirements?
-
Confidence intervals for large sample sizes
-
Confidence intervals for small sample sizes
-
Sample sizes required
-
What intervals contain a fixed percentage of the data?
-
Approximate intervals that contain most of the population
values
-
Percentiles
-
Tolerance intervals
-
Tolerance intervals using EXCEL
-
Tolerance intervals based on the smallest and largest
observations
|
| General forms of testing |
These questions are addressed either by an hypothesis
test or by a confidence interval. |
| Parametric vs. non-parametric testing |
All hypothesis testing procedures can be broadly
described as either parametric or nonparametric/distribution-free. Parametric
test procedures are those that:
-
Involve hypothesis testing of specified paramters (such as "the population
mean=50 grams"...).
-
Require a stringent set of assumptions about the underlying sampling distributions.
|
| When to use nonparametric methods? |
When do we require non-parametric or distribution
free methods? Here are a few circumstances that may be candidates:
-
The measurements are only categorical; i.e., they are nominally scaled,
or ordinally (in ranks) scaled.
-
The assumptions underlying the use of parametric methods cannot be met.
-
The situation at hand requires an investigation of such features as randomness,
independence, symmetry, or goodness of fit rather than the testing of hypotheses
about specific values of particular population parameters.
|
| Difference between non-parametric and distribution-free |
Some authors distinguish between non-parametric
and distribution free procedures.
Distribution-free test procedures are broadly defined as:
-
Those whose test statistic does not depend on the form of the underlying
population distribution from which the sample data were drawn, or
-
Those for which the data are nominally or ordinally scaled.
Non-parametric test procedures are defined as those that are not
concerned with the parameters of a distribution. |
| Advantages and disadvantages of nonparametric
methods. |
Distribution-free or nonparametric methods have
several advantages, or benefits:
-
They may be used on all types of data-categorical data, which nominally
scaled or are in rank form, called ordinally scaled, as well as interval
or ratio scaled data.
-
For small sample sizes they are easy to apply.
-
They make fewer and less stringent assumptions than their parametric counterparts
-
Depending on the particular procedure they may be almost as powerful
as the corresponding parametric procedure when the assumptions of the latter
are met, and when this is not the case, they may be more powerful.
Of course there are also disadvantages:
-
If the assumptions of the parametric methods can be met, it is generally
more efficient to use them.
-
For large sample sizes, data manipulations tend to become more laborious,
unless computer software is available.
-
Often special tables of critical values are needed for the test statistic,
and these values cannot always be generated by computer software. On the
other hand the critical values for the parametric tests are readily available
and generally easy to incorporate in computer programs.
|