|
1.
Exploratory Data Analysis
1.3. EDA Techniques 1.3.3. Graphical Techniques: Alphabetic
|
|||
|
Purpose: Check randomness |
Autocorrelation plots
(Box and Jenkins, pp. 28-32)
are a commonly-used tool for
checking randomness in a data set. This randomness is
ascertained by computing autocorrelations for data values
at varying time lags. If random, such autocorrelations
should be near zero for any and all time lag separations.
If non-random, then one or more of the autocorrelations
will be significantly non-zero.
In addition, autocorrelation plots are used in the model identification stage for Box-Jenkins autoregressive, moving average time series models. |
||
|
Sample Plot: Autocorrelations should be near-zero for randomness. Such is not the case in this example and thus the randomness assumption fails |
This sample autocorrelation plot shows that the time series is not random, but rather has a high-degree of auto-correlation between adjacent and near-adjacent observations. |
||
|
Definition: r(h) versus h |
Autocorrelation plots are formed by
|
||
| Questions |
The autocorrelation plot can provide answers to the following questions:
|
||
|
Importance: Ensure validity of engineering conclusions |
Randomness (along with fixed model, fixed variation, and fixed distribution) constitute one of the four assumptions that typically underlie all measurement processes. The randomness assumption is critically important for the following three reasons:
|
||
| Examples | |||
| Related Techniques |
Partial Autocorrelation
Plot Lag Plot Spectral Plot Seasonal Subseries Plot |
||
| Case Study | The autocorrelation plot is demonstrated in the beam deflection data case study. | ||
| Software | Autocorrelation plots are available in most general purpose statistical software programs including Dataplot. | ||