|
4.
Process Modeling
4.1. Introduction to Process Modeling 4.1.4. What are the some of the different statistical methods for model building?
|
|||
| Modeling Workhorse | Linear least sum of squares regression is by far the most widely used modeling method. It is what most people mean when they say they have used "regression", "linear regression" or "least squares" to fit a model to their data. Not only is linear least square regression the most widely used modeling method itself, but it has been adapted to a broad range of situations that are outside its direct scope. It plays a strong underlying role in many other modeling methods, including the other methods discussed in this section, nonlinear least squares regression, weighted least squares regression and LOESS. | ||
| Definition of a Linear Least Squares Model |
Used directly, with an appropriate data
set, linear least sum of squares regression can be used to fit the data
with any function of the form
![]() in which
|
||
| Why "Least Squares"? |
Linear least sum of squares regression also gets its name from the way the
estimates of the unknown parameters are computed. The "method of least
squares" that is used to obtain parameter estimates was independently
developed in the late 1700's and the early 1800's by the mathematicians
Karl Friedrich Gauss, Adrien Marie Legendre and (possibly) Robert Adrain
[Stigler (1978)]
[Harter (1983)]
[Stigler (1986)]
working in Germany, France and America, respectively. In the least squares
method the unknown parameters are estimated by minimizing the sum of the
squared deviations between the data and the model. The minimization process
reduces the overdetermined system of equations formed by the data to a
sensible system of (where
is the number of parameters
in the functional part of the model) equations in
unknowns.
This new system of equations is then solved to obtain the parameter estimates. To learn more about how the method of least squares is used to estimate the
the parameters see Section 4.4.3.1.
|
||
| Examples of Linear Models |
As just mentioned above, linear models are not limited to being straight lines
or planes, but include a fairly wide range shapes. For example, a simple
quadratic curve
![]() is linear in the statistical sense. A straight line model in
![]() or a polynomial in
![]() is also linear in the statistical sense, though not with respect to the observed explanatory variable, .
|
||
| Nonlinear Model Example |
Just at models that are linear in the statistical sense do not
have to be linear with respect to the explanatory variables, nonlinear
models can be linear with respect to the explanatory variables, but
not with respect to the parameters. For example,
![]() is linear in , but it cannot
be written in the general form of a linear model presented
above. This
is because the slope of this line is expressed as the product of two
parameters. As a result, nonlinear least squares regression could be
used to fit this model, but linear least squares cannot be. For further
examples and discussion of nonlinear models see the next section,
Section 4.1.4.2.
|
||
| Advantages of Linear Least Squares | Linear least squares regression has earned its place as the primary tool for process modeling because of its effectiveness and completeness. | ||
| Though there are types of data that are better described by functions that are nonlinear in the parameters, many processes in science and engineering are well described by linear models. This can either be because the processes are inherently linear or because, over short ranges, any process can be approximated well by a linear model. | |||
| The estimates of the unknown parameters obtained from linear least squares regression are the optimal estimates from of a broad class of possible parameter estimates under the usual assumptions used for process modeling. Practically speaking, linear least squares regression makes very efficient use of the data. Good results can be obtained with relatively small datasets. | |||
| Finally, the theory associated with linear regression is well understood and allows for construction of different types of easily-interpretable statistical intervals for predictions, calibrations, and optimizations. These statistical intervals can then be used to give clear answers to scientific and engineering questions. | |||
| Disadvantages of Linear Least Squares | The main disadvantages to linear least squares are limitations in the shapes that linear models can take on over long ranges, poor extrapolation properties and sensitivity to outliers. | ||
| Linear models curve relatively slowly, so for inherently nonlinear processes it becomes increasingly difficult to find a linear model that fits the data well as the range of the data increases. As the explanatory variables go to infinity, the output of the linear model will also always go to positive or negative infinity. This means that linear models may not be effective for exptrapolating the results of a process for which data cannot be collected in the region of interest. Of course extrapolation is dangerous regardless of the model type, but linear models are particularly bad choices for processes in which it is known that the response does not ultimately approach infinity. For example, a linear model would not be a good choice for extrapolations of material strength over time or of chemical reaction rate based on a amount of catalyst in the system. In both of these cases it seems clear that response cannot continue to increase indefinitely. | |||
| Finally, while the method of least squares often gives optimal estimates of the unknown parameters, it is very sensitive to the presence of unusual data points in the data used to fit a model. One or two outliers can sometimes seriously skew the results of a least squares analysis. This makes model validation, especially with respect to outliers, critical to obtaining sound answers to the questions motivating the construction of the model. | |||