Applied Numerical Methods Curve Fitting: Least Squares Regression, Interpolation
Curve Fitting
a. Least Squares Regression
i. Linear Regression
ii. Polynomial Regression
iii. Multiple Linear Regression
iv. General Linear Regression
v. Nonlinear Regression
b. Interpolation
i. Newton’s Interpolating Polynomials
ii. Lagrange Interpolating Polynomials
iii. Inverse Interpolation
iv. Spline Interpolation
c. Fourier Approximation
Members
• Ardiente, Justine
• Erandio, Dogie Brian
• Calderon, Alec
• Pares, Carl
• Soliman, Gerick
• Telan, Joel
• Tulabing, Tulabing
• Villarosa, Mark
Motivation
Data is often given for discrete values along a continuum. However, you may require
estimates at points between the discrete values. The present part of this presentation describes
techniques to fit curves to such data to obtain intermediate estimates. In addition, you may
require a simplified version of a complicated function. One way to do this is to compute
values of the function at a number of discrete values along the range of interest. Then, a
simpler function may be derived to fit these values. Both of these applications are known as
curve fitting.
There are two general approaches for curve fitting that are distinguished from each other on
the basis of the amount of error associated with the data. First, where the data exhibits a
significant degree of error or “noise,” the strategy is to derive a single curve that represents the
general trend of the data. Because any individual data point may be incorrect, we make no
effort to intersect every point. Rather, the curve is designed to follow the pattern of the points
taken as a group. One approach of this nature is called least-squares regression.
Second, where the data is known to be very precise, the basic approach is to fit a curve or a
series of curves that pass directly through each of the points. Such data usually originates from
tables. Examples are values for the density of water or for the heat capacity of gases as a
function of temperature. The estimation of values between well-known discrete points is called
interpolation.
Least-Squares Regression
Linear Regression
The simplest example of a least-squares
approximation is fitting a straight line to a set of
paired observations:
(x1, y1), (x2, y2), . . . , (xn, yn).
The mathematical expression for the straight line is
y = a0 + a1x + e (17.1)
where a0 and a1 are coefficients representing the
intercept and the slope, respectively, and
e is the error, or residual, between the model and the
observations, which can be represented
by rearranging Eq. (17.1) as
e = y − a0 − a1x
Thus, the error, or residual, is the discrepancy
between the true value of y and the approximate
value, a0 + a1x, predicted by the linear equation.
FIGURE 17.1
(a) Data exhibiting significant
error. (b) Polynomial fit
oscillating beyond the range of
the data. (c) More satisfactory
result using the least-squares
fit.
Criteria for a “Best” Fit
One strategy for fitting a “best” line through the data would be to minimize the sum of the
residual errors for all the available data, as in
Σ ei = Σ (yi − a0 − a1xi)
where n = the total number of points and the sums run from i = 1 to n. However, this is an
inadequate criterion, as illustrated by Fig. 17.2a, which depicts the fit of a straight line
to two points. Obviously, the best fit is the line connecting the points.
FIGURE 17.2
Examples of some criteria for “best fit” that are inadequate for regression:
(a) minimizes the sum of the residuals, (b) minimizes the sum of the absolute values
of the residuals, and (c) minimizes the maximum error of any individual point.
Least-Squares Fit of a Straight Line
To determine values for a0 and a1, Eq. (17.3) is
differentiated with respect to each
coefficient:
Now, realizing that ∑a0 = na0, we can express the
equations as a set of two simultaneous
linear equations with two unknowns (a0 and a1):
These are called the normal equations. They can be solved simultaneously for the slope:
a1 = (n Σxiyi − Σxi Σyi) / (n Σxi² − (Σxi)²) (17.6)
This result can then be used in conjunction with Eq. (17.4) to solve for the intercept:
a0 = ȳ − a1x̄ (17.7)
where ȳ and x̄ are the means of y and x, respectively.
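These formulas translate directly into code. The following is a minimal Python/NumPy sketch of Eqs. (17.6) and (17.7); the x and y arrays are made-up sample values, not the Table 17.1 data.

import numpy as np

def linregr(x, y):
    """Least-squares fit of a straight line y = a0 + a1*x (Eqs. 17.6 and 17.7)."""
    n = len(x)
    sx, sy = np.sum(x), np.sum(y)
    sxy, sxx = np.sum(x * y), np.sum(x * x)
    a1 = (n * sxy - sx * sy) / (n * sxx - sx**2)   # slope
    a0 = np.mean(y) - a1 * np.mean(x)              # intercept: a0 = ybar - a1*xbar
    return a0, a1

# made-up sample data (not the Table 17.1 values)
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([0.9, 2.1, 2.8, 4.2, 4.9])
print(linregr(x, y))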
Example 17.1
Problem Statement. Fit a straight line to the x and y values in the first two columns of
Table 17.1.
Solution. The following quantities can be computed:
Using Eqs. (17.6) and (17.7),
Therefore, the least-squares fit is
y = 0.07142857 + 0.8392857x
The line, along with the data, is shown in Fig. 17.1c.
General Comments on Linear
Regression
Before proceeding to curvilinear and multiple linear regression, we must emphasize the
introductory nature of the foregoing material on linear regression. We have focused on the
simple derivation and practical use of equations to fit data. You should be cognizant of
the fact that there are theoretical aspects of regression that are of practical importance but
are beyond the scope of this presentation. For example, some statistical assumptions that are
inherent in the linear least-squares procedures are
1. Each x has a fixed value; it is not random and is known without error.
2. The y values are independent random variables and all have the same variance.
3. The y values for a given x must be normally distributed.
Such assumptions are relevant to the proper derivation and use of regression. For example,
the first assumption means that (1) the x values must be error-free and (2) the regression
of y versus x is not the same as x versus y.
You are urged to consult other references such as Draper and Smith (1981) to appreciate
aspects and nuances of regression that are beyond the scope of this presentation.
Polynomial Regression
In Sec. 17.1, a procedure was developed to derive the equation of a straight line
using the least-squares criterion. Some engineering data, although exhibiting a
marked pattern such as seen in Fig. 17.8, is poorly represented by a straight line. For
these cases, a curve would be better suited to fit the data. As discussed in the
previous section, one method to accomplish this objective is to use transformations.
Another alternative is to fit polynomials to the data using polynomial regression.
The least-squares procedure can be readily extended to fit the data to a higher-
order polynomial. For example, suppose that we fit a second-order polynomial or
quadratic:
y = a0 + a1x + a2x2 + e
For this case the sum of the squares of the residuals is [compare with Eq. (17.3)]
Following the procedure of the previous section, we take the
derivative of Eq. (17.18) with
respect to each of the unknown coefficients of the polynomial,
as in
These equations can be set equal to zero and rearranged to develop the following set of
normal
equations:
where all summations are from i = 1 through n. Note that the above three equations are
linear
and have three unknowns: a0, a1, and a2. The coefficients of the unknowns can be
calculated directly from the observed data.
For this case, we see that the problem of determining a least-squares second-order
polynomial is equivalent to solving a system of three simultaneous linear equations.
Techniques to solve such equations were discussed in Part Three.
The two-dimensional case can be easily extended to an mth-order polynomial as
y = a0 + a1x + a2x2 + ·· ·+amxm + e
The foregoing analysis can be easily extended to this more general case. Thus, we can
recognize that determining the coefficients of an mth-order polynomial is equivalent to solving
a system of m + 1 simultaneous linear equations. For this case, the standard error is
formulated as
This quantity is divided by n − (m + 1) because (m + 1) data-derived coefficients—
a0, a1, . . . , am—were used to compute Sr; thus, we have lost m + 1 degrees of freedom. In
addition to the standard error, a coefficient of determination can also be computed for
polynomial
regression with Eq. (17.10).
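As a concrete sketch of the procedure just described, the code below assembles the three normal equations for a quadratic from the sums of powers of x and solves them as a 3-by-3 linear system, with np.linalg.solve standing in for the Gauss elimination mentioned above. The data points are invented for demonstration and are not the Table 17.4 values.

import numpy as np

def quadratic_regression(x, y):
    """Least-squares quadratic y = a0 + a1*x + a2*x^2 via the normal equations."""
    s = [np.sum(x**k) for k in range(5)]      # s[k] = sum of x_i^k; s[0] = n
    A = np.array([[s[0], s[1], s[2]],
                  [s[1], s[2], s[3]],
                  [s[2], s[3], s[4]]])
    b = np.array([np.sum(y), np.sum(x*y), np.sum(x**2 * y)])
    return np.linalg.solve(A, b)              # returns [a0, a1, a2]

x = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])  # invented illustrative data
y = np.array([1.2, 2.9, 6.1, 11.0, 17.8, 26.9])
print(quadratic_regression(x, y))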
EXAMPLE 17.5
Problem Statement. Fit a second-order polynomial to the data in the first two columns
of Table 17.4.
Solution. From the given data,
Computations for an error analysis of the quadratic least-squares fit.
FIGURE 17.11
Fit of a second-order polynomial.
Therefore, the simultaneous linear equations are
Solving these equations through a technique such as Gauss elimination gives
a0 = 2.47857, a1 = 2.35929, and a2 = 1.86071. Therefore, the least-squares quadratic
equation for this case is
y = 2.47857 + 2.35929x + 1.86071x2
The standard error of the estimate based on the regression polynomial is [Eq. (17.20)]
The coefficient of determination is
and the correlation coefficient is r = 0.99925.
These results indicate that 99.851 percent of the original uncertainty has been
explained by the model. This result supports the conclusion that the quadratic
equation represents an excellent fit, as is also evident from Fig. 17.11.
MULTIPLE LINEAR REGRESSION
A useful extension of linear regression is the case where y is a linear function of two or
more independent variables. For example, y might be a linear function of x1 and x2, as in
y = a0 + a1x1 + a2x2 + e
Such an equation is particularly useful when fitting experimental data where the variable
being studied is often a function of two other variables. For this two-dimensional case, the
regression “line” becomes a “plane” (Fig. 17.14).
FIGURE 17.14
Graphical depiction of multiple
linear regression where y is a
linear function of x1 and x2.
As with the previous cases, the “best” values of the coefficients are determined by setting
up the sum of the squares of the residuals,
and differentiating with respect to each of the unknown coefficients,
The coefficients yielding the minimum sum of the squares of the residuals are obtained by
setting the partial derivatives equal to zero and expressing the result in matrix form as
EXAMPLE 17.6
The following data was calculated from the equation y = 5 + 4x1 −
3x2:
Use multiple linear regression to fit this data.
Solution. The summations required to develop Eq.
(17.22) are computed in Table 17.5.
The result is
which can be solved using a method such as Gauss elimination for
a0 = 5 a1 = 4 a2 = −3
which is consistent with the original equation from which the data was derived.
TABLE 17.5
Computations required
to develop the normal
equations for Example
17.6.
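The steps of Example 17.6 can be sketched in code: build the normal equations for y = a0 + a1x1 + a2x2 from the required sums and solve them. Since Table 17.5 is not reproduced here, the (x1, x2) points below are assumed for illustration; the y values are generated from the stated model, so the solution should come back as approximately a0 = 5, a1 = 4, a2 = −3.

import numpy as np

# assumed illustrative (x1, x2) points; y generated from y = 5 + 4*x1 - 3*x2
x1 = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
x2 = np.array([1.0, 0.0, 2.0, 1.0, 3.0, 2.0])
y = 5 + 4*x1 - 3*x2

n = len(y)
# normal equations for the two-variable linear model
A = np.array([[n,          np.sum(x1),     np.sum(x2)],
              [np.sum(x1), np.sum(x1*x1),  np.sum(x1*x2)],
              [np.sum(x2), np.sum(x1*x2),  np.sum(x2*x2)]])
b = np.array([np.sum(y), np.sum(x1*y), np.sum(x2*y)])
print(np.linalg.solve(A, b))   # expect roughly [5, 4, -3]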
The foregoing two-dimensional case can be easily extended to m dimensions,
as in
y = a0 + a1x1 + a2x2 +· · ·+amxm + e
where the standard error is formulated as
and the coefficient of determination is computed as in Eq. (17.10). An algorithm to set up
the normal equations is listed in Fig. 17.15.
Although there may be certain cases where a variable is linearly related to two or more
other variables, multiple linear regression has additional utility in the derivation of power
equations of the general form
GENERAL LINEAR LEAST
SQUARES
To this point, we have focused on the mechanics of obtaining least-squares fits of some
simple functions to data. Before turning to nonlinear regression, there are several issues
that we would like to discuss to enrich your understanding of the preceding material.
In the preceding pages, we have introduced three types of regression: simple linear, polynomial,
and multiple linear. In fact, all three belong to the following general linear least-squares
model:
y = a0z0 + a1z1 + a2z2 +· · ·+amzm + e (17.23)
where z0, z1, . . . , zm are m + 1 basis functions. It can easily be seen how simple and multiple
linear regression fall within this model—that is, z0 = 1, z1 = x1, z2 = x2, . . . , zm = xm.
Further, polynomial regression is also included if the basis functions are simple monomials
as in z0 = x0 = 1, z1 = x, z2 = x2, . . . , zm = xm.
Note that the terminology “linear” refers only to the model’s dependence on its
parameters—that is, the a’s. As in the case of polynomial regression, the functions themselves
can be highly nonlinear. For example, the z’s can be sinusoids, as in
y = a0 + a1 cos(ωt) + a2 sin(ωt)
General Matrix Formulation for Linear Least Squares
Such a format is the basis of Fourier analysis described in Chap. 19.
On the other hand, a simple-looking model like
f(x) = a0(1 − e−a1x )
is truly nonlinear because it cannot be manipulated into the format of Eq. (17.23). We will
turn to such models at the end of this chapter.
For the time being, Eq. (17.23) can be expressed in matrix notation as
{Y} = [Z]{A} + {E} (17.24)
where [Z] is a matrix of the calculated values of the basis functions at the measured values
of the independent variables,
where m is the number of variables in the model and n is the number of data points.
Because
n ≥ m + 1, you should recognize that most of the time, [Z] is not a square matrix.
The column vector {Y} contains the observed values of the dependent variable
{Y}T = [ y1 y2 · · · yn ]
The column vector {A} contains the unknown coefficients
{A}T = [a0 a1 · · · am ]
and the column vector {E} contains the residuals
{E}T = [e1 e2 · · · en ]
As was done throughout this chapter, the sum of the squares of the residuals for this
model can be defined as
This quantity can be minimized by taking its partial derivative with respect to
each of the coefficients and setting the resulting equation equal to zero. The
outcome of this process is the normal equations, which can be expressed concisely
in matrix form as
[[Z]T [Z]] {A} = {[Z]T {Y}} (17.25)
It can be shown that Eq. (17.25) is, in fact, equivalent to the normal
equations developed previously for simple linear, polynomial, and
multiple linear regression.
From Eq. (PT3.6), recall that the matrix inverse can be employed to solve
Eq. (17.25), as in
{A} = [[Z]T [Z]]−1 {[Z]T {Y}}
Our primary motivation for the foregoing has been to illustrate the unity among the
three approaches and to show how they can all be expressed simply in the same matrix
notation.
The coefficient of determination and the standard error can also be formulated in
terms of matrix algebra. Recall that r2 is defined as
r2 = (St − Sr) / St = 1 − Sr / St
Substituting the definitions of Sr and St gives
r2 = 1 − Σ(yi − ŷi)² / Σ(yi − ȳ)²
where ŷi is the prediction of the least-squares fit. The residuals between the best-fit curve
and the data, yi − ŷi, can be expressed in vector form as
{Y} − [Z]{A}
Matrix algebra can then be used to manipulate this vector to compute the coefficient
of determination and the standard error of the estimate. The matrix notation will also
have relevance when we turn to nonlinear regression in the last section of this chapter.
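To make the matrix formulation concrete, the sketch below fits the sinusoidal model y = a0 + a1 cos(ωt) + a2 sin(ωt) mentioned above by assembling [Z] column by column and solving the normal equations [Z]T[Z]{A} = [Z]T{Y}; it then evaluates r2 = 1 − Sr/St. The value of ω, the sample times, and the synthetic y data are all assumptions made for the example.

import numpy as np

omega = 2.0                                  # assumed angular frequency
t = np.linspace(0.0, 3.0, 25)                # assumed sample times
rng = np.random.default_rng(1)
y = (1.5 + 0.8*np.cos(omega*t) - 0.4*np.sin(omega*t)
     + 0.05*rng.standard_normal(t.size))     # synthetic observations

# [Z]: one column per basis function z0 = 1, z1 = cos(wt), z2 = sin(wt)
Z = np.column_stack([np.ones_like(t), np.cos(omega*t), np.sin(omega*t)])

# normal equations [Z]^T [Z] {A} = [Z]^T {Y}
a = np.linalg.solve(Z.T @ Z, Z.T @ y)

yhat = Z @ a                                 # predictions of the least-squares fit
Sr = np.sum((y - yhat)**2)
St = np.sum((y - np.mean(y))**2)
print(a, 1 - Sr/St)                          # coefficients and r^2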
NONLINEAR REGRESSION
There are many cases in engineering where nonlinear models must be fit to data. In the
present context, these models are defined as those that have a nonlinear dependence
on their parameters. For example, the model f(x) = a0(1 − e−a1x) introduced earlier cannot
be manipulated so that it conforms to the general form of Eq. (17.23).
As with linear least squares, nonlinear regression is based on determining the values of the
parameters that minimize the sum of the squares of the residuals. However, for the nonlinear
case, the solution must proceed in an iterative fashion.
There are techniques expressly designed for nonlinear regression. For example, the Gauss-
Newton method uses a Taylor series expansion to express the original nonlinear equation in
an approximate, linear form. Then least-squares theory can be used to obtain new
estimates of the parameters that move in the direction of minimizing the residual. Details on
this approach are provided elsewhere (Chapra and Canale, 2002).
An alternative is to use optimization techniques to directly determine the least-squares fit.
For example, Eq. (13.12) can be expressed as an objective function to compute the sum
of the squares:
An optimization routine can then be used to determine the values of a0 and a1 that
minimize the function.
MATLAB's fminsearch function can be used for this purpose. It has the general syntax
[x, fval] = fminsearch(fun,x0,options,p1,p2,...)
where x = a vector of the values of the parameters that minimize the function fun, fval =
the value of the function at the minimum, x0 = a vector of the initial guesses for the
parameters, options = a structure containing values of the optimization parameters as created
with the optimset function (recall Section 6.4), and p1, p2, etc. = additional arguments
that are passed to the objective function. Note that if options is omitted, MATLAB uses
default values that are reasonable for most problems. If you would like to pass additional
arguments (p1, p2, ...), but do not want to set the options, use empty brackets [ ] as a
placeholder.
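The same approach can be sketched outside MATLAB. SciPy's minimize with the Nelder-Mead method (the simplex algorithm that fminsearch also implements) minimizes the sum of squared residuals for the nonlinear model f(x) = a0(1 − e−a1x). The data points and the initial guesses below are assumptions chosen only to illustrate the call.

import numpy as np
from scipy.optimize import minimize

# assumed illustrative data
x = np.array([0.25, 0.75, 1.25, 1.75, 2.25])
y = np.array([0.28, 0.57, 0.68, 0.74, 0.79])

def ssr(a, x, y):
    """Sum of squared residuals for the model f(x) = a0*(1 - exp(-a1*x))."""
    a0, a1 = a
    return np.sum((y - a0*(1.0 - np.exp(-a1*x)))**2)

result = minimize(ssr, x0=[1.0, 1.0], args=(x, y), method="Nelder-Mead")
print(result.x, result.fun)   # fitted [a0, a1] and the minimized sum of squares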
Interpolation
NEWTON’S INTERPOLATING
POLYNOMIAL
There are a variety of alternative forms for expressing an interpolating polynomial beyond
the familiar format of Eq. (14.2). Newton's divided-difference interpolating polynomial is
among the most popular and useful forms. Before presenting the general equation, we
will introduce the first- and second-order versions because of their simple visual
interpretation.
Linear Interpolation
The simplest form of interpolation is to connect two data points with a straight line. This
technique, called linear interpolation, is depicted graphically in Fig. 14.2. Using similar
triangles,
which can be rearranged to yield
which is the Newton linear-interpolation formula. The notation f1(x) designates that this is a
first-order interpolating polynomial. Notice that besides representing the slope of the line
connecting the points, the term [f(x2) − f(x1)]/(x2 − x1) is a finite-divided-difference
approximation of the first derivative [recall Eq. (4.20)]. In general, the smaller the interval
between the data points, the better the approximation. This is due to the fact that, as the
interval decreases, a continuous function will be better approximated by a straight line. This
characteristic is demonstrated in the following example.
FIGURE 14.2 Graphical depiction of linear
interpolation. The shaded areas indicate the
similar triangles used to derive the Newton
linear-interpolation formula [Eq. (14.5)].
Example 14.2
Estimate the natural logarithm of 2 using linear interpolation. First, perform the
computation by interpolating between ln 1 = 0 and ln 6 = 1.791759. Then, repeat the
procedure, but use a smaller interval from ln 1 to ln 4 (1.386294). Note that the true
value of ln 2 is 0.6931472.
We use Eq. (14.5) from x1 = 1 to x2 = 6 to give
which represents an error of εt = 48.3%. Using the smaller interval from x1 = 1
to x2 = 4 yields
Thus, using the shorter interval reduces the percent relative error to εt = 33.3%. Both
interpolations are shown in Fig. 14.3, along with the true function.
FIGURE 14.3 Two linear interpolations to estimate ln 2. Note how the smaller
interval provides a better estimate.
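The two estimates in Example 14.2 can be checked with a few lines of code. The sketch below implements the Newton linear-interpolation formula of Eq. (14.5) in Python and reproduces the percent relative errors quoted above.

import numpy as np

def newton_linear(x1, x2, f1, f2, x):
    """Eq. (14.5): first-order (linear) interpolation between (x1, f1) and (x2, f2)."""
    return f1 + (f2 - f1) / (x2 - x1) * (x - x1)

true_val = 0.6931472                          # ln 2
for x1, x2 in [(1.0, 6.0), (1.0, 4.0)]:
    est = newton_linear(x1, x2, np.log(x1), np.log(x2), 2.0)
    err = abs(true_val - est) / true_val * 100
    print(x1, x2, est, err)                   # errors of 48.3% and 33.3%, respectively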
General Form of Newton's Interpolating Polynomials
The preceding analysis can be generalized to fit an (n - 1)th-order polynomial to n data
points. The (n - 1)th-order polynomial is
As was done previously with linear and quadratic interpolation, data points can be used to
evaluate the coefficients b1, b2 , . . ., bn. For an (n — 1)th-order polynomial, n data points
are required: [x1, f (x1)], [x2, f(x2)], . . . , [xn, f (xn)]. We use these data points and the following
equations to evaluate the coefficients:
where the bracketed function evaluations are
finite divided differences. For example, the first
finite divided difference is represented
generally as
The second finite divided difference, which represents the difference of two first divided
differences, is expressed generally as
Similarly, the nth finite divided difference is
FIGURE 14.5 Graphical depiction of the recursive nature of finite divided differences. This
representation is referred to as a divided difference table.
These differences can be used to evaluate the coefficients in Eqs. (14.11) through (14.14),
which can then be substituted into Eq. (14.10) to yield the general form of Newton's
interpolating polynomial:
We should note that it is not necessary that the data points used in Eq. (14.18) be equally
spaced or that the abscissa values necessarily be in ascending order, as illustrated in the
following example. Also, notice how Eqs. (14.15) through (14.17) are recursive; that is,
higher-order differences are computed by taking differences of lower-order differences
(Fig. 14.5).
Example 14.4
In Example 14.3, data points at x1 = 1, x2 = 4, and x3 = 6 were used to estimate ln 2
with a parabola. Now, adding a fourth point [x4 = 5; f(x4) = 1.609438], estimate ln 2
with a third-order Newton's interpolating polynomial.
The third-order polynomial, Eq. (14.10) with n = 4, is
The first divided differences for the problem are [Eq. (14.15)]
The second divided differences are [Eq.(14.16)]
The third divided difference is [Eq. (14.17) with n = 4]
Thus, the divided difference table is
The results for f(x1), f[x2, x1], f[x3, x2, x1], and f[x4, x3, x2, x1] represent the
coefficients b1, b2, b3, and b4, respectively, of Eq. (14.10). Thus, the interpolating
cubic is
which can be used to evaluate f3(2) = 0.6287686, which represents a
relative error of εt = 9.3%. The complete cubic polynomial is shown in
Fig. 14.6.
FIGURE 14.6 The use of cubic interpolation to estimate ln 2.
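The divided-difference table and the cubic of Example 14.4 can be reproduced with a short routine. The sketch below builds the table recursively, as in Eqs. (14.15) through (14.17), and evaluates the Newton polynomial at x = 2; it should return approximately 0.6287686.

import numpy as np

def newton_interp(x, f, xi):
    """Newton's divided-difference interpolating polynomial evaluated at xi."""
    n = len(x)
    dd = np.zeros((n, n))
    dd[:, 0] = f                                # zeroth-order differences: f(x_i)
    for j in range(1, n):                       # build the divided-difference table
        for i in range(n - j):
            dd[i, j] = (dd[i + 1, j - 1] - dd[i, j - 1]) / (x[i + j] - x[i])
    # f_n(x) = b1 + b2*(x - x1) + b3*(x - x1)(x - x2) + ...
    result, prod = dd[0, 0], 1.0
    for j in range(1, n):
        prod *= xi - x[j - 1]
        result += dd[0, j] * prod
    return result

x = np.array([1.0, 4.0, 6.0, 5.0])                 # points from Examples 14.3 and 14.4
f = np.array([0.0, 1.386294, 1.791759, 1.609438])  # ln(x) at those points
print(newton_interp(x, f, 2.0))                    # ~0.6287686; true ln 2 = 0.6931472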
LAGRANGE INTERPOLATING
POLYNOMIAL
Suppose we formulate a linear interpolating polynomial as the weighted average of the
two values that we are connecting by a straight line:
where the L's are the weighting coefficients. It is logical that the first weighting coefficient is
the straight line that is equal to 1 at x1 and 0 at x2:
Similarly, the second coefficient is the straight line that is equal to 1 at x2 and 0 at x1:
Substituting these coefficients into Eq. 14.19 yields the straight line that connects the points
(Fig. 14.8):
where the nomenclature f1(x) designates that this is a first-order polynomial. Equation
(14.20) is referred to as the linear Lagrange interpolating polynomial.
The same strategy can be employed to fit a parabola through three points. For
this case three parabolas would be used with each one passing through one of the points
and equaling zero at the other two. Their sum would then represent the unique parabola
that connects the three points. Such a second-order Lagrange interpolating polynomial
can be written as
Notice how the first term is equal to f(x1) at x1 and is equal to zero at x2 and x3. The other
terms work in a similar fashion.
Both the first- and second-order versions as well as higher-order Lagrange polynomials can
be represented concisely as
FIGURE 14.8 A visual depiction of the rationale behind Lagrange interpolating
polynomials. The figure shows the first-order case. Each of the two terms of
Eq. (14.20) passes through one of the points and is zero at the other. The summation
of the two terms must, therefore, be the unique straight line that connects the
two points.
where n = the number of data points and Π designates the "product of."
Example 14.5
Use a Lagrange interpolating polynomial of the first and second order to evaluate
the density of unused motor oil at T = 15°C based on the following data:
The first-order polynomial [Eq. (14.20)] can be used to obtain the estimate at T = 15°C:
In a similar fashion, the second-order polynomial is developed as [Eq. (14.21)]
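Both the first- and second-order formulas translate directly into a short routine based on the general expression above. Because the motor-oil data of Example 14.5 is not reproduced here, the sketch below instead evaluates the second-order Lagrange polynomial through the ln(x) points used in the earlier Newton examples.

import numpy as np

def lagrange_interp(x, f, xi):
    """Evaluate the Lagrange interpolating polynomial through (x[i], f[i]) at xi."""
    n = len(x)
    total = 0.0
    for i in range(n):
        L = 1.0
        for j in range(n):
            if j != i:
                L *= (xi - x[j]) / (x[i] - x[j])   # weighting coefficient L_i(xi)
        total += f[i] * L
    return total

x = np.array([1.0, 4.0, 6.0])      # ln(x) points reused from the Newton examples
f = np.log(x)
print(lagrange_interp(x, f, 2.0))  # second-order estimate of ln 2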
INVERSE INTERPOLATION
As the nomenclature implies, the f (x) and x values in most interpolation contexts are the
dependent and independent variables, respectively. As a consequence, the values of
the x's are typically uniformly spaced. A simple example is a table of values derived for
the function f(x) = 1/x:
Now suppose that you must use the same data, but you are given a value for f (x) and must
determine the corresponding value of x. For instance, for the data above, suppose that you
were asked to determine the value of x that corresponded to f (x) = 0.3. For this case,
because the function is available and easy to manipulate, the correct answer can be
determined directly as x = 1/0.3 = 3.3333.
Such a problem is called inverse interpolation. For a more complicated case, you might be
tempted to switch the f(x) and x values [i.e., merely plot x versus f(x)] and use an
approach like Newton or Lagrange interpolation to determine the result. Unfortunately,
when you reverse the variables, there is no guarantee that the values along the new
abscissa [the f(x)'s] will be evenly spaced. In fact, in many cases, the values will be
"telescoped." That is, they will have the appearance of a logarithmic scale, with some
adjacent points bunched together and others spread out widely. For example,
for f(x) = 1/x the result is
Such nonuniform spacing on the abscissa often leads to oscillations in the resulting
interpolating polynomial. This can occur even for lower-order polynomials. An alternative
strategy is to fit an nth-order interpolating polynomial, fn(x), to the original data
[i.e., with f(x) versus x]. In most cases, because the x's are evenly spaced, this polynomial
will not be ill-conditioned. The answer to your problem then amounts to finding the value of
x that makes this polynomial equal to the given f(x). Thus, the interpolation problem
reduces to a roots problem!
The answer to the inverse interpolation problem of finding the x corresponding to f(x) =
0.3 would therefore involve determining the root of
For example, for the problem just outlined, a simple approach would be to fit a quadratic
polynomial to the three points: (2, 0.5), (3, 0.3333), and (4, 0.25). The result would be
For this simple case, the quadratic formula can be used to calculate
Thus, the second root, 3.296, is a good approximation of the true value of 3.333. If additional
accuracy were desired, a third- or fourth-order polynomial along with one of the root-
location methods from Chaps. 5 or 6 could be employed.
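The whole inverse-interpolation procedure fits in a few lines. The sketch below fits the quadratic through the three tabulated points with np.polyfit and then uses np.roots as the root-location step; one of the two roots should be close to 3.296.

import numpy as np

# the three points taken from the f(x) = 1/x table
x = np.array([2.0, 3.0, 4.0])
fx = np.array([0.5, 0.3333, 0.25])

c = np.polyfit(x, fx, 2)   # quadratic f2(x); coefficients, highest power first
c[-1] -= 0.3               # roots of f2(x) - 0.3 = 0
print(np.roots(c))         # one root should be near 3.296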
Applied Numerical Methods Curve Fitting: Least Squares Regression, Interpolation
Applied Numerical Methods Curve Fitting: Least Squares Regression, Interpolation

Weitere ähnliche Inhalte

Was ist angesagt?

Non linear curve fitting
Non linear curve fitting Non linear curve fitting
Non linear curve fitting Anumita Mondal
 
Graph Data Structure
Graph Data StructureGraph Data Structure
Graph Data StructureKeno benti
 
Iteration method-Solution of algebraic and Transcendental Equations.
Iteration method-Solution of algebraic and Transcendental Equations.Iteration method-Solution of algebraic and Transcendental Equations.
Iteration method-Solution of algebraic and Transcendental Equations.AmitKumar8151
 
Multiple intigration ppt
Multiple intigration pptMultiple intigration ppt
Multiple intigration pptManish Mor
 
Numerical differentiation
Numerical differentiationNumerical differentiation
Numerical differentiationandrushow
 
Monte Carlo Simulation lecture.pdf
Monte Carlo Simulation lecture.pdfMonte Carlo Simulation lecture.pdf
Monte Carlo Simulation lecture.pdfWellingtonIsraelQuim
 
Rod Cutting Problem
Rod Cutting ProblemRod Cutting Problem
Rod Cutting ProblemQuaziRossi
 
APG Pertemuan 5 : Inferensia Vektor Rata-rata 1 Populasi
APG Pertemuan 5 : Inferensia Vektor Rata-rata 1 PopulasiAPG Pertemuan 5 : Inferensia Vektor Rata-rata 1 Populasi
APG Pertemuan 5 : Inferensia Vektor Rata-rata 1 PopulasiRani Nooraeni
 
Curve fitting - Lecture Notes
Curve fitting - Lecture NotesCurve fitting - Lecture Notes
Curve fitting - Lecture NotesDr. Nirav Vyas
 
Probabilitas dan-statistika 4
Probabilitas dan-statistika 4Probabilitas dan-statistika 4
Probabilitas dan-statistika 4didikbhzt
 
Threaded binary tree
Threaded binary treeThreaded binary tree
Threaded binary treeArunaP47
 
Binary Heap Tree, Data Structure
Binary Heap Tree, Data Structure Binary Heap Tree, Data Structure
Binary Heap Tree, Data Structure Anand Ingle
 

Was ist angesagt? (20)

Non linear curve fitting
Non linear curve fitting Non linear curve fitting
Non linear curve fitting
 
Graph Data Structure
Graph Data StructureGraph Data Structure
Graph Data Structure
 
Iteration method-Solution of algebraic and Transcendental Equations.
Iteration method-Solution of algebraic and Transcendental Equations.Iteration method-Solution of algebraic and Transcendental Equations.
Iteration method-Solution of algebraic and Transcendental Equations.
 
Curve fitting
Curve fitting Curve fitting
Curve fitting
 
Numerical analysis ppt
Numerical analysis pptNumerical analysis ppt
Numerical analysis ppt
 
Multiple intigration ppt
Multiple intigration pptMultiple intigration ppt
Multiple intigration ppt
 
Numerical differentiation
Numerical differentiationNumerical differentiation
Numerical differentiation
 
Monte Carlo Simulation lecture.pdf
Monte Carlo Simulation lecture.pdfMonte Carlo Simulation lecture.pdf
Monte Carlo Simulation lecture.pdf
 
Hashing
HashingHashing
Hashing
 
Mean value theorem
Mean value theoremMean value theorem
Mean value theorem
 
Rod Cutting Problem
Rod Cutting ProblemRod Cutting Problem
Rod Cutting Problem
 
APG Pertemuan 5 : Inferensia Vektor Rata-rata 1 Populasi
APG Pertemuan 5 : Inferensia Vektor Rata-rata 1 PopulasiAPG Pertemuan 5 : Inferensia Vektor Rata-rata 1 Populasi
APG Pertemuan 5 : Inferensia Vektor Rata-rata 1 Populasi
 
Heapsort ppt
Heapsort pptHeapsort ppt
Heapsort ppt
 
Hamiltonian path
Hamiltonian pathHamiltonian path
Hamiltonian path
 
Curve fitting - Lecture Notes
Curve fitting - Lecture NotesCurve fitting - Lecture Notes
Curve fitting - Lecture Notes
 
Probabilitas dan-statistika 4
Probabilitas dan-statistika 4Probabilitas dan-statistika 4
Probabilitas dan-statistika 4
 
Curve Fitting
Curve FittingCurve Fitting
Curve Fitting
 
Threaded binary tree
Threaded binary treeThreaded binary tree
Threaded binary tree
 
Binary Heap Tree, Data Structure
Binary Heap Tree, Data Structure Binary Heap Tree, Data Structure
Binary Heap Tree, Data Structure
 
Linear regression theory
Linear regression theoryLinear regression theory
Linear regression theory
 

Andere mochten auch

Least square method
Least square methodLeast square method
Least square methodSomya Bagai
 
Method of least square
Method of least squareMethod of least square
Method of least squareSomya Bagai
 
The least square method
The least square methodThe least square method
The least square methodkevinlefol
 
METHOD OF LEAST SQURE
METHOD OF LEAST SQUREMETHOD OF LEAST SQURE
METHOD OF LEAST SQUREDanial Mirza
 
Polynomials and Curve Fitting in MATLAB
Polynomials and Curve Fitting in MATLABPolynomials and Curve Fitting in MATLAB
Polynomials and Curve Fitting in MATLABShameer Ahmed Koya
 
Applied numerical methods lec8
Applied numerical methods lec8Applied numerical methods lec8
Applied numerical methods lec8Yasser Ahmed
 
Curve fitting
Curve fittingCurve fitting
Curve fittingdusan4rs
 
case study of curve fitting
case study of curve fittingcase study of curve fitting
case study of curve fittingAdarsh Patel
 
Polynomial functions
Polynomial functionsPolynomial functions
Polynomial functionsnicole379865
 
Hotspots of biodiversity
Hotspots of biodiversityHotspots of biodiversity
Hotspots of biodiversitySomya Bagai
 
interpolation
interpolationinterpolation
interpolation8laddu8
 
Evolution & structure of erp
Evolution & structure of erpEvolution & structure of erp
Evolution & structure of erpSomya Bagai
 
Automating pKa Curve Fitting Using Origin
Automating pKa Curve Fitting Using OriginAutomating pKa Curve Fitting Using Origin
Automating pKa Curve Fitting Using OriginBrian Bissett
 
SJUT/Mat210/Regression/Intro 2013-14S2
SJUT/Mat210/Regression/Intro 2013-14S2SJUT/Mat210/Regression/Intro 2013-14S2
SJUT/Mat210/Regression/Intro 2013-14S2John Ham
 
09 numerical differentiation
09 numerical differentiation09 numerical differentiation
09 numerical differentiationMohammad Tawfik
 

Andere mochten auch (20)

Least square method
Least square methodLeast square method
Least square method
 
Method of least square
Method of least squareMethod of least square
Method of least square
 
The least square method
The least square methodThe least square method
The least square method
 
Curve fitting
Curve fittingCurve fitting
Curve fitting
 
METHOD OF LEAST SQURE
METHOD OF LEAST SQUREMETHOD OF LEAST SQURE
METHOD OF LEAST SQURE
 
Numerical method (curve fitting)
Numerical method (curve fitting)Numerical method (curve fitting)
Numerical method (curve fitting)
 
Curve fitting
Curve fittingCurve fitting
Curve fitting
 
Polynomials and Curve Fitting in MATLAB
Polynomials and Curve Fitting in MATLABPolynomials and Curve Fitting in MATLAB
Polynomials and Curve Fitting in MATLAB
 
Applied numerical methods lec8
Applied numerical methods lec8Applied numerical methods lec8
Applied numerical methods lec8
 
Interpolation Methods
Interpolation MethodsInterpolation Methods
Interpolation Methods
 
Curve fitting
Curve fittingCurve fitting
Curve fitting
 
case study of curve fitting
case study of curve fittingcase study of curve fitting
case study of curve fitting
 
Polynomial functions
Polynomial functionsPolynomial functions
Polynomial functions
 
Hotspots of biodiversity
Hotspots of biodiversityHotspots of biodiversity
Hotspots of biodiversity
 
interpolation
interpolationinterpolation
interpolation
 
Slideshare ppt
Slideshare pptSlideshare ppt
Slideshare ppt
 
Evolution & structure of erp
Evolution & structure of erpEvolution & structure of erp
Evolution & structure of erp
 
Automating pKa Curve Fitting Using Origin
Automating pKa Curve Fitting Using OriginAutomating pKa Curve Fitting Using Origin
Automating pKa Curve Fitting Using Origin
 
SJUT/Mat210/Regression/Intro 2013-14S2
SJUT/Mat210/Regression/Intro 2013-14S2SJUT/Mat210/Regression/Intro 2013-14S2
SJUT/Mat210/Regression/Intro 2013-14S2
 
09 numerical differentiation
09 numerical differentiation09 numerical differentiation
09 numerical differentiation
 

Ähnlich wie Applied Numerical Methods Curve Fitting: Least Squares Regression, Interpolation

Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inferenceKemal İnciroğlu
 
Linear regression [Theory and Application (In physics point of view) using py...
Linear regression [Theory and Application (In physics point of view) using py...Linear regression [Theory and Application (In physics point of view) using py...
Linear regression [Theory and Application (In physics point of view) using py...ANIRBANMAJUMDAR18
 
Direct Methods to Solve Lineal Equations
Direct Methods to Solve Lineal EquationsDirect Methods to Solve Lineal Equations
Direct Methods to Solve Lineal EquationsLizeth Paola Barrero
 
Direct Methods to Solve Linear Equations Systems
Direct Methods to Solve Linear Equations SystemsDirect Methods to Solve Linear Equations Systems
Direct Methods to Solve Linear Equations SystemsLizeth Paola Barrero
 
Beginning direct3d gameprogrammingmath05_matrices_20160515_jintaeks
Beginning direct3d gameprogrammingmath05_matrices_20160515_jintaeksBeginning direct3d gameprogrammingmath05_matrices_20160515_jintaeks
Beginning direct3d gameprogrammingmath05_matrices_20160515_jintaeksJinTaek Seo
 
Series_Solution_Methods_and_Special_Func.pdf
Series_Solution_Methods_and_Special_Func.pdfSeries_Solution_Methods_and_Special_Func.pdf
Series_Solution_Methods_and_Special_Func.pdfmohamedtawfik358886
 
tw1979 Exercise 1 Report
tw1979 Exercise 1 Reporttw1979 Exercise 1 Report
tw1979 Exercise 1 ReportThomas Wigg
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regressionMaria Theresa
 
Linear equation in one variable PPT.pdf
Linear equation in one variable PPT.pdfLinear equation in one variable PPT.pdf
Linear equation in one variable PPT.pdfmanojindustry
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regressionMaria Theresa
 
Comparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionComparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionIOSRJM
 

Ähnlich wie Applied Numerical Methods Curve Fitting: Least Squares Regression, Interpolation (20)

Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inference
 
Curve fitting
Curve fittingCurve fitting
Curve fitting
 
Linear regression [Theory and Application (In physics point of view) using py...
Linear regression [Theory and Application (In physics point of view) using py...Linear regression [Theory and Application (In physics point of view) using py...
Linear regression [Theory and Application (In physics point of view) using py...
 
Es272 ch5a
Es272 ch5aEs272 ch5a
Es272 ch5a
 
Direct Methods to Solve Lineal Equations
Direct Methods to Solve Lineal EquationsDirect Methods to Solve Lineal Equations
Direct Methods to Solve Lineal Equations
 
Direct methods
Direct methodsDirect methods
Direct methods
 
Direct Methods to Solve Linear Equations Systems
Direct Methods to Solve Linear Equations SystemsDirect Methods to Solve Linear Equations Systems
Direct Methods to Solve Linear Equations Systems
 
Direct methods
Direct methodsDirect methods
Direct methods
 
Chapter05
Chapter05Chapter05
Chapter05
 
Glossary ib
Glossary ibGlossary ib
Glossary ib
 
Beginning direct3d gameprogrammingmath05_matrices_20160515_jintaeks
Beginning direct3d gameprogrammingmath05_matrices_20160515_jintaeksBeginning direct3d gameprogrammingmath05_matrices_20160515_jintaeks
Beginning direct3d gameprogrammingmath05_matrices_20160515_jintaeks
 
Series_Solution_Methods_and_Special_Func.pdf
Series_Solution_Methods_and_Special_Func.pdfSeries_Solution_Methods_and_Special_Func.pdf
Series_Solution_Methods_and_Special_Func.pdf
 
tw1979 Exercise 1 Report
tw1979 Exercise 1 Reporttw1979 Exercise 1 Report
tw1979 Exercise 1 Report
 
Chapter#8
Chapter#8Chapter#8
Chapter#8
 
Corr And Regress
Corr And RegressCorr And Regress
Corr And Regress
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Linear equation in one variable PPT.pdf
Linear equation in one variable PPT.pdfLinear equation in one variable PPT.pdf
Linear equation in one variable PPT.pdf
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
ilovepdf_merged
ilovepdf_mergedilovepdf_merged
ilovepdf_merged
 
Comparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionComparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distribution
 

Kürzlich hochgeladen

Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxVishalSingh1417
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationnomboosow
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 

Kürzlich hochgeladen (20)

Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Unit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptxUnit-IV- Pharma. Marketing Channels.pptx
Unit-IV- Pharma. Marketing Channels.pptx
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
Interactive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communicationInteractive Powerpoint_How to Master effective communication
Interactive Powerpoint_How to Master effective communication
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 

Applied Numerical Methods Curve Fitting: Least Squares Regression, Interpolation

  • 1.
  • 2. Curve Fitting a. Least Squares Regression i. Linear Regression ii. Polynomial Regression iii. Multiple Linear Regression iv. General Linear Regression v. Nonlinear Regression b. Interpolation i. Newton’s Interpolating Polynomials ii. Lagrange Interpolating Polynomials iii. Inverse Interpolation iv. Spline Interpolation c. Fourier Approximation
  • 3. Members • Ardiente, Justine • Erandio, Dogie Brian • Calderon, Alec • Pares, Carl • Soliman, Gerick • Telan, Joel • Tulabing, Tulabing • Villarosa, Mark
  • 4. Motivation Data is often given for discrete values along a continuum. However, you may require estimates at points between the discrete values. The present part of this Presetation describes techniques to fit curves to such data to obtain intermediate estimates. In addition, you may require a simplified version of a complicated function. One way to do this is to compute values of the function at a number of discrete values along the range of interest. Then, a simpler function may be derived to fit these values. Both of these applications are known as curve fitting. There are two general approaches for curve fitting that are distinguished from each other on the basis of the amount of error associated with the data. First, where the data exhibits a significant degree of error or “noise,” the strategy is to derive a single curve that represents the general trend of the data. Because any individual data point may be incorrect, we make no effort to intersect every point. Rather, the curve is designed to follow the pattern of the points taken as a group. One approach of this nature is called least-squares Regression. Second, where the data is known to be very precise, the basic approach is to fit a curve or a series of curves that pass directly through each of the points. Such data usually originates from tables. Examples are values for the density of water or for the heat capacity of gases as a function of temperature. The estimation of values between well-known discrete points is called interpolation.
  • 6. Linear Regression The simplest example of a least-squares approximation is fitting a straight line to a set of paired observations: (x1, y1), (x2, y2), . . . , (xn, yn). The mathematical expression for the straight line is y = a0 + a1x + e (17.1) ••Where a0 and a1 are coefficients representing the intercept and the slope, respectively, and e is the error, or residual, between the model and the observations, which can be represented by rearranging Eq. (17.1) as e = y − a0 − a1x Thus, the error, or residual, is the discrepancy between the true value of y and the approximate value, a0 + a1x, predicted by the linear equation. FIGURE 17.1 (a) Data exhibiting significant error. (b) Polynomial fit oscillating beyond the range of the data. (c) More satisfactory result using the least-squares fit.
  • 7. Criteria for a “Best” Fit where n = total number of points. However, this is an inadequate criterion, as illustrated by Fig. 17.2a which depicts the fit of a straight line to two points. Obviously, the best fit is the |Linear Regression One strategy for fitting a “best” line through the data would be to minimize the sum of the residual errors for all the available data, as in FIGURE 17.2 Examples of some criteria for “best fit” that are inadequate for egression: (a) minimizes the sum of the residuals, (b) minimizes the sum of the absolute values of the residuals, and (c) minimizes the maximum error of any individual point.
  • 8. To determine values for a0 and a1, Eq. (17.3) is differentiated with respect to each coefficient: Least-Squares Fit of a Straight Line |Linear Regression To determine values for a0 and a1, Eq. (17.3) is differentiated with respect to each coefficient: Now, realizing that ∑a0 = na0, we can express the equations as a set of two simultaneous linear equations with two unknowns (a0 and a1): These are called the normal equations. They can be solved simultaneously.
  • 9. This result can then be used in conjunction with Eq. (17.4) to solve for. Least-Squares Fit of a Straight Line |Linear Regression where ¯y and ¯ x are the means of y and x, respectively.
  • 10. Example 17.1 Problem Statement. Fit a straight line to the x and y values in the first two columns of Table 17.1. Solution. The following quantities can be computed: |Linear Regression Using Eqs. (17.6) and (17.7),
  • 11. Example 17.1 |Linear Regression Therefore, the least-squares fit is y = 0.07142857 + 0.8392857x The line, along with the data, is shown in Fig. 17.1c.
  • 12. General Comments on Linear Regression Before proceeding to curvilinear and multiple linear regression, we must emphasize the introductory nature of the foregoing material on linear regression. We have focused on the simple derivation and practical use of equations to fit data. You should be cognizant of the fact that there are theoretical aspects of regression that are of practical importance but are beyond the scope of this presentation. For example, some statistical assumptions that are inherent in the linear least-squares procedures are 1. Each x has a fixed value; it is not random and is known without error. 2. The y values are independent random variables and all have the same variance. 3. The y values for a given x must be normally distributed. Such assumptions are relevant to the proper derivation and use of regression. For example, the first assumption means that (1) the x values must be error-free and (2) the regression of y versus x is not the same as x versus y. You are urged to consult other references such as Draper and Smith (1981) to appreciate aspects and nuances of regression that are beyond the scope of this presentation.
  • 13.
  • 15. |Polynomial Regression In Sec. 17.1, a procedure was developed to derive the equation of a straight line using the least-squares criterion. Some engineering data, although exhibiting a marked pattern such as seen in Fig. 17.8, is poorly represented by a straight line. For these cases, a curve would be better suited to fit the data. As discussed in the previous section, one method to accomplish this objective is to use transformations. Another alternative is to fit polynomials to the data using polynomial regression. The least-squares procedure can be readily extended to fit the data to a higher- order polynomial. For example, suppose that we fit a second-order polynomial or quadratic: y = a0 + a1x + a2x2 + e For this case the sum of the squares of the residuals is [compare with Eq. (17.3)] Following the procedure of the previous section, we take the derivative of Eq. (17.18) with respect to each of the unknown coefficients of the polynomial, as in
  • 16. These equations can be set equal to zero and rearranged to develop the following set of normal equations: |Polynomial Regression where all summations are from i = 1 through n. Note that the above three equations are linear and have three unknowns: a0, a1, and a2. The coefficients of the unknowns can be calculated directly from the observed data. For this case, we see that the problem of determining a least-squares second-order polynomial is equivalent to solving a system of three simultaneous linear equations. Techniques to solve such equations were discussed in Part Three. The two-dimensional case can be easily extended to an mth-order polynomial as y = a0 + a1x + a2x2 + ·· ·+amxm + e
  • 17. |Polynomial Regression The foregoing analysis can be easily extended to this more general case. Thus, we can recognize that determining the coefficients of an mth-order polynomial is equivalent to solving a system of m + 1 simultaneous linear equations. For this case, the standard error is formulated as This quantity is divided by n − (m + 1) because (m + 1) data-derived coefficients— a0, a1, . . . , am—were used to compute Sr; thus, we have lost m + 1 degrees of freedom. In addition to the standard error, a coefficient of determination can also be computed for polynomial regression with Eq. (17.10).
  • 18. EXAMPLE 17.5 Problem Statement. Fit a second-order polynomial to the data in the first two columns of Table 17.4. Solution. From the given data, |Polynomial Regression
  • 19. Computations for an error analysis of the quadratic least-squares fit. |Polynomial Regression FIGURE 17.11 Fit of a second-order polynomial.
  • 20. Therefore, the simultaneous linear equations are |Polynomial Regression Solving these equations through a technique such as Gauss elimination gives a0 = 2.47857, a1 = 2.35929, and a2 = 1.86071. Therefore, the least-squares quadratic equation for this case is y = 2.47857 + 2.35929x + 1.86071x2 The standard error of the estimate based on the regression polynomial is [Eq. (17.20)]
  • 21. |Polynomial Regression The coefficient of determination is r^2 = 0.99851, and the correlation coefficient is r = 0.99925. These results indicate that 99.851 percent of the original uncertainty has been explained by the model. This result supports the conclusion that the quadratic equation represents an excellent fit, as is also evident from Fig. 17.11.
  • 24. |MULTIPLE LINEAR REGRESSION A useful extension of linear regression is the case where y is a linear function of two or more independent variables. For example, y might be a linear function of x1 and x2, as in y = a0 + a1x1 + a2x2 + e Such an equation is particularly useful when fitting experimental data where the variable being studied is often a function of two other variables. For this two-dimensional case, the regression “line” becomes a “plane” (Fig. 17.14). FIGURE 17.14 Graphical depiction of multiple linear regression where y is a linear function of x1 and x2.
  • 25. |MULTIPLE LINEAR REGRESSION As with the previous cases, the “best” values of the coefficients are determined by setting up the sum of the squares of the residuals, Sr = Σ(yi − a0 − a1x1,i − a2x2,i)^2, and differentiating with respect to each of the unknown coefficients. The coefficients yielding the minimum sum of the squares of the residuals are obtained by setting the partial derivatives equal to zero and expressing the result in matrix form as
[ n        Σx1,i         Σx2,i      ] [a0]   [ Σyi     ]
[ Σx1,i    Σx1,i^2       Σx1,ix2,i  ] [a1] = [ Σx1,iyi ]
[ Σx2,i    Σx1,ix2,i     Σx2,i^2    ] [a2]   [ Σx2,iyi ]   (17.22)
  • 26. EXAMPLE 17.6 The following data was calculated from the equation y = 5 + 4x1 − 3x2: |MULTIPLE LINEAR REGRESSION Use multiple linear regression to fit this data. Solution. The summations required to develop Eq. (17.22) are computed in Table 17.5. Substituting them into the normal equations gives a 3 × 3 system in a0, a1, and a2,
  • 27. which can be solved using a method such as Gauss elimination for a0 = 5, a1 = 4, and a2 = −3, which is consistent with the original equation from which the data was derived. |MULTIPLE LINEAR REGRESSION TABLE 17.5 Computations required to develop the normal equations for Example 17.6. The foregoing two-dimensional case can be easily extended to m dimensions, as in y = a0 + a1x1 + a2x2 + · · · + amxm + e, where the standard error is formulated as sy/x = sqrt(Sr / (n − (m + 1)))
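A minimal sketch of this calculation in Python/NumPy is shown below. The (x1, x2) pairs are illustrative placeholders (they need not match Table 17.5); the y values are generated exactly from the stated equation y = 5 + 4x1 − 3x2, so the fit should recover the original coefficients.

```python
import numpy as np

# Illustrative (x1, x2) pairs; y generated exactly from y = 5 + 4*x1 - 3*x2.
x1 = np.array([0.0, 1.0, 2.0, 3.0, 4.0, 5.0])
x2 = np.array([1.0, 3.0, 2.0, 5.0, 4.0, 6.0])
y = 5 + 4 * x1 - 3 * x2

n = y.size
# Normal equations for y = a0 + a1*x1 + a2*x2 [Eq. (17.22)].
A = np.array([
    [n,          x1.sum(),         x2.sum()],
    [x1.sum(),   (x1**2).sum(),    (x1 * x2).sum()],
    [x2.sum(),   (x1 * x2).sum(),  (x2**2).sum()],
])
b = np.array([y.sum(), (x1 * y).sum(), (x2 * y).sum()])
a0, a1, a2 = np.linalg.solve(A, b)
print(a0, a1, a2)   # recovers 5, 4, -3 to within round-off
```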
  • 28. and the coefficient of determination is computed as in Eq. (17.10). An algorithm to set up the normal equations is listed in Fig. 17.15. Although there may be certain cases where a variable is linearly related to two or more other variables, multiple linear regression has additional utility in the derivation of power equations of the general form y = a0 x1^a1 x2^a2 · · · xm^am. Such equations can be transformed into the multiple-linear-regression format by taking their logarithm. |MULTIPLE LINEAR REGRESSION
  • 31. To this point, we have focused on the mechanics of obtaining least-squares fits of some simple functions to data. Before turning to nonlinear regression, there are several issues that we would like to discuss to enrich your understanding of the preceding material. |GENERAL LINEAR LEAST SQUARES
  • 32. In the preceding pages, we have introduced three types of regression: simple linear, polynomial, and multiple linear. In fact, all three belong to the following general linear least-squares model: y = a0z0 + a1z1 + a2z2 + · · · + amzm + e (17.23) where z0, z1, . . . , zm are m + 1 basis functions. It can easily be seen how simple and multiple linear regression fall within this model—that is, z0 = 1, z1 = x1, z2 = x2, . . . , zm = xm. Further, polynomial regression is also included if the basis functions are simple monomials as in z0 = x^0 = 1, z1 = x, z2 = x^2, . . . , zm = x^m. Note that the terminology “linear” refers only to the model’s dependence on its parameters—that is, the a’s. As in the case of polynomial regression, the functions themselves can be highly nonlinear. For example, the z’s can be sinusoids, as in y = a0 + a1 cos(ωt) + a2 sin(ωt) |GENERAL LINEAR LEAST SQUARES General Matrix Formulation for Linear Least Squares
  • 33. |GENERAL LINEAR LEAST SQUARES Such a format is the basis of Fourier analysis described in Chap. 19. On the other hand, a simple-looking model like f(x) = a0(1 − e^(−a1x)) is truly nonlinear because it cannot be manipulated into the format of Eq. (17.23). We will turn to such models at the end of this chapter. For the time being, Eq. (17.23) can be expressed in matrix notation as {Y} = [Z]{A} + {E} (17.24) where [Z] is a matrix of the calculated values of the basis functions at the measured values of the independent variables:
[Z] =
[ z01   z11   · · ·   zm1 ]
[ z02   z12   · · ·   zm2 ]
[  .     .              . ]
[ z0n   z1n   · · ·   zmn ]
  • 34. |GENERAL LINEAR LEAST SQUARES where m is the number of variables in the model and n is the number of data points. Because n ≥ m + 1, you should recognize that most of the time, [Z] is not a square matrix. The column vector {Y} contains the observed values of the dependent variable: {Y}^T = [y1 y2 · · · yn]. The column vector {A} contains the unknown coefficients: {A}^T = [a0 a1 · · · am]. And the column vector {E} contains the residuals: {E}^T = [e1 e2 · · · en]. As was done throughout this chapter, the sum of the squares of the residuals for this model can be defined as Sr = Σ(i=1..n) (yi − Σ(j=0..m) aj zji)^2.
  • 35. |GENERAL LINEAR LEAST SQUARES This quantity can be minimized by taking its partial derivative with respect to each of the coefficients and setting the resulting equation equal to zero. The outcome of this process is the normal equations, which can be expressed concisely in matrix form as [[Z]^T[Z]]{A} = {[Z]^T{Y}} (17.25). It can be shown that Eq. (17.25) is, in fact, equivalent to the normal equations developed previously for simple linear, polynomial, and multiple linear regression. The coefficient of determination and the standard error can also be formulated in terms of matrix algebra. Recall that r^2 is defined as r^2 = (St − Sr)/St = 1 − Sr/St.
  • 36. |GENERAL LINEAR LEAST SQUARES The matrix notation will also have relevance when we turn to nonlinear regression in the last section of this chapter. From Eq. (PT3.6), recall that the matrix inverse can be employed to solve Eq. (17.25), as in {A} = [[Z]^T[Z]]^(−1) {[Z]^T{Y}}. Our primary motivation for the foregoing has been to illustrate the unity among the three approaches and to show how they can all be expressed simply in the same matrix notation. Substituting the definitions of Sr and St gives r^2 = 1 − Σ(yi − ŷi)^2 / Σ(yi − ȳ)^2, where ŷi = the prediction of the least-squares fit. The residuals between the best-fit curve and the data, yi − ŷi, can be expressed in vector form as {y} − [Z]{a}. Matrix algebra can then be used to manipulate this vector to compute the coefficient of determination and the standard error of the estimate.
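The following Python/NumPy sketch (an illustration added here, not part of the original slides) assembles the [Z] matrix for an arbitrary set of basis functions, solves the normal equations of Eq. (17.25), and computes r^2 from the residual vector {y} − [Z]{a}. The sinusoidal basis, the angular frequency w, and the synthetic data are all assumptions chosen for demonstration.

```python
import numpy as np

def general_linear_lsq(z_funcs, x, y):
    """Fit y = a0*z0(x) + ... + am*zm(x) by solving [Z^T Z]{a} = [Z^T]{y}."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    Z = np.column_stack([f(x) for f in z_funcs])   # n x (m+1) matrix of basis values
    a = np.linalg.solve(Z.T @ Z, Z.T @ y)          # normal equations, Eq. (17.25)
    residuals = y - Z @ a                           # {y} - [Z]{a}
    Sr = residuals @ residuals
    St = np.sum((y - y.mean())**2)
    r2 = 1.0 - Sr / St                              # coefficient of determination
    return a, r2

# Illustrative use with sinusoidal basis functions: z0 = 1, z1 = cos(wt), z2 = sin(wt).
w = 2.0                                             # assumed angular frequency
t = np.linspace(0, 4, 50)
y = 1.5 + 0.8 * np.cos(w * t) - 0.3 * np.sin(w * t) + 0.05 * np.random.randn(t.size)
a, r2 = general_linear_lsq([lambda t: np.ones_like(t),
                            lambda t: np.cos(w * t),
                            lambda t: np.sin(w * t)], t, y)
print(a, r2)
```

Note that the normal equations are solved directly rather than by forming the matrix inverse explicitly, which is the numerically preferable way to evaluate the same expression.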
  • 39. |NONLINEAR REGRESSION There are many cases in engineering where nonlinear models must be fit to data. In the present context, these models are defined as those that have a nonlinear dependence on their parameters. For example, f(x) = a0(1 − e^(−a1x)) + e. This equation cannot be manipulated so that it conforms to the general form of Eq. (13.7). As with linear least squares, nonlinear regression is based on determining the values of the parameters that minimize the sum of the squares of the residuals. However, for the nonlinear case, the solution must proceed in an iterative fashion. There are techniques expressly designed for nonlinear regression. For example, the Gauss-Newton method uses a Taylor series expansion to express the original nonlinear equation in an approximate, linear form. Then least-squares theory can be used to obtain new estimates of the parameters that move in the direction of minimizing the residual. Details on this approach are provided elsewhere (Chapra and Canale, 2002).
  • 40. |NONLINEAR REGRESSION An alternative is to use optimization techniques to directly determine the least-squares fit. For example, Eq. (13.12) can be expressed as an objective function to compute the sum of the squares: f(a0, a1) = Σ(yi − a0(1 − e^(−a1xi)))^2. An optimization routine can then be used to determine the values of a0 and a1 that minimize the function. MATLAB's fminsearch function can be used for this purpose. It has the general syntax [x, fval] = fminsearch(fun, x0, options, p1, p2, ...) where x = a vector of the values of the parameters that minimize the function fun, fval = the value of the function at the minimum, x0 = a vector of the initial guesses for the parameters, options = a structure containing values of the optimization parameters as created with the optimset function (recall Section 6.4), and p1, p2, etc. = additional arguments that are passed to the objective function. Note that if options is omitted, MATLAB uses default values that are reasonable for most problems. If you would like to pass additional arguments (p1, p2, ...) but do not want to set the options, use empty brackets [ ] as a placeholder.
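The slide above describes MATLAB's fminsearch. The sketch below shows the same idea in Python using scipy.optimize.minimize with the Nelder-Mead method (a derivative-free simplex search in the same spirit as fminsearch). The model, the "true" parameter values, and the noisy data are assumptions introduced only to make the example runnable.

```python
import numpy as np
from scipy.optimize import minimize

# Synthetic data generated from y = a0*(1 - exp(-a1*x)) with assumed a0 = 2.5, a1 = 0.8.
x = np.linspace(0.5, 10, 20)
y = 2.5 * (1 - np.exp(-0.8 * x)) + 0.05 * np.random.randn(x.size)

def ssr(params, x, y):
    """Objective function: sum of the squares of the residuals for the nonlinear model."""
    a0, a1 = params
    return np.sum((y - a0 * (1 - np.exp(-a1 * x)))**2)

# Nelder-Mead minimizes ssr iteratively, starting from the initial guesses [1.0, 1.0].
result = minimize(ssr, x0=[1.0, 1.0], args=(x, y), method="Nelder-Mead")
a0_fit, a1_fit = result.x
print(a0_fit, a1_fit, result.fun)   # fitted parameters and the minimized sum of squares
```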
  • 44. |NEWTON’S INTERPOLATING POLYNOMIAL There are a variety of alternative forms for expressing an interpolating polynomial beyond the familiar format of Eq. (14.2). Newton's divided-difference interpolating polynomial is among the most popular and useful forms. Before presenting the general equation, we will introduce the first- and second-order versions because of their simple visual interpretation. 14.2.1 Linear Interpolation The simplest form of interpolation is to connect two data points with a straight line. This technique, called linear interpolation, is depicted graphically in Fig. 14.2. Using similar triangles, (f1(x) − f(x1)) / (x − x1) = (f(x2) − f(x1)) / (x2 − x1),
  • 45. which can be rearranged to yield f1(x) = f(x1) + (f(x2) − f(x1)) / (x2 − x1) · (x − x1) (14.5), which is the Newton linear-interpolation formula. The notation f1(x) designates that this is a first-order interpolating polynomial. Notice that besides representing the slope of the line connecting the points, the term [f(x2) − f(x1)] / (x2 − x1) is a finite-divided-difference approximation of the first derivative [recall Eq. (4.20)]. In general, the smaller the interval between the data points, the better the approximation. This is due to the fact that, as the interval decreases, a continuous function will be better approximated by a straight line. This characteristic is demonstrated in the following example. |NEWTON’S INTERPOLATING POLYNOMIAL FIGURE 14.2 Graphical depiction of linear interpolation. The shaded areas indicate the similar triangles used to derive the Newton linear-interpolation formula [Eq. (14.5)].
  • 46. Example 14.2 Estimate the natural logarithm of 2 using linear interpolation. First, perform the computation by interpolating between ln 1 = 0 and ln 6 = 1.791759. Then, repeat the procedure, but use a smaller interval from ln 1 to ln 4 = 1.386294. Note that the true value of ln 2 is 0.6931472. We use Eq. (14.5) from x1 = 1 to x2 = 6 to give f1(2) = 0 + (1.791759 − 0) / (6 − 1) · (2 − 1) = 0.3583519, which represents an error of εt = 48.3%. Using the smaller interval from x1 = 1 to x2 = 4 yields f1(2) = 0 + (1.386294 − 0) / (4 − 1) · (2 − 1) = 0.4620981. |NEWTON’S INTERPOLATING POLYNOMIAL
  • 48. Thus, using the shorter interval reduces the percent relative error to εt = 33.3%. Both interpolations are shown in Fig. 14.3, along with the true function. FIGURE 14.3 Two linear interpolations to estimate ln 2. Note how the smaller interval provides a better estimate.
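A short Python sketch (added here for illustration) reproduces the two linear interpolations of Example 14.2 and their true percent relative errors.

```python
import math

def newton_linear_interp(x1, x2, f1, f2, x):
    """First-order Newton interpolation, Eq. (14.5)."""
    return f1 + (f2 - f1) / (x2 - x1) * (x - x1)

true_value = math.log(2)                      # 0.6931472
for x1, x2 in [(1, 6), (1, 4)]:
    est = newton_linear_interp(x1, x2, math.log(x1), math.log(x2), 2.0)
    err = abs(true_value - est) / true_value * 100   # true percent relative error
    print(f"interval [{x1}, {x2}]: estimate = {est:.7f}, error = {err:.1f}%")
# Prints roughly 0.3583519 (48.3%) and 0.4620981 (33.3%), as in Example 14.2.
```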
  • 50. General Form of Newton's Interpolating Polynomials
  • 51. The preceding analysis can be generalized to fit an (n − 1)th-order polynomial to n data points. The (n − 1)th-order polynomial is fn−1(x) = b1 + b2(x − x1) + · · · + bn(x − x1)(x − x2) · · · (x − xn−1) (14.10). As was done previously with linear and quadratic interpolation, data points can be used to evaluate the coefficients b1, b2, . . . , bn. For an (n − 1)th-order polynomial, n data points are required: [x1, f(x1)], [x2, f(x2)], . . . , [xn, f(xn)]. We use these data points and the following equations to evaluate the coefficients: b1 = f(x1) (14.11), b2 = f[x2, x1] (14.12), b3 = f[x3, x2, x1] (14.13), . . . , bn = f[xn, xn−1, . . . , x2, x1] (14.14), |NEWTON’S INTERPOLATING POLYNOMIAL where the bracketed function evaluations are finite divided differences. For example, the first finite divided difference is represented generally as f[xi, xj] = (f(xi) − f(xj)) / (xi − xj) (14.15).
  • 52. The second finite divided difference, which represents the difference of two first divided differences, is expressed generally as f[xi, xj, xk] = (f[xi, xj] − f[xj, xk]) / (xi − xk) (14.16). Similarly, the nth finite divided difference is f[xn, xn−1, . . . , x2, x1] = (f[xn, xn−1, . . . , x2] − f[xn−1, . . . , x2, x1]) / (xn − x1) (14.17). |NEWTON’S INTERPOLATING POLYNOMIAL
  • 53. FIGURE 14.5 Graphical depiction of the recursive nature of finite divided differences. This representation is referred to as a divided difference table. These differences can be used to evaluate the coefficients in Eqs. (14.11) through (14.14), which can then be substituted into Eq. (14.10) to yield the general form of Newton's interpolating polynomial: fn−1(x) = f(x1) + f[x2, x1](x − x1) + f[x3, x2, x1](x − x1)(x − x2) + · · · + f[xn, xn−1, . . . , x1](x − x1)(x − x2) · · · (x − xn−1) (14.18). |NEWTON’S INTERPOLATING POLYNOMIAL
  • 54. We should note that it is not necessary that the data points used in Eq. (14.18) be equally spaced or that the abscissa values necessarily be in ascending order, as illustrated in the following example. Also, notice how Eqs. (14.15) through (14.17) are recursive; that is, higher-order differences are computed by taking differences of lower-order differences (Fig. 14.5). |NEWTON’S INTERPOLATING POLYNOMIAL
  • 55. Example 14.4 In Example 14.3, data points at x1 = 1, x2 = 4, and x3 = 6 were used to estimate ln 2 with a parabola. Now, adding a fourth point [x4 = 5; f(x4) = 1.609438], estimate ln 2 with a third-order Newton's interpolating polynomial. The third-order polynomial, Eq. (14.10) with n = 4, is f3(x) = b1 + b2(x − x1) + b3(x − x1)(x − x2) + b4(x − x1)(x − x2)(x − x3). The first divided differences for the problem are [Eq. (14.15)] f[x2, x1] = (1.386294 − 0) / (4 − 1) = 0.4620981, f[x3, x2] = (1.791759 − 1.386294) / (6 − 4) = 0.2027326, and f[x4, x3] = (1.609438 − 1.791759) / (5 − 6) = 0.1823216. |NEWTON’S INTERPOLATING POLYNOMIAL
  • 56. The second divided differences are [Eq. (14.16)] f[x3, x2, x1] = (0.2027326 − 0.4620981) / (6 − 1) = −0.05187311 and f[x4, x3, x2] = (0.1823216 − 0.2027326) / (5 − 4) = −0.02041100. The third divided difference is [Eq. (14.17) with n = 4] f[x4, x3, x2, x1] = (−0.02041100 − (−0.05187311)) / (5 − 1) = 0.007865529. |NEWTON’S INTERPOLATING POLYNOMIAL
  • 57. |NEWTON’S INTERPOLATING POLYNOMIAL Thus, the divided difference table is
xi      f(xi)       first        second         third
1       0           0.4620981    −0.05187311    0.007865529
4       1.386294    0.2027326    −0.02041100
6       1.791759    0.1823216
5       1.609438
The results for f(x1), f[x2, x1], f[x3, x2, x1], and f[x4, x3, x2, x1] represent the coefficients b1, b2, b3, and b4, respectively, of Eq. (14.10). Thus, the interpolating cubic is f3(x) = 0 + 0.4620981(x − 1) − 0.05187311(x − 1)(x − 4) + 0.007865529(x − 1)(x − 4)(x − 6), which can be used to evaluate f3(2) = 0.6287686, which represents a relative error of εt = 9.3%. The complete cubic polynomial is shown in Fig. 14.6.
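The divided-difference construction is easy to automate. The Python sketch below (an added illustration, not part of the original slides) builds the table column by column and evaluates the Newton polynomial; applied to the data of Example 14.4 at x = 2, it reproduces f3(2) ≈ 0.6287686.

```python
import numpy as np

def newton_interp(x_data, f_data, x):
    """Newton divided-difference interpolation [Eqs. (14.15)-(14.18)]."""
    n = len(x_data)
    dd = np.zeros((n, n))
    dd[:, 0] = f_data
    # Each successive column holds the next-higher-order finite divided differences.
    for j in range(1, n):
        for i in range(n - j):
            dd[i, j] = (dd[i + 1, j - 1] - dd[i, j - 1]) / (x_data[i + j] - x_data[i])
    # The top row of the table supplies the coefficients b1, b2, ..., bn.
    result, product = dd[0, 0], 1.0
    for j in range(1, n):
        product *= (x - x_data[j - 1])
        result += dd[0, j] * product
    return result

# Data of Example 14.4 (the points need not be equally spaced or in ascending order).
x_data = [1.0, 4.0, 6.0, 5.0]
f_data = [0.0, 1.386294, 1.791759, 1.609438]
print(newton_interp(x_data, f_data, 2.0))   # approximately 0.6287686
```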
  • 58. |NEWTON’S INTERPOLATING POLYNOMIAL FIGURE 14.6 The use of cubic interpolation to estimate ln 2.
  • 61. |LAGRANGE INTERPOLATING POLYNOMIAL Suppose we formulate a linear interpolating polynomial as the weighted average of the two values that we are connecting by a straight line: f1(x) = L1 f(x1) + L2 f(x2) (14.19), where the L's are the weighting coefficients. It is logical that the first weighting coefficient is the straight line that is equal to 1 at x1 and 0 at x2: L1 = (x − x2) / (x1 − x2). Similarly, the second coefficient is the straight line that is equal to 1 at x2 and 0 at x1: L2 = (x − x1) / (x2 − x1). Substituting these coefficients into Eq. (14.19) yields the straight line that connects the points (Fig. 14.8): f1(x) = f(x1)(x − x2) / (x1 − x2) + f(x2)(x − x1) / (x2 − x1) (14.20),
  • 62. |LAGRANGE INTERPOLATING POLYNOMIAL where the nomenclature f1(x) designates that this is a first-order polynomial. Equation (14.20) is referred to as the linear Lagrange interpolating polynomial. The same strategy can be employed to fit a parabola through three points. For this case, three parabolas would be used, with each one passing through one of the points and equaling zero at the other two. Their sum would then represent the unique parabola that connects the three points. Such a second-order Lagrange interpolating polynomial can be written as f2(x) = (x − x2)(x − x3) / [(x1 − x2)(x1 − x3)] · f(x1) + (x − x1)(x − x3) / [(x2 − x1)(x2 − x3)] · f(x2) + (x − x1)(x − x2) / [(x3 − x1)(x3 − x2)] · f(x3) (14.21). Notice how the first term is equal to f(x1) at x1 and is equal to zero at x2 and x3. The other terms work in a similar fashion. Both the first- and second-order versions as well as higher-order Lagrange polynomials can be represented concisely as fn−1(x) = Σ(i=1..n) Li(x) f(xi),
  • 63. |LAGRANGE INTERPOLATING POLYNOMIAL where Li(x) = Π(j=1, j≠i, n) (x − xj) / (xi − xj), n = the number of data points, and Π designates the "product of." FIGURE 14.8 A visual depiction of the rationale behind Lagrange interpolating polynomials. The figure shows the first-order case. Each of the two terms of Eq. (14.20) passes through one of the points and is zero at the other. The summation of the two terms must, therefore, be the unique straight line that connects the two points.
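A compact Python sketch of the concise form above follows: each weight Li(x) is the product over j ≠ i of (x − xj)/(xi − xj), so it equals 1 at xi and 0 at every other data point. The closing check reuses the ln 2 points of Example 14.3 rather than the motor-oil data of Example 14.5, whose table is not reproduced in these slides.

```python
def lagrange_interp(x_data, f_data, x):
    """Lagrange interpolation: f(x) = sum of f(xi) * Li(x)."""
    n = len(x_data)
    total = 0.0
    for i in range(n):
        L = 1.0
        for j in range(n):
            if j != i:
                # The weight Li(x) is 1 at xi and 0 at every other data point xj.
                L *= (x - x_data[j]) / (x_data[i] - x_data[j])
        total += f_data[i] * L
    return total

# Second-order check: estimate ln 2 from the points of Example 14.3.
print(lagrange_interp([1.0, 4.0, 6.0], [0.0, 1.386294, 1.791759], 2.0))
# approximately 0.5658444
```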
  • 64. Example 14.5 Use a Lagrange interpolating polynomial of the first and second order to evaluate the density of unused motor oil at T = 15°C based on the following data: The first-order polynomial [Eq. (14.20)] can be used to obtain the estimate at T = 15°C. |LAGRANGE INTERPOLATING POLYNOMIAL
  • 65. In a similar fashion, the second-order polynomial is developed as in Eq. (14.21). |LAGRANGE INTERPOLATING POLYNOMIAL
  • 68. |INVERSE INTERPOLATION As the nomenclature implies, the f(x) and x values in most interpolation contexts are the dependent and independent variables, respectively. As a consequence, the values of the x's are typically uniformly spaced. A simple example is a table of values derived for the function f(x) = 1/x. Now suppose that you must use the same data, but you are given a value for f(x) and must determine the corresponding value of x. For instance, for the data above, suppose that you were asked to determine the value of x that corresponded to f(x) = 0.3. For this case, because the function is available and easy to manipulate, the correct answer can be determined directly as x = 1/0.3 = 3.3333.
  • 69. |INVERSE INTERPOLATION For a more complicated case, you might be tempted to switch the f(x) and x values [i.e., merely plot x versus f(x)] and use an approach like Newton or Lagrange interpolation to determine the result. Unfortunately, when you reverse the variables, there is no guarantee that the values along the new abscissa [the f(x)'s] will be evenly spaced. In fact, in many cases the values will be "telescoped." That is, they will have the appearance of a logarithmic scale, with some adjacent points bunched together and others spread out widely, as happens, for example, for f(x) = 1/x. Such nonuniform spacing on the abscissa often leads to oscillations in the resulting interpolating polynomial. This can occur even for lower-order polynomials. An alternative strategy is to fit an nth-order interpolating polynomial, fn(x), to the original data [i.e., with f(x) versus x]. In most cases, because the x's are evenly spaced, this polynomial will not be ill-conditioned. The answer to your problem then amounts to finding the value of x that makes this polynomial equal to the given f(x). Thus, the interpolation problem reduces to a roots problem! Such a problem is called inverse interpolation.
  • 70. |INVERSE INTERPOLATION The answer to the inverse interpolation problem of finding the x corresponding to f(x) = 0.3 would therefore involve determining the root of 0.3 = f2(x). For example, for the problem just outlined, a simple approach would be to fit a quadratic polynomial to the three points: (2, 0.5), (3, 0.3333), and (4, 0.25). The result would be f2(x) = 1.08333 − 0.375x + 0.041667x^2. For this simple case, the quadratic formula can be used to solve 0.3 = 1.08333 − 0.375x + 0.041667x^2, giving x = 5.704 and x = 3.296. Thus, the second root, 3.296, is a good approximation of the true value of 3.333. If additional accuracy were desired, a third- or fourth-order polynomial along with one of the root-location methods from Chaps. 5 or 6 could be employed.
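The Python sketch below (added for illustration) reproduces this inverse-interpolation calculation: it fits a quadratic through the three tabulated points and applies the quadratic formula to find where the polynomial equals 0.3.

```python
import numpy as np

# Fit a quadratic f2(x) = c2*x**2 + c1*x + c0 through the three tabulated points.
x_pts = np.array([2.0, 3.0, 4.0])
f_pts = np.array([0.5, 0.3333, 0.25])
c2, c1, c0 = np.polyfit(x_pts, f_pts, 2)          # coefficients, highest power first

# Inverse interpolation: solve f2(x) = 0.3, i.e. c2*x**2 + c1*x + (c0 - 0.3) = 0.
a, b, c = c2, c1, c0 - 0.3
disc = np.sqrt(b**2 - 4 * a * c)
roots = ((-b + disc) / (2 * a), (-b - disc) / (2 * a))
print(roots)   # one root near 5.70, the other near 3.296 (close to the true 1/0.3)
```

For higher-order fits, the same idea applies, but the root would be located numerically (e.g., with a bracketing or open root-finding method) rather than with the quadratic formula.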