
Curve Fitting

Part 5:  Nonlinear Curve Fitting

Given a model function in which the parameters do not appear linearly, how can we find the parameters yielding the best least squares fit of the model?  We attack this problem by using an iterative method based on another type of "linearization", namely first order Taylor approximations.

Suppose we want to fit the power model y = a t^b to our small four-point data set from Part 3.  Let's use the notation

f(t; a,b) = a t^b

in order to explicitly emphasize the dependence of the model on the parameters a and b, given some fixed value of t.

We want to find the optimal values of the parameters a and b yielding the best fitting power function.  Suppose we have an initial guess (a0,b0) for the parameters.  For a fixed value t, we can expand f in a Taylor series about (a0,b0):

f(t; a,b) = f(t; a0,b0) + f_a(t; a0,b0) (a - a0) + f_b(t; a0,b0) (b - b0)
          + higher order terms.

Here the partial derivatives are

f_a(t; a,b) = t^b

and

f_b(t; a,b) = a t^b ln(t)
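
As a quick check (our own verification, not part of the module), these partials can be confirmed with a computer algebra system.  A minimal SymPy sketch:

    import sympy as sp

    # Confirm the two partials of f(t; a, b) = a*t^b symbolically.
    t, a, b = sp.symbols('t a b', positive=True)
    f = a * t**b
    print(sp.diff(f, a))   # prints t**b
    print(sp.diff(f, b))   # prints a*t**b*log(t)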

If we let da = (a - a0) and db = (b - b0) and drop higher order terms in da and db from our Taylor expansion, we get the first order approximation

f(t; a,b) - f(t; a0,b0) = f_a(t; a0,b0) da + f_b(t; a0,b0) db.

Our data set is

(T1,Y1), (T2,Y2), (T3,Y3), (T4,Y4).

We would like to choose a and b so that Y_i = f(T_i; a,b) for i = 1, ..., 4.  Thus, we would like to solve the following system for da and db (and hence for a = a0 + da and b = b0 + db):

Y1 - f(T1; a0,b0) = f_a(T1; a0,b0) da + f_b(T1; a0,b0) db

Y2 - f(T2; a0,b0) = f_a(T2; a0,b0) da + f_b(T2; a0,b0) db

Y3 - f(T3; a0,b0) = f_a(T3; a0,b0) da + f_b(T3; a0,b0) db

Y4 - f(T4; a0,b0) = f_a(T4; a0,b0) da + f_b(T4; a0,b0) db

There are four equations in the two unknowns da and db.  In general, such an overdetermined system has no exact solution, but we can solve it in the least squares sense.  Form the sum of squares of the differences between the left and right sides of each equation.  The values of da and db that minimize this sum of squares are found by solving the normal equations.
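
In matrix form: if X is the 4-by-2 matrix whose columns are the partial derivatives evaluated at the data points, and y* is the vector of left-hand sides, the normal equations read (X^T X) d = X^T y*, where d = (da, db)^T.  Here is a minimal sketch of one such step in Python with NumPy; the function names are our own, and T and Y stand for your data arrays, so this is an illustration under those assumptions rather than the worksheet's code.

    import numpy as np

    def power_model(t, a, b):
        """The model f(t; a, b) = a * t^b."""
        return a * t**b

    def gauss_newton_step(T, Y, a0, b0):
        """One linearized least squares step for y = a*t^b at the guess (a0, b0)."""
        ystar = Y - power_model(T, a0, b0)   # Y_i - f(T_i; a0, b0)
        fa = T**b0                           # f_a(T_i; a0, b0)
        fb = a0 * T**b0 * np.log(T)          # f_b(T_i; a0, b0)
        X = np.column_stack([fa, fb])        # 4-by-2 least squares matrix
        da, db = np.linalg.solve(X.T @ X, X.T @ ystar)   # normal equations
        return a0 + da, b0 + db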

  1. To get an initial guess (a0,b0) for the parameters a and b, fit the power curve y = a t^b through the first point (1.0, 2.5) and the last point (4.0, 50.0) of our small data set.  Show your calculations.

  2. Your helper application worksheet is set up so that you can substitute your initial guess (a0,b0) to form the 4-vectors

    y* = ( Y1 - f(T1; a0,b0), Y2 - f(T2; a0,b0), ... , Y4 - f(T4; a0,b0) )^T

    f_a = ( f_a(T1; a0,b0), f_a(T2; a0,b0), ... , f_a(T4; a0,b0) )^T

    and
    f_b = ( f_b(T1; a0,b0), f_b(T2; a0,b0), ... , f_b(T4; a0,b0) )^T

    Our least squares problem is equivalent to finding the closest vector to y* that lies within the two-dimensional subspace W = span(f_a, f_b).  Solve the normal equations to find the least squares solution values da and db.

  3. Use your solution values da and db to update a and b:

    a = a0 + da
    b = b0 + db


  4. Now use your latest a and b values as your new initial guess (a0,b0) and repeat steps 2 and 3.

  5. Repeat steps 2 and 3 again using your newest estimate of a and b as your initial guess.  Do your values of a and b seem to be converging to the claimed optimal values of a = 0.848 and b = 2.935?

  6. Since steps 2 and 3 should be repeated until convergence is achieved, your helper application worksheet has a looping structure set up to do this automatically.  (A sketch of such a loop in code appears after this list.)  Execute the loop and watch the convergence.  How much accuracy is achieved in the optimal values of a and b?  How many iterations are required?  Compute the residuals Y_i - f(T_i; a,b), for i = 1, ..., 4, corresponding to the optimal fit.  Also compute the sum of squares of these residuals.
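
Continuing the Python sketch from above (and reusing its power_model and gauss_newton_step), the loop of step 6 might look like the following; the tolerance and iteration cap are illustrative choices of ours, not values from the worksheet.

    def fit_power(T, Y, a0, b0, tol=1e-10, max_iter=50):
        """Repeat Gauss-Newton steps until a and b stop changing."""
        a, b = a0, b0
        for k in range(1, max_iter + 1):
            a_new, b_new = gauss_newton_step(T, Y, a, b)
            converged = abs(a_new - a) < tol and abs(b_new - b) < tol
            a, b = a_new, b_new
            if converged:
                break
        residuals = Y - power_model(T, a, b)   # Y_i - f(T_i; a, b)
        return a, b, k, residuals, float(np.sum(residuals**2))

The return values are the fitted parameters, the number of iterations used, the residual vector, and the residual sum of squares, i.e. the quantities asked for in step 6.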

Let's now take up the ambitious task of fitting the logistic growth model

f(t; P0,K,r) = K P0 / ( P0 + (K - P0) exp(-r t) )

to the U.S. population data of Part 1.

  1. Use your helper application to find the partial derivative of f with respect to each parameter: P0, K, and r.

  2. Using the initial guess P0 = 78, K = 700, and r = 0.0168, form the following vectors in R^10:

    y* = ( Y1 - f(T1; P0,K,r), Y2 - f(T2; P0,K,r), ... , Y10 - f(T10; P0,K,r) )^T

    f_P0 = ( f_P0(T1; P0,K,r), f_P0(T2; P0,K,r), ... , f_P0(T10; P0,K,r) )^T

    f_K = ( f_K(T1; P0,K,r), f_K(T2; P0,K,r), ... , f_K(T10; P0,K,r) )^T

     and
    f_r = ( f_r(T1; P0,K,r), f_r(T2; P0,K,r), ... , f_r(T10; P0,K,r) )^T.

    Form the least squares matrix X and solve the normal equations for dP0, dK, and dr.  Use these values to update P0, K, and r.

  3. Now use the looping structure in your helper application worksheet to iteratively solve for the optimal P0, K, and r.  How many iterations are needed?  (A sketch of such an iteration in code appears after this list.)

  4. Plot the least squares logistic curve that you just found together with a scatter plot of the U.S. population data.  How good is the fit?

  5. Make a residual plot for your optimal logistic fit.  Compare the logistic fit to the quadratic fit from Part 1 and the exponential fit from Part 2.
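
For reference, the whole logistic computation can be sketched in the same Python style.  The SymPy call below derives the three partials automatically (echoing step 1); the function names and the fixed iteration count are our own choices, and T and Y stand for the decade and population arrays from Part 1.

    import numpy as np
    import sympy as sp

    # Derive the three partial derivatives symbolically, echoing step 1.
    t, P0, K, r = sp.symbols('t P0 K r', positive=True)
    f = K * P0 / (P0 + (K - P0) * sp.exp(-r * t))
    f_num = sp.lambdify((t, P0, K, r), f, 'numpy')
    grad_num = [sp.lambdify((t, P0, K, r), sp.diff(f, p), 'numpy')
                for p in (P0, K, r)]

    def fit_logistic(T, Y, params, iterations=20):
        """Gauss-Newton iteration for the logistic model; params = [P0, K, r].

        The fixed iteration count is an illustrative choice; in practice
        one loops until the updates dP0, dK, dr are negligible.
        """
        p = np.asarray(params, dtype=float)
        for _ in range(iterations):
            ystar = Y - f_num(T, *p)                           # residual vector y*
            X = np.column_stack([g(T, *p) for g in grad_num])  # 10-by-3 matrix
            d = np.linalg.solve(X.T @ X, X.T @ ystar)          # normal equations
            p = p + d
        return p

    # Example call with the initial guess from step 2 (you supply T and Y):
    # P0_opt, K_opt, r_opt = fit_logistic(T, Y, [78.0, 700.0, 0.0168])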
