4 Sequences as Functions

We’ve seen many different types of function in this course and used them to model various situations.

In practice, one is often faced with the opposite problem: given experimental data, what type of

function should you try?

4.1 Polynomial Sequences: First, Second, and Higher Differences

To begin to answer this, first ask yourself, “What is a sequence?” Hopefully you have a decent intuitive idea already. More formally, a sequence is a function whose domain is a set like the natural numbers. For example,

$f : \mathbb{N} \to \mathbb{R} : n \mapsto 3n^2 - 2$

defines the sequence

$\bigl(f(1), f(2), f(3), \ldots\bigr) = (1, 10, 25, 46, 73, \ldots)$



This is indeed the intuitive idea of a function to many grade-

school students: continuity and domains including fractions

or even irrational numbers are more advanced concepts.

Suppose instead that all you have is a data set

x 1 2 3 4 5

y 1 10 25 46 73


perhaps arising from an experiment. Could you recover the original function y = f (x) directly from

this data? You could try plotting data points as we’ve done, though it is hard to decide directly

from the plot whether we should try a quadratic model, some other power function/polynomial, or

perhaps an exponential. Of course, the physical source of real-world data might also provide clues.

A more mathematical approach involves considering how data values change:

x:   1    2    3    4    5      (first-differences: +1, +1, +1, +1)
y:   1   10   25   46   73      (first-differences: +9, +15, +21, +27)

The first-differences in the x-values are constant whereas those for the y-values are increasing:

$(y_{n+1} - y_n) = (9, 15, 21, 27, \ldots)$

You likely already notice the pattern: the sequence of first-differences is increasing linearly as the arithmetic sequence

$y_{n+1} - y_n = 3 + 6n$

To make this even clearer, note that the sequence of second-differences in the y-values is constant

(+6). These facts are huge clues that we expect a quadratic function.

But why? Well we can certainly check the following directly:

Linear Model If $f(n) = an + b$, then the sequence of first-differences is constant:

$f(n+1) - f(n) = a$

Quadratic Model If $f(n) = an^2 + bn + c$, then the sequence of first-differences is linear and the second-differences are constant:

$g(n) := f(n+1) - f(n) = 2an + a + b, \qquad g(n+1) - g(n) = 2a$

The relationship between these results and the derivative(s) of the original function f (x) should feel

intuitive: what happens if you differentiate a quadratic twice?
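The difference test described above is easy to automate. The following is a minimal sketch (the helper names `differences` and `difference_table` are illustrative and not part of the text) that tabulates successive differences of the y-values 1, 10, 25, 46, 73 from the start of this section.

```python
def differences(values):
    """Return the first-differences of a list of numbers."""
    return [b - a for a, b in zip(values, values[1:])]


def difference_table(values, depth=2):
    """Return [values, first-differences, second-differences, ...] up to `depth`."""
    rows = [list(values)]
    for _ in range(depth):
        rows.append(differences(rows[-1]))
    return rows


# The y-values of the data set discussed above.
y = [1, 10, 25, 46, 73]
table = difference_table(y)
for order, row in enumerate(table):
    print(f"order {order}: {row}")
```

A constant row at order 1 suggests a linear model; a constant row at order 2 suggests a quadratic one. The sketch assumes the x-values are equally spaced.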

Example 4.1. You are given the following data set

x 0 2 4 6 8 10

y 2 16 22 20 10 −8

The x-values have constant ﬁrst-differences while the y-values

have constant second-differences

First-differences: 14, 6, −2, −10, −18

Second-differences: −8, −8, −8, −8


We therefore suspect a quadratic model $y = f(n) = an^2 + bn + c$. Rather than using the above formulae, particularly since the x-differences are not 1, it is easier just to substitute:

$2 = f(0) = c, \qquad \begin{cases} 16 = f(2) = 4a + 2b + 2 \\ 22 = f(4) = 16a + 4b + 2 \end{cases} \implies \begin{cases} 2a + b = 7 \\ 8a + 2b = 10 \end{cases} \implies 4a = -4$

whence $a = -1$, $b = 9$ and $c = 2$. A quadratic model is therefore

$y = f(n) = -n^2 + 9n + c = -n^2 + 9n + 2$

It is easily veriﬁed that the remaining data values satisfy this relationship.
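The substitution and the final check can also be done mechanically. This is a sketch only: `fit_quadratic` is a hypothetical helper that fits a quadratic through three points using divided differences (a different route from the elimination used above, chosen because it works for any spacing of the x-values), and exact fractions are used to avoid rounding.

```python
from fractions import Fraction


def fit_quadratic(p0, p1, p2):
    """Return (a, b, c) so that a*x**2 + b*x + c passes through three points."""
    (x0, y0), (x1, y1), (x2, y2) = [
        (Fraction(x), Fraction(y)) for x, y in (p0, p1, p2)
    ]
    slope01 = (y1 - y0) / (x1 - x0)        # first divided difference
    slope12 = (y2 - y1) / (x2 - x1)
    a = (slope12 - slope01) / (x2 - x0)    # second divided difference
    b = slope01 - a * (x0 + x1)
    c = y0 - a * x0**2 - b * x0
    return a, b, c


# Data of Example 4.1: fit the first three points, then test all six.
xs = [0, 2, 4, 6, 8, 10]
ys = [2, 16, 22, 20, 10, -8]
a, b, c = fit_quadratic(*zip(xs[:3], ys[:3]))
fits_all = all(a * x**2 + b * x + c == y for x, y in zip(xs, ys))
print(a, b, c, fits_all)
```

The last value printed reports whether the remaining data values satisfy the fitted relationship.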

There are at least two issues with our method:

1. The question we’re answering is, “Find a quadratic model satisfying given data.” Constant

second-differences don’t guarantee that only a quadratic model is suitable. For example,

$y = -n^2 + 9n + 2 + 297n(n-2)(n-4)(n-6)(n-8)(n-10)$

is a very complicated model satisfying the same data set!

2. It is very unlikely that experimental data will ﬁt such precise patterns (why not?). However,

if the differences are close to satisfying such patterns, then you should feel conﬁdent that a

linear/quadratic model is a good choice.

Example 4.2. Given the data set

x 0 2 4 6 8 10

y 3 23 41 59 77 93

with sequences of ﬁrst- and second-differences

First-differences: 20, 18, 18, 18, 16

Second-differences: −2, 0, 0, −2


do you think a linear or quadratic model would be superior?

If you wanted a linear model, you’d likely be inclined to try f (x) = 9x + b for some constant b. Here

are two options:

1. f (x) = 9x + 5 ﬁts the middle four data values perfectly, but as a predictor is too large at the

endpoints: f (0) = 5 > 3 and f (10) = 95 > 93.

2. $f(x) = 9x + 5 - \frac{2}{3}$ doesn’t pass through any of the data values but seems to reduce the net error to zero:

x            0      2      4      6      8     10
f(x) − y    4/3   −2/3   −2/3   −2/3   −2/3    4/3

$\implies \sum \bigl(f(x) - y\bigr) = 0$

Neither model is perfect, but then this is what you expect with real-world data!
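To compare the two linear options numerically, one can tabulate the errors f(x) − y. The sketch below makes one assumption that is computed rather than taken from the text: the second model is obtained by lowering the intercept of the first by its mean error, which is what forces the net error to zero. The function name `residuals` is illustrative.

```python
from fractions import Fraction

# Data of Example 4.2.
xs = [0, 2, 4, 6, 8, 10]
ys = [3, 23, 41, 59, 77, 93]


def residuals(intercept, slope=9):
    """Errors f(x) - y for the linear model f(x) = slope*x + intercept."""
    return [slope * x + intercept - y for x, y in zip(xs, ys)]


first = residuals(Fraction(5))              # option 1: f(x) = 9x + 5
shift = sum(first) / len(first)             # mean error of option 1
second = residuals(Fraction(5) - shift)     # option 2: intercept lowered by the mean error

print("option 1 errors:", [str(e) for e in first], "net:", sum(first))
print("shift:", shift)
print("option 2 errors:", [str(e) for e in second], "net:", sum(second))
```

A zero net error does not mean a perfect fit: positive and negative errors simply cancel.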

Exercises 4.1. 1. For each data set, ﬁnd a function y = f (x) modelling the data.

(a) x 2 4 6 8

y −1 2 7 14

(b) x 2 5 8 11 14

y −6 −15 −6 21 66

y 3 15 21 33

(Be careful with (c): the x-differences aren’t constant!)

2. Suppose a table of data values containing $(x_n, y_n)$ has constant first-differences in both variables

$\Delta x = x_{n+1} - x_n = a, \qquad \Delta y = b$

Find the equation of the linear function y = f (x) through the data.

3. What relationship do you expect to find with the sequential differences of a cubic function $f(n) = an^3 + bn^2 + cn + d$? What about a degree-m polynomial $f(n) = an^m + bn^{m-1} + \cdots$?

4. If $f(n) = an^2 + bn + c$ is a quadratic model for the data in Example 4.2 with constant second-differences −1, show that $a = -\frac{1}{8}$. What might be reasonable values for b, c?

5. (Hard) Suppose f (x) is a twice-differentiable function and h > 0 is constant. Use the mean

value theorem from calculus to explain the following.

(a) First-differences $f(x+h) - f(x)$ are proportional to $f'(\xi)$ for some $\xi \in (x, x+h)$.

(b) Second-differences satisfy $\bigl[f(x+2h) - f(x+h)\bigr] - \bigl[f(x+h) - f(x)\bigr] = f''(\xi)\,h\alpha$ for some ξ between x and x + h and some α. Why is it unlikely that α is constant?

4.2 Exponential, Logarithmic & Power Sequences

To observe relationships between data values, you might also have to consider ratios between successive terms or skip values.

Example 4.3. From a ﬁrst glance at the given data, it is hard to decide whether an exponential or

a quadratic (or higher degree polynomial) model is more suitable. If we try to apply the constant-

difference method, we don’t seem to get anything helpful:

x:    1     3      5       7      (first-differences: +2, +2, +2)
y:   15   135   1215   10935      (first-differences: +120, +1080, +9720; second-differences: +960, +8640)

By the time we’re looking at second-differences, any conclusion

would be very weak since we only have two data values!


If instead we think about ratios of y-values, then a different pattern emerges:

x:    1     3      5       7      (differences: +2, +2, +2)
y:   15   135   1215   10935      (ratios: ×9, ×9, ×9)

The question remains: what type of function scales its output by 9 when 2 is added to its input, $f(x+2) = 9f(x)$? This is a function that converts addition to multiplication: an exponential! If we try $y = f(x) = ba^x$ for some constants a, b, then

$f(x+2) = ba^{x+2} = ba^x a^2 = a^2 f(x)$

so $a^2 = 9$ and $a = 3$. Then $f(1) = 3b = 15$ gives $b = 5$, from which a suitable model is $y = 5 \cdot 3^x$.

We can see the pattern in the example more generally:

Exponential Model If $f(x) = ba^x$, then adding a constant to x results in

$f(x+k) = ba^{x+k} = a^k f(x)$

If x-values have constant differences (+k), then y-values will be related by a constant ratio ($\times a^k$).

You might remember this as ‘addition–product’ or ‘arithmetic–geometric.’
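The ratio test can be sketched in a few lines. The helper `fit_exponential` below is hypothetical: it assumes equally spaced x-values and an exactly constant y-ratio, and recovers a and b in y = b·a^x from the first two data points only.

```python
def ratios(values):
    """Return the ratios of successive terms."""
    return [b / a for a, b in zip(values, values[1:])]


def fit_exponential(xs, ys):
    """Fit y = b * a**x from the first two points (assumes a constant y-ratio)."""
    k = xs[1] - xs[0]          # constant x-difference
    r = ys[1] / ys[0]          # constant y-ratio, equal to a**k
    a = r ** (1 / k)
    b = ys[0] / a ** xs[0]
    return a, b


# Data of Example 4.3.
xs = [1, 3, 5, 7]
ys = [15, 135, 1215, 10935]
print(ratios(ys))
a, b = fit_exponential(xs, ys)
print(a, b)
```

With real data the ratios will only be approximately constant, so a and b found this way depend on which pair of points is used.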

Such a simple pattern is often disguised:

• Complete data might not be given so you might have to skip some data values to see a pattern.

For example, if our original data was

x 1 3 4 5 7

y 15 135 405 1215 10935

then the x-values are not in a strictly arithmetic sequence.

• As in Example 4.2, real-world/experimental data will only approximately exhibit such patterns.

Example 4.4. A population of rabbits is measured every two months resulting in the data set

t 0 2 4 6 8 10

P 5 7 10 14 19 28

The data seems very close to being quadratic; consider the first and second sequences of P-differences

$\Delta P = (2, 3, 4, 5, 9), \qquad \Delta\Delta P = (1, 1, 1, 4)$

However, the last difference doesn’t fit the pattern. Instead, the fact that we expect an exponential model is buried in the experiment: the data is measuring population growth! We therefore instead consider the ratios of P-values:

t:   0   2    4    6    8   10
P:   5   7   10   14   19   28      (ratios: ×1.4, ×1.43, ×1.4, ×1.36, ×1.47)

The ratios are very close to being constant, whence an exponential model is suggested! To exactly match the first and last data values, we could take the model

$P(t) = 5\left(\frac{28}{5}\right)^{t/10} \approx 5(1.188)^t$

t   0   2       4       6        8        10
P   5   7.057   9.960   14.057   19.839   28

Only P(8) doesn’t match when we take rounding to the nearest integer into account.
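The comparison between model and data can be reproduced as follows. This sketch assumes the model is the exponential pinned to the first and last data values, P(t) = 5·(28/5)^(t/10), which is what matching P(0) = 5 and P(10) = 28 requires; the function name `model` is illustrative.

```python
# Data of Example 4.4.
ts = [0, 2, 4, 6, 8, 10]
P = [5, 7, 10, 14, 19, 28]


def model(t):
    """Exponential through (0, 5) and (10, 28)."""
    return 5 * (28 / 5) ** (t / 10)


# Times at which the model, rounded to the nearest integer, misses the data.
mismatches = [t for t, p in zip(ts, P) if round(model(t)) != p]

for t, p in zip(ts, P):
    print(t, p, round(model(t), 3))
print("mismatches at t =", mismatches)
```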

We’ve seen that addition-addition corresponds to a linear model and that addition-multiplication to

an exponential. There are two other natural combinations.

Logarithms These operate exactly as exponentials but in reverse. If $f(x) = \log_a x + b$, then multiplying x by a constant results in a constant addition/subtraction to y:

$f(kx) = \log_a(kx) + b = \log_a k + \log_a x + b = \log_a k + f(x)$

This could be summarized as ‘product–addition.’

Power Functions If $f(x) = ax^m$, then multiplying x by a constant will do the same to y:

$f(kx) = a(kx)^m = ak^m x^m = k^m f(x)$

We have a ‘product–product’ relationship between successive terms.
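The four combinations (addition–addition, addition–product, product–addition, product–product) can be checked mechanically. The following is a rough sketch: `classify` is a hypothetical helper that only recognizes exactly constant differences or ratios between successive terms, so it will report "unclear" for noisy data or for data where values must be skipped to see the pattern.

```python
import math


def diffs(values):
    return [b - a for a, b in zip(values, values[1:])]


def ratios(values):
    return [b / a for a, b in zip(values, values[1:])]


def is_constant(values):
    return all(math.isclose(v, values[0], rel_tol=1e-9) for v in values)


def classify(xs, ys):
    """Name the model suggested by constant differences/ratios in xs and ys."""
    x_add = is_constant(diffs(xs))
    x_mul = 0 not in xs and is_constant(ratios(xs))
    y_add = is_constant(diffs(ys))
    y_mul = 0 not in ys and is_constant(ratios(ys))
    if x_add and y_add:
        return "linear"
    if x_add and y_mul:
        return "exponential"
    if x_mul and y_add:
        return "logarithmic"
    if x_mul and y_mul:
        return "power"
    return "unclear"


print(classify([1, 3, 5, 7], [15, 135, 1215, 10935]))   # data of Example 4.3
print(classify([1, 2, 3, 4], [5, 8, 11, 14]))           # illustrative values of 3x + 2
```

At least three data points are needed for the test to mean anything, since any two points trivially have "constant" differences and ratios.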

Examples 4.5. Find the patterns in the following data and suggest a model y = f (x) in each case.

x 6 18 54 162

y 1 2 3 4

x 3 6 9 12

y 135 1080 3645 8640

The sequential approach in this chapter is a form of discrete calculus: using a pattern of differences to predict the original function is similar to how we use knowledge of a derivative $f'(x)$ to find $f(x)$.

Example 4.6. Suppose g(2) = 3 and g(4) = 9. What do you think should be the value of g(8)?

It depends on the type of model you try.

1. For a linear (addition–addition) model we know that $\Delta x = 2$ corresponds to $\Delta y = 6$, so

$g(8) = g(4 + 2\Delta x) = g(4) + 2\Delta y = 9 + 12 = 21$

2. For an exponential (addition–product) model, $\Delta x = 2$ corresponds to a y-ratio $r_y = 3$, so

$g(8) = r_y\, g(6) = r_y^2\, g(4) = 9 \cdot 9 = 81$

3. For a power (product–product) model, an x-ratio $r_x = 2$ corresponds to a y-ratio $r_y = 3$, so

$g(8) = g(2 \cdot 4) = g(4 r_x) = r_y\, g(4) = 3 \cdot 9 = 27$

[Figure: graphs of the three models through the points (2, 3) and (4, 9)]

1. $g(x) = 3x - 3$     2. $g(x) = 3^{x/2}$     3. $g(x) = x^{\log_2 3}$

We do not need to calculate the models explicitly(!), though they are stated below the graph for

convenience.
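The three predictions can be checked numerically. In this sketch each model is built directly from the two given values g(2) = 3 and g(4) = 9; the function names are illustrative, and the power-model exponent is computed from the data rather than assumed.

```python
import math

# The two known values g(2) = 3 and g(4) = 9 from Example 4.6.
x1, y1 = 2, 3
x2, y2 = 4, 9


def linear(x):
    slope = (y2 - y1) / (x2 - x1)
    return y1 + slope * (x - x1)


def exponential(x):
    ratio = y2 / y1                      # y-ratio for an x-step of x2 - x1
    return y1 * ratio ** ((x - x1) / (x2 - x1))


def power(x):
    exponent = math.log(y2 / y1) / math.log(x2 / x1)
    return y1 * (x / x1) ** exponent


for name, g in [("linear", linear), ("exponential", exponential), ("power", power)]:
    print(name, round(g(8), 6))
```

All three functions pass through both given points, which illustrates why two data values alone cannot decide between model types.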

Exercises 4.2. 1. Find the patterns in the following data sets and use them to ﬁnd a model y = f (x).

(a) x 0 1 2 3 4

y 80 120 180 270 405

(b) x 2 4 8 10

y 1 16 256 625

y 15 5 19 57 119

(d) x 1 3 4 6

y 1 36 216 7776

(e) x 20 60 180 540

y 2 4 6 8

(f) x 2 6 54 486 4374

y 2 4 8 12 16

2. Take logarithms of the power relationship $y = ax^m$. What is the relationship between ln y and ln x? Use this to give another reason why the inputs and outputs of power functions satisfy a ‘product–product’ relationship.

3. How does our analysis of exponential functions change if we add a constant to the model? That is, how might you recognize a sequence arising from a function $f(x) = ba^x + c$?

4. Suppose f (5) = 12 and f (10) = 18. Find the value of f (20) supposing f (x) is a:

(a) Linear function;

(b) Exponential function;

(c) Power function.

If f(20) = 39, which of the three models do you think would be more appropriate?

4.3 Newton’s Method

To ﬁnish our discussion of sequences we revisit a (hopefully) familiar technique for approximating

solutions to equations. Variations of this approach have been in use for thousands of years.

Example 4.7. We motivate the method by considering an ancient method for approximating $\sqrt{2}$, known to the Babylonians 2500 years ago!

Suppose $x_n > \sqrt{2}$. Then $\frac{2}{x_n} < \sqrt{2}$. It seems reasonable to guess that their average

$x_{n+1} := \frac{1}{2}\left(x_n + \frac{2}{x_n}\right)$

should be a more accurate approximation to $\sqrt{2}$. If we start with an initial guess $x_0 = 2$, then we obtain the sequence

$x_1 = \frac{1}{2}\left(2 + \frac{2}{2}\right) = \frac{3}{2}, \quad x_2 = \frac{17}{12} = 1.4166\ldots, \quad x_3 = \frac{577}{408} = 1.4142\ldots, \quad \ldots$

This sequence certainly appears to be converging to $\sqrt{2}$. . .

Since it makes use of the average, this approach is sometimes called the method of the mean. It may be applied to any square root $\sqrt{a}$ where $a > 0$: let $x_0 > 0$ and define

$x_{n+1} := \frac{1}{2}\left(x_n + \frac{a}{x_n}\right) \qquad (\ast)$
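The method of the mean is short enough to run by hand, but a sketch in code makes the rational iterates easy to inspect. The function name `method_of_mean` is illustrative; exact fractions are used so that each iterate appears in lowest terms.

```python
from fractions import Fraction


def method_of_mean(a, x0, steps):
    """Iterate x -> (x + a/x) / 2 exactly, returning all iterates including x0."""
    x = Fraction(x0)
    iterates = [x]
    for _ in range(steps):
        x = (x + Fraction(a) / x) / 2
        iterates.append(x)
    return iterates


# Approximating the square root of 2 from the initial guess 2.
for x in method_of_mean(2, 2, 3):
    print(x, float(x))
```

Each step roughly doubles the number of correct digits, at the cost of rapidly growing numerators and denominators.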

A rigorous proof that the sequence converges requires more detail than is appropriate for us (though

see Exercise 3), but two observations should make it seem more believable:

1. If the sequence (∗) has a limit L, then the limit must satisfy

$L = \frac{1}{2}\left(L + \frac{a}{L}\right) \implies 2L^2 = L^2 + a \implies L^2 = a \implies L = \sqrt{a}$

where we take the positive root since all terms $x_n$ are plainly positive.

2. The iterations have a convincing geometric interpretation. The sequence of iterates can be found by repeatedly taking the tangent line to the curve $y = f(x) = x^2 - a$ and intersecting it with the x-axis. To see why, observe that the tangent line at $x_n$ has equation

$y = f(x_n) + f'(x_n)(x - x_n) = x_n^2 - a + 2x_n(x - x_n) = 2x_n x - x_n^2 - a$

which intersects the x-axis ($y = 0$) when

$x = \frac{x_n^2 + a}{2x_n} = \frac{1}{2}\left(x_n + \frac{a}{x_n}\right) = x_{n+1}$

[Figure: the curve $y = x^2 - a$ with its tangent line at $x_n$ meeting the x-axis at $x_{n+1}$]

This geometric idea generalizes. . .

Definition 4.8. Given a differentiable function f(x) with non-zero derivative, the Newton–Raphson iterates of an initial value $x_0$ are defined by the recurrence formula

$x_{n+1} := x_n - \frac{f(x_n)}{f'(x_n)}$
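As a rough illustration of the recurrence, here is a floating-point sketch. The function `newton`, its stopping tolerance and its iteration cap are all assumptions made for this example; the definition itself says nothing about when to stop.

```python
def newton(f, df, x0, tol=1e-12, max_iter=50):
    """Apply x -> x - f(x)/df(x) until successive iterates differ by less than tol."""
    x = x0
    for _ in range(max_iter):
        slope = df(x)
        if slope == 0:
            raise ZeroDivisionError("derivative vanished; the tangent line is horizontal")
        x_next = x - f(x) / slope
        if abs(x_next - x) < tol:
            return x_next
        x = x_next
    raise RuntimeError("no convergence within max_iter iterations")


# Recovering the square root of 2 as a root of x^2 - 2.
root = newton(lambda x: x**2 - 2, lambda x: 2 * x, x0=2.0)
print(root)
```

The explicit failure branch matters: the iterates are not guaranteed to converge, so the sketch reports that case rather than returning a meaningless value.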

Our two previous observations still hold:

1. If $L = \lim_{n\to\infty} x_n$ exists and $f'(L) \neq 0$, then

$L = L - \frac{f(L)}{f'(L)} \implies f(L) = 0$

That is, the limit L is a root of the function f(x).

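2. The tangent line at $\bigl(x_n, f(x_n)\bigr)$ forms a right-triangle with base $x_n - x_{n+1}$ and height $f(x_n)$, from which its slope is

$f'(x_n) = \frac{f(x_n)}{x_n - x_{n+1}}$

Rearranging this gives the formula $x_{n+1} = x_n - \frac{f(x_n)}{f'(x_n)}$.

[Figure: the tangent line to $y = f(x)$ at $\bigl(x_n, f(x_n)\bigr)$ meeting the x-axis at $x_{n+1}$]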

Newton’s method is particularly nice for polynomials with integer coefficients, since the iterates form a sequence of rational numbers. This approach was often used to obtain rational approximations to irrational numbers before the advent of calculators.

Examples 4.9. 1. To find a root of $f(x) = x^4 + 4x - 6$, start with $x_0 = 2$ and iterate

$x_{n+1} = x_n - \frac{x_n^4 + 4x_n - 6}{4x_n^3 + 4} = \frac{3(x_n^4 + 2)}{4(x_n^3 + 1)}$

which yields the sequence (to 3 d.p.)

$(x_n) = \left(2, \frac{3}{2}, \frac{339}{280}, \ldots\right) = (2, 1.5, 1.211, 1.121, 1.114, 1.114, \ldots)$

You can check with a calculator that 1.114 is approximately a root.

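2. The irrational number $x = \sqrt{2} + \sqrt{3}$ is a root of the polynomial

$f(x) = x^4 - 10x^2 + 1$

By applying Newton’s method with $x_0 = 3$, we obtain the sequence (to 3 d.p.)

$x_{n+1} = x_n - \frac{x_n^4 - 10x_n^2 + 1}{4x_n^3 - 20x_n} = \frac{3x_n^4 - 10x_n^2 - 1}{4x_n(x_n^2 - 5)} \implies (x_n) = \left(3, \frac{19}{6} = 3.167, 3.147, \ldots\right)$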
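Because the iterates for an integer-coefficient polynomial are rational, they can be computed exactly. The sketch below assumes the polynomial of the first example is x^4 + 4x − 6, the reading consistent with the listed iterates 3/2 and 339/280; `newton_exact` is an illustrative name.

```python
from fractions import Fraction


def newton_exact(f, df, x0, steps):
    """Newton iterates computed with exact rational arithmetic."""
    x = Fraction(x0)
    iterates = [x]
    for _ in range(steps):
        x = x - f(x) / df(x)
        iterates.append(x)
    return iterates


def f(x):
    return x**4 + 4 * x - 6


def df(x):
    return 4 * x**3 + 4


iterates = newton_exact(f, df, 2, 4)
for x in iterates[:3]:
    print(x)                                  # exact fractions
print([round(float(x), 3) for x in iterates])  # decimal approximations
```

The fractions grow very quickly, which is why only the first few are printed exactly.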



Newton’s method can be attempted for any differentiable function, though the sequence isn’t guaranteed to converge: see for instance Exercise 5. You can find graphical interfaces online for this (for instance with Geogebra).

Exercises 4.3. 1. Use Newton’s method to ﬁnd a root of the given function to 4 decimal places.

(Use a calculator, but explain what you are doing!)

(a) f (x) = x

−4 (b) f (x) = 2x

+ x − 1 (c) f (x) = e

−

√

x −2

2. Use Newton’s method to find a rational number approximation $\frac{p}{q}$ to $\sqrt{2}$ in lowest terms where $10 < q < 100$.

3. Suppose you perform Newton’s method for the function $f(x) = x^2 - 2$ starting with some positive $x_0 > 0$.

(a) If x

> 0, show that x

n+1

−

√

2 =

−

√



−

√



−

√

2) .

(b) Explain why



−

√



−

√



. Hence conclude that the sequence of iterates (x

)

converges to

√

4. We might consider a method of the mean for approximating

√

2: given x

, deﬁne

n+1





(a) If the sequence (x

) converges, show that its limit is

√

(b) If x

√

2, show that

√

= 1. Compute x

and x

. Compare these with the values obtained using Newton’s

method for the function f (x) = x

−2 with the same initial condition x

= 1.

5. Let $f(x) = x^3 - 5x$.

(a) What happens if you apply Newton’s method to this function with initial condition $x_0 = 1$? Draw a picture to illustrate.

(b) (Just for fun!) Investigate what happens for other values of $x_0$. Can you make any conjectures? Is it possible for $x_0$ to be positive and yet for $x_n \to -\sqrt{5}$? Can you make any sense

of what happens if 1 < x