Math 8 — Functions and Modeling

Neil Donaldson

Spring 2024

Introduction

This course aims to refresh and provide some conceptual foundation to several mathematical topics

commonly encountered at grade-school level. The job of a teacher is often one of selection; choosing

examples and explanations suited to the level and experience of your students. To do this effectively,

you need to understand concepts at a higher level than you’ll likely ever teach. Not all of our topics

are central to the grade-school curriculum, and it is not our goal to teach you how to teach, though the

ideas and approaches we’ll explore are often suitable for a grade-school audience. The mathematics in

this course shouldn’t present much difﬁculty, requiring at most elementary calculus and a tiny bit of

linear algebra; you should instead be considering how to explain the material, particularly to students

with less mathematical knowledge than yourself.

We start with two motivational problems.

1. You wish to travel across the surface of a cube between two oppo-

site vertices so that your path is as short as possible.

Should you follow the path indicated?

If yes, explain why.

If not, how should you ﬁnd the shortest path?

2. Two houses are to be connected to the elec-

tricity supply using a single connection.

How should we determine where to place

the connection so as to minimize the required

length of wire?

What information do you need in order to

ﬁnd the connection point?

connection

wires

electric supply

House 1

House 2

The goal isn’t merely to ﬁnd the right answer! Consider how you might discuss these problems with

grade-school students of different ability levels. Why might calculus not be a sensible approach? Are

there any similarities between the two problems? Brainstorm some strategies. . .

We are grateful to materials from UT Austin’s UTeach program for suggesting several of the examples in this course

including these motivational problems.

1 Sets & Functions

1.1 Basic Deﬁnitions

Consider how central functions are to mathematics, and how long you’ve been using them. How

would you deﬁne “function” to someone with limited mathematical knowledge? Would you use

words like rule, assign, element, domain, vertical line test, etc.? How helpful are these to your audience?

Examples 1.1. How would you explain the idea that the following do or do not represent functions?

1. y = x

2. Mon: ﬁsh, Tue: pork, Wed: fajitas, Thur: carbonara, Fri: pizza, Sat: ﬁsh, Sun: pizza

3. ( 3, 5), (2, 6), (4, 2), (3, 1).

4. x

= y

After considering the example, perhaps you settle on a semi-formal deﬁnition:

A function f is rule which assigns to each input x exactly one output f (x)

Is this a useful deﬁnition? In what ways is it imprecise? How much does this matter?

The answers to these questions depend on your audience! From a formal palate a teacher selects

enough to convey something important without overburdening and intimidating their students. For

a more complete picture, we begin by thinking about what we allow to be inputs and outputs.

Deﬁnition 1.2. A set A is a collection of objects, known as elements.

The notation a ∈ A means that

a is an element of A, sometimes read as ‘a lies in A.’ Sets are usually written with upper case letters

and elements with lower.

A set B is a subset of a set A, written B ⊆ A, if every element of B is also an

element of A: that is,

b ∈ B =⇒ b ∈ A

The picture indicates sets A, B and elements a, b for which B ⊆ A, a ∈ A,

b ∈ B and a /∈ B.

Examples 1.3. 1. Suppose the elements of a set A are the numbers 1, 3, 5, 7 and 9. The simplest way

to write this is using roster notation: list the elements (in any order!) between braces

A = {1, 3, 5, 7, 9}

Subsets are commonly expressed in set-builder notation: {a ∈ A : condition on a}. For example,

B = {a ∈ A : 2 < a < 8} (the set of a in A such that a lies strictly between 2 and 8)

In roster notation, B = {3, 5, 7}: plainly B is a subset of A. Can you express B in other ways

using set-builder notation?

This is enough for our purposes, though a course in set theory will convince you that this deﬁnition has its own

problems. Selection is always at work, at all levels of education. . .

2. You should be familiar with common sets of numbers: we summarize these using informal

combinations of roster and set-builder notation.

Natural numbers N = {1, 2, 3, 4, . . .}. For instance 5 ∈ N but −3 ∈ N.

Integers Z = {. . . , −2, −1, 0, 1, 2, 3, . . .}. For instance −4 ∈ Z but

∈ Z.

Rational numbers or fractions: Q =



: p ∈ Z, q ∈ N



. For instance −

∈ Q; in this case

p = −6 is an integer, and q = 7 a natural number.

Real numbers R: for instance

√

2 ∈ R. A formal deﬁnition is difﬁcult, but you should be used

to identifying the real line with a ruler. Intervals are particularly important subsets, e.g.

[−4, π) = {x ∈ R : −4 ≤ x < π}

is a half-open interval.

You should also be familiar with the Cartesian plane: R

= {(x, y) : x, y ∈ R}. The notation

(3, 4) ∈ R

describes a point in the plane with co-ordinates x = 3, y = 4: don’t confuse this

with the interval ( 3, 4) = {x ∈ R : 3 < x < 4} which is a subset of R!

The subset relationships between these sets should be familiar:

N ⊆ Z ⊆ Q ⊆ R

You should also have informally encountered the notion of irrationality: for instance,

√

2 and π

are real numbers but not rational numbers.

The reason we need this language when discussing functions is that the inputs and outputs of a

function are elements of sets. Here is a very formal deﬁnition of “function.”

Deﬁnition 1.4. The Cartesian product of sets A, B is the set of ordered pairs

A × B =



(a, b) : a ∈ A, b ∈ B



A function from A to B is a non-empty subset f ⊆ A ×B which satisﬁes the vertical line test

For each a ∈ A, there is a unique b ∈ B such that (a, b) ∈ f (∗)

Instead of writing f ⊆ A ×B and (a, b) ∈ f , we use the more familiar notation

f : A → B and f (a) = b

To a function f : A → B are associated three sets:

• Domain: dom f = A is the set of inputs.

• Codomain: codom f = B is the set of possible outputs.

• Range: range f = {b ∈ B : b = f (a) for some a ∈ A} is the set of realized outputs.

This probably isn’t the deﬁnition you should give to 10

graders, or even to freshman calculus stu-

dents! But what should you do? How much of this is helpful in a a given context?

Example (1.1.2 cont.). We consider our earlier food-based example in this formal setting. To do this

properly, we have to carefully label the constituent sets. For instance:

A =



Mon, Tue, Wed, Thu, Fri



, B =



carbonara, fajitas, ﬁsh, pizza, pork



f =



(Mon, ﬁsh), (Tue, pork), (Wed, fajitas),

(Thu, carbonara), ( Fri, pizza), (Sat, ﬁsh), (Sun, pizza)



We made a choice with the codomain B: can you see how? What would be a different choice?

Try the other examples yourself.

Representing Functions

You should be familiar with several methods for representing a function.

Example 1.5. We consider the familiar formula/rule f (x) = x

in several contexts.

Table This presentation is most helpful when the domain is very small.

The table shows the situation when dom f = {−1, 0, 1, 2, 3} and

range f = {0, 1, 4, 9}

Arrows A pictorial arrow diagram might also be useful for illustrating

functions with small domains.

Graph This is simply the set of ordered pairs



x, f (x)



: x ∈ dom f



in the context of Deﬁnition 1.4, the graph is the function!

For formulæ whose inputs and outputs are real numbers, two con-

ventions are typically followed unless stated otherwise:

• The domain is implied to be all real numbers for which the

formula makes sense.

• The codomain is taken to be the set of real numbers.

If no other information is provided, we’d assume the function de-

ﬁned by the formula f (x) = x

has both domain and codomain the

entire set of real numbers: f : R → R.

The range of the function is the set of possible outputs, in this case

range f = {x

∈ R : x ∈ R} = [0, ∞)

is the half-open interval of non-negative real numbers.

For ‘calculus’ functions like these, the vertical line test really in-

volves vertical lines; every vertical line intersects the graph in pre-

cisely one point.

In the picture, the dots are the graph when the domain is the ﬁnite

set {−1, 0, 1, 2, 3} (as described in the table/arrow-diagram).

x −1 0 1 2 3

f (x) 1 0 1 4 9

−1

−2 0 2

Can you think of other ways to represent a function? How might you decide which to use?

Exercises 1.1. 1. Let d represent the cost in millions of dollars to produce n cars, where n is measured

in 1000s. As clearly as you can, explain what is meant by d(25) = 431.

2. Temperature readings T were recorded every two hours from midnight to noon. Time t was

measured in hours from midnight.

t 0 2 4 6 8 10 12

T (

◦

F) 82 75 74 75 84 90 93

(a) Plot the readings and use them to sketch a rough graph of T as a function of t.

(b) Use your graph to estimate the temperature at 10:30 a.m.

3. State parts 1, 3 and 4 of Example 1.1 using the formal language of Deﬁnition 1.4. If you have a

function, state the domain and range and explain how you know you have a function. If you

don’t have a function, explain why not.

(Since insufﬁcient information is provided, there is no single correct answer!)

4. (a) Let A = {1, 3, 5, 7, 9}. Explain in words what is meant by the set

B = {x ∈ A : x

> 10}

and state B in roster notation.

(b) Find the set C = {x ∈ N : (x −1)

< 16} in roster notation.

5. Suppose that f : {−2, −1, 0, 1, 2} → R is deﬁned by the formula f (x) = x

−4x + 1.

Describe f using a table, an arrow diagram and a graph.

6. Find the implied domain and range for the functions deﬁned by the following rules:

(a) f (x) =

−4

x−2

(b) g(x) =

√

−16x

√

4x −x

(What is the largest set of real numbers for which the formula makes sense?)

7. The unit circle is often represented by the implicit equation x

+ y

= 1.

(a) Draw the circle and explain why the full circle isn’t the graph of a function.

(b) Describe two functions f : [−1, 1] → R and g : [−1, 1] → R whose graphs together

comprise the circle. What are the ranges of the two functions?

1.2 Linear Polynomials

Perhaps the simplest functions are the linear polynomials, whose graphs are straight lines;

y = f (x) = mx + c where m, c are constants (∗)

These functions make for very simple models: increase the input by ∆x and the output changes by

∆y = m∆x regardless of the starting value x. Given some experimental data or a physical situation

relating two quantities x and y, a linear model is an linear polynomial (∗) relating these variables. In

practice, models are usually only approximate. Later in the course we’ll consider how to ﬁnd good

linear models for approximately linear data.

Example 1.6. Find the equation of the straight line through the points A = (1, 3) and B = (4, 1).

Substitute both points into the equation and solve

(

3 = m + c

1 = 4m + c

(∗)

=⇒ −2 = 3m

=⇒ m = −

, c = 3 −m =

There is some technique required in (∗): how would you explain

this to a grade-school student?

0 1 2 3 4 5

The gradient/slope m represents how far one climbs/falls on travelling one unit to the right.

The y-intercept c is the intersection of the graph with the y-axis.

Some bookkeeping is required here: how do we know that every (non-vertical) line corresponds to

such a linear function? This follows easily from a useful fact regarding parametrizations.

Theorem 1.7. Given distinct points A, B, the set of points on the line through A and B is

ℓ

A,B



(1 −t)A + tB : t ∈ R



In this description of a line, t = 0 corresponds to A and t = 1 to B. In essence, we are laying a ruler

onto the line whose units correspond to the distance

Proof. There are several ways to think about this; we use what is essentially vector addition.

The line through the origin and the point B − A is described by

the set of points

ℓ

O,B−A

= {t(B − A) : t ∈ R}

We simply shift this line by A.

The points corresponding to various values of t are marked.

−

B − A

Would you consider presenting an argument like this in a grade-school class? Is the argument helpful

for understanding how to visualize/describe a line? Discuss. . .

Exercise 5 uses this result to show how any non-vertical line may be described by a linear polynomial.

This is nothing more than a generalization of a simple example.

Example 1.8. The line through points A = (3, 6) and B = ( −1, 4) may be parametrized by

(x, y) = (1 −t)(3, 6) + t(−1, 4) =



3 −4t, 6 −2t



By solving for t in terms of x, we see that this has equation

y = 6 −2t = 6 −2 ·

3 − x

x −

Exercises 1.2. 1. The cost of gasoline is $4.20 per gallon on January 1

and $4.90 on March 1

. State

a linear function/model for how the cost of gasoline depends on time.

2. You have a choice of three different cell-phone plans.

(a) No monthly charge and 10¢ per minute for all calls.

(b) $10 per month and 5¢ per minute for all calls.

How should you determine which of the plans to purchase?

3. Revisit Exercise 1.1.2. Find an approximate linear model T(t) = mt + c for this data.

(There is no perfect answer!)

4. Suppose y = mx + c is the equation of a linear function. Choose any two points A, B on this

line, and thus ﬁnd an explicit parametrization in the style of Theorem 1.7.

5. Suppose A = (x

, y

) and B = (x

, y

) are given. If x

= x

, ﬁnd the equation y = mx + c of the

line through these points.

(You should recognize m as the familiar ‘rise over run’)

6. A straight line is sometimes described as the set of points (x, y) ∈ R

satisfying an equation of

the form

ax + by = c

for some constants a, b, c where a, b are not both zero. How does this approach differ from our

use of linear polynomials?

7. Suppose that a linear polynomial f (x) = mx + c is also a linear function:

For all λ, x ∈ R, f (λx) = λ f (x)

What can you say about f ?

(This is the meaning of linear you’ll encounter in a linear algebra class)

1.3 Quadratic Polynomials

Quadratic polynomials are functions of the form y = f (x) = ax

+ bx + c where a = 0. The simplest

is y = x

, the standard parabola opening upwards. Here are some commonly encountered activities:

1. Find the roots/zeros of f , the solutions x to the equation f (x) = 0.

2. Sketch the graph of the function f .

3. Use quadratic functions to model a real-world problem.

You likely know two methods for ﬁnding zeros: factorizing and the quadratic formula, each of which

has its problems. With experience it is easy to spot that

+ 2x −15 = (x −3)(x + 5) = 0 ⇐⇒ x = 3 or x = −5

though the required creativity can make this difﬁcult, particularly when coefﬁcients are large. Stu-

dents often prefer the quadratic formula since it always works, though at the cost of some intimidat-

ing algebra. We’ll think about factorization shortly. First, we see how completing the square lies behind

both the quadratic formula and the standard approach to graphing quadratic functions.

Example 1.9. Describe/graph the parabola y = −3x

+ 12x + 4.

Pay attention to the x terms; −3x

+ 12x = −3(x

−4x). Now

−3(x −2)

= −3(x

−4x + 4) = −3x

+ 12x −12

gives most of what we want: note how we divided the x-

coefﬁcient by two. To ﬁnish, just tidy everything up,

y = (−3x

+ 12x −12) + 16 = −3(x −2)

+ 16

−1 0 1 2 3 4

The parabola therefore opens downwards (−3 < 0) with its apex (maximum) at (x, y) = (2, 16).

This is easy, if intimidating, to repeat in general:

+ bx + c = a





+ c = a



x +



−

+ c

= a



x +



−

−4ac

( ∗)

The graph is that of the standard parabola which has been:

1. Vertically scaled by a;

2. Shifted horizontally by −

;

3. Shifted vertically by

4ac−b

By solving (∗) for x, we see that completing the square yields

the quadratic formula.

y = x

y = ax

+ bx + c

−

4ac−b

Theorem 1.10. If a = 0, then ax

+ bx + c = 0 ⇐⇒ x =

−b ±

√

−4ac

Example (1.9 cont). Our analysis suggests two methods for ﬁnding the roots.

1. Quadratic formula: with a = −3, b = 12, c = 4, we have

x =

−12 ±

−4(−3) ·4

2(−3)

−12 ±4

√

+ 3

−6

= 2 ±

√

= 2 ±

√

While it is always tempting to jump for a formula, it often leads to difﬁcult surd expressions.

We simpliﬁed by noticing the common factor of 4

inside the square root. Without this, we’d

be faced with

√

144 + 48 =

√

192.

2. Use the fact that we’ve already completed the square:

−3(x −2)

+ 16 = 0 ⇐⇒ (x −2)

⇐⇒ x = 2 ±

√

In many cases it is simpler to complete the square than to use the quadratic formula—remember

that they are equivalent!

Polynomials are often employed in modelling due to their simplicity and ease of evaluation. As you

saw in calculus, the motion of a falling body, or of any projectile can be modelled using quadratic

polynomials, an observation going back to at least to Galileo in the early 1600s: the distance travelled

by a falling body is proportional to the square of the time taken y(t) −y(0) ∝ t

Example 1.11. A body is dropped from a height of 125 meters, taking exactly 5 seconds to reach the

ground. Its height at time t seconds is given by y(t) = 125 −5t

This certainly ﬁts Galileo’s observation: y(t) −y(0) = −5t

is indeed

proportional to t

Over each interval of 1 s, we may ask how far the body falls; we

summarize in a table.

t 0 1 2 3 4 5

y(t) 125 120 105 80 45 0

y(t) − y(0) 0 −5 −20 −45 −80 −125

∆y −5 −15 −25 −35 −45

100

125

0 1 2 3 4 5

Since each interval has duration 1 s, each ∆y is the average speed of the falling body over that interval.

You’ll have seen problems like this in calculus; likely you want to differentiate to ﬁnd the velocity

y(t) = −10t m/s and acceleration y

′′

( t) = −10 m/s

. However, historically and in introductory

calculus, it is problems like these that motivate the deﬁnition of the derivative.

Armed with calculus, Galileo’s observation is that the height y(t) solves the differential equation

= −g

where g is the constant acceleration due to gravity; approximately 32 ft/s

or 10 m/s

. Unless you

are explicitly teaching calculus or Newtonian physics, this is probably a bad place to start!

The last line of the table really does suggest that speed is a linear function!

Example 1.12. Your frisbee is stuck 15 m up a tree. Standing 10 m from

the base, you throw a ball with the intent of knocking the frisbee out

of the tree.

The standard approach to modelling such problems involves consid-

ering the horizontal and vertical motions separately.

Horizontal x(t) = pt + q is a linear function of time.

Vertical y(t) = −10t

+ rt + s is a quadratic function of time.

Substituting for t yields a quadratic function for the trajectory

y(x) = ax

+ bx + c

We’ll leave the details of the solution to Exercise 4. For the present,

consider why there are multiple answers; can you explain why without

explicitly solving the problem?

start

Exercises 1.3. 1. Complete the square for each quadratic function, use it to ﬁnd the range and to

graph the function.

(a) f (x) = x

−6x + 5

(b) f (x) = −x

+ x + 1

+ 8x + 5

2. For the quadratic function y = 2x

−5x + 7, produce a table for x ∈ {0, 1, 2, 3, 4, 5, 6} similarly

to that in Example 1.11. What do you observe about ∆y?

3. (a) Find the equations of all quadratic polynomial functions which pass through the points

(1, 3) and (2, 4).

(b) More generally, if P = (a, b) and Q = (c, d) are given, where c = a, ﬁnd all quadratic

functions whose graphs contain P and Q.

4. Consider the frisbee/tree problem (Example 1.12).

(a) Assume that you’re standing at the origin and the frisbee is at the point (10, 15). Find all

trajectories.

(b) (Hard) Find a formula linking the initial speed and gradient of the parabola (the initial

speed and direction in which you throw the ball).

i. If you throw the ball in such a way that the initial vertical speed of the ball is twice its

horizontal speed, ﬁnd how fast you have to throw the ball in order to hit the frisbee.

ii. What is the minimum speed at which you could throw the ball if you want to dislodge

the frisbee?

(Hint: You’ll need some calculus! In the language of the original problem, the initial slope is m =

and speed v =

+ r

; why?)

1.4 Polynomials, Factorization & the Rational Roots Theorem

Recall our simple example of factorization in the previous section

+ 2x −15 = (x −3)(x + 5) = 0 ⇐⇒ x = 3 or x = −5

That this approach provides all roots depends on several familiar algebraic facts:

1. Factor Theorem: f (c) = 0 ⇐⇒ x −c is a factor of f (x).

2. No zero-divisors: pq = 0 ⇐⇒ p = 0 or q = 0.

3. A quadratic has at most two distinct roots.

We’ll examine this more closely at the end of this section. For students ﬁrst learning factorization,

it isn’t the why that’s the challenge, it’s the how. Multiplying out (x − 3)(x + 5) is mechanical, but

factorizing requires some creativity; we can’t really factor without somehow knowing that 3 and −5

are roots! Beyond making a lucky guess, how do we go about this?

Example 1.13. Let’s re-examine f (x) = x

+ 2x −15 = 0 in a couple of stages.

Integer solutions The simplest type of root would be an integer n. If f (n) = 0, observe that

+ 2n − 15 = 0 =⇒ n(n + 2) = 15 =⇒ 15 is divisible by n

=⇒ n = ±1, ±3, ±5, ±15

There are only eight possible candidates. It doesn’t take long to test all of them:

n 1 −1 3 −3 5 −5 15 −15

f (n) −12 −17 0 −12 20 0 240 180

The two integer solutions are therefore x = 3 and x = −5.

Rational Solutions If you believe that a quadratic polynomial has at most two solutions, then you’re

done. The next simplest possibility, however, is that a solution be a rational number x =

where

we may assume this is in simplest terms.

Substituting into the polynomial, we see that

+ 2

−15 = 0 ⇐⇒ p

+ 2pq − 15q

= 0

Remembering that p, q are integers, we rearrange this equation in two ways:

p(p + 2q) = 15q

Since the left side is a multiple of p, so also is the right. Since p, q have no

common factors, it follows that p divides into 15 (15 is a multiple of p).

= q(15q −2p) Since the right side is a multiple of q, so also is the left. Since p, q have no

common factors, we conclude that q = 1.

The upshot is that the only rational solutions to f (x) = 0 are the two integers we’ve already

found!

I.e., p ∈ Z and q ∈ N have no common factors: gcd(p, q) = 1.

Deﬁnition 1.14. A degree n polynomial is any function of the form

f (x) = a

+ a

n−1

+ ···+ a

x + a

where the coefﬁcients a

are constants with a

= 0.

A quadratic polynomial has degree 2 and a linear polynomial mx + c degree one

(if m = 0).

Our analysis in Example 1.13 is easily generalized in a famous result.

Theorem 1.15 (Rational Roots). Suppose f (x) = a

+ ··· + a

has integer coefﬁcients where a

and a

are non-zero. If x =

is a rational root in simplest terms, then q divides into a

and p divides

into a

Proof. Substitute into the function and multiply by q

to obtain an equation where everything is an

integer

+ a

n−1

q + ···+ a

n−1

| {z }

divisible by p

divisible by q

z }| {

+ a

= 0

By considering the braced terms and recalling that p, q have no common factors, we conclude that a

is divisible by q and a

by p.

Examples 1.16. 1. If x =

is a rational root in lowest terms of f (x) = 2x

− x −3, then q = 1 or 2

and p = ±1 or ±3. The possibilities are therefore

x ∈



±1, ±3, ±

, ±



all of which are easily checked:

x 1 −1 3 −3

−

f (x) −2 0 12 18 −3 −2 0 3

The two roots are indicated and the polynomial can be factorized f (x) = (2x −3)(x + 1).

2. If the cubic polynomial f (x) = x

−2x

+ 5 had any rational roots, the only possibilities would

be ±1, ±5. However, none of these work,

f (1) = 4, f (−1) = 2, f (5) = 80, f (−5) = −170

whence f (x) = 0 has no rational roots.

Unless there are very few candidates, it can be time-consuming to check them all by hand. Moreover,

unless you ﬁnd n distinct rational solutions, you still don’t know that you’ve found everything. The

rational roots theorem is therefore typically used together with factorization; it really just gives you

some options for where to start. This still isn’t easy, as the next example shows.

A non-zero constant polynomial has degree zero. Convention is for the zero polynomial y ≡ 0 to have degree −∞, so

that the theorem deg f g = deg f + deg g holds for all polynomials.

Example 1.17. Consider the cubic function f (x) = x

− x

− 7x + 10. The rational roots theorem

gives us eight candidates for rational roots: x = ±1, ±2, ±5, ±10. It is not difﬁcult to check the ﬁrst

few of these in your head, for instance,

f (2) = 8 −4 −14 + 10 = 0

By the factor theorem, x −2 must be a factor of f (x). The factorization can be performed in various

ways. Here are three options, though all are essentially versions of the same process.

Long/synthetic division You should have practiced this in high-school.

+ x − 5

x −2



− x

−7x + 10

− x

+ 2x

−7x

− x

+ 2x

−5x + 10

5x −10

=⇒ x

− x

−7x + 10 = (x −2)(x

+ x −5)

Multiply out and solve We know that f (x) = (x − 2)q(x) where q(x) is some quadratic polynomial.

Thus let q(x) = ax

+ bx + c and multiply out:

− x

−7x + 10 = (x −2)(ax

+ bx + c) = ax

+ ( b −2a)x

+ ( c −2b)x −2c

Equating coefﬁcients, we obtain the same factorization as before,

a = 1, b = −1 + 2a = 1, c =

−2

= −5

Term-by-term factorization With practice you can factorize in one line with no working!

(a) To create x

, the ﬁrst term of the quadratic factor must be x

−x

−7x + 10 = (x −2)(x

+ ···) = x

−2x

+ ···

(b) To correct the x

term, add x (i.e., x

−2x

= −x

(x −2)(x

+ x + ···) = x

−x

−2x + ···

(x −2)(x

+ x − 5) = x

−x

−7x + 10

(d) Since the last term 10 is correct, the factorization worked!

You might have seen other approaches involving arranging the coefﬁcients in a table. Regardless, the

calculations required to complete these methods are exactly those seen above; all these methods are

versions of the same thing.

Why Does Factorization Work?

The theory of factorization relies on some algebra. Here is a brief treatment.

Theorem 1.18 (Factor Theorem). Suppose f (x) is a degree n polynomial. Then:

1. A value c is a root if and only if f (x) = (x −c)q(x) for some (degree n −1) polynomial q(x).

2. The polynomial has at most n distinct roots.

Proof. 1. (⇐) This is essentially trivial: f (x) = (x −c)q(x) =⇒ f (c) = (c −c)q(c) = 0.

(⇒) This relies on the division algorithm for polynomials: if f , g are polynomials, then there are

unique polynomials q, r with

f (x) = g(x)q(x) + r(x) and deg r < deg g

In the special case where g(x) = x −c is linear, then r(x) must be a constant and so

f (x) = (x − c)q(x) + f (c)

2. Suppose c

, . . . , c

are distinct real roots. By part 1, f (x) = (x −c

) q

(x). Since

0 = f (c

) = (c

−c

) q

( c

) =⇒ q

( c

) = 0

we may factor x − c

from q

(x) to obtain

f (x) = (x − c

)(x −c

) q

(x), deg q

= n −2

Repeat this process to factor out all n linear polynomials x −c

f (x) = (x − c

) ···(x −c

) q

, deg q

= n −n = 0

It follows that q

= 0 is constant. Plainly f (c) = (c −c

) ···(c −c

) q

= 0 =⇒ c = c

for some

j, so there are no other roots.

Example (1.17 cont). We know that f (x) = x

− x

−7x + 10 = (x −2)(x

+ x −5). But then

f (x) = 0 ⇐⇒ x −2 = 0 or x

+ x −5 = 0

The former gives the root x = 2, and the latter can be attacked via the quadratic formula or complet-

ing the square; the polynomial therefore has exactly three real roots

x = 2,

−1 ±

√

For a given example, q, r may be found by synthetic division. This is similar (and may be demonstrated similarly) to

the more familiar division algorithm for integers: if m, n are integers, then there are unique integers q, r for which

m = qn + r and 0 ≤ r <

In elementary school, this is typically written m ÷n = q r r (q remainder r); e.g., 23 ÷4 = 5 r 3 corresponds to 23 = 5 ×4 + 3.

Example 1.19. We ﬁnish with a quick example of how long division (or any other factorization

method as in Example 1.17) computes the ingredients in the division algorithm.

If f (x) = x

+ 7x

−2 and g(x) = x

−2, then

x + 7

−2



+ 7x

−2

− x

+ 2x

+ 2x −2

−7x

+ 14

2x + 12

=⇒ x

+ 7x

−2 = (x

−2)(x + 7) + (2x + 12)

Otherwise said, f (x) = g(x)q(x) + r(x), where

q(x) = x + 7, r(x) = 2x + 12 and deg r = 1 < 2 = deg g.

Exercises 1.4. 1. Apply the rational root theorem to the polynomial x

+ 2x

− x −2 and use it to

factorize the polynomial.

2. Repeat the previous question for the polynomial 6x

+ x −2.

3. Use the rational roots theorem to prove that the polynomial 2x

−3x + 7 has no rational roots.

4. Factorize the following polynomials and thereby ﬁnd their (real) roots. Explain your steps

carefully.

(a) f (x) = x

+ 2x

−3x

(b) f (x) = x

−13x

+ 36

−7x −6

5. Show that the polynomial f (x) = x

−2x

− x

−4x

−4x −6 has exactly two real roots

by factorizing it.

6. The polynomial f (x) = 2x

−3x

+ 2x

+ 3x −9 has only one rational root. Find it and factorize

the polynomial as f (x) = g(x)q(x) where deg g = 1.

7. Find unique polynomials q(x) and r(x) for which f (x) = g(x)q(x) + r(x) and deg r < deg g.

(a) f (x) = x

+ 1 and g(x) = x + 2.

(b) f (x) = x

+ x

−2 and g(x) = x

+ 1.

8. Let f (x) = ax

+ bx

+ cx + d be a cubic polynomial. ‘Complete the cube’ by ﬁnding a constant

k such that

f (x) = a(x −k)

+ p(x −k) + q

has no (x − k)

term (here p, q are constants).

(Hint: evaluate f (x + k))

9. Suppose that deg f = k and deg g = l.

(a) Show that deg( f g) = kl.

(b) Is it always the case that deg( f + g) = max(k, l)? Why/why not?

1.5 Inverse Functions & the Horizontal Line Test

The informal idea of an inverse function is that f

−1

takes the output of f and returns its input (and

vice versa).

Example 1.20. Deﬁne a simple function using a table or an arrow diagram

x 1 2 3 4

f (x) 4 2 5 7

y 4 2 5 7

−1

( y) 1 2 3 4

The inverse f

−1

is the function obtained by reversing the arrows or ﬂipping

the table upside-down.

−1

Deﬁnition 1.21. A function f : A → B is invertible if it has an inverse: a function f

−1

: B → A for

which

−1



f (x)



= x and f



−1

( y)



= y (∗)

for all possible inputs x ∈ A and y ∈ B.

Certainly Example 1.20 satisﬁes the input–output properties (∗). Our concerns are identifying when

a function is invertible, how to make it so if not, and how to compute an inverse.

Examples 1.22. 1. The function f (x) = 2x has inverse f

−1

( y) =

The input–output conditions (∗) are certainly satisﬁed.

The graph admits an interpretation of f

−1

similar to the arrow diagram.

• The function f takes an input x, moves it vertically to the graph, then

projects to the y-axis. This interpretation is precisely the vertical line

test (Deﬁnition 1.4)!

• The inverse function reverses the arrows: transport an input y horizon-

tally to the graph, then project to the x-axis.

0 1 2

−1

2. Consider f (x) = x

−1. This time, when attempting to move a real

number y horizontally to the graph, we usually encounter one of

two problems:

(a) If y > −1, there are two choices of x (two intersections).

(b) If y < −1, there is no intersection with the graph.

The na

ıve approach of reversing the arrows is insufﬁcient to deﬁne an

inverse. However, a simple remedy arises by staring at the graph:

• Problem (a) goes away if we delete the left half of the graph.

Equivalently, we restrict the domain of f to [0, ∞).

• Problem (b) disappears if we insist that y ≥ −1. Equivalently,

we restrict the codomain of f to its range [−1, ∞).

−2 2

−1

After making these restrictions so that f : [0, ∞) → [−1, ∞), it is easily checked that

−1

( y) =

y + 1, f

−1

: [−1, ∞) → [0, ∞)

satisﬁes the input–output conditions (∗) and is therefore the inverse of f :

x ∈ [0, ∞) =⇒ f

−1



f (x)



−1) + 1 = x

y ∈ [−1, ∞) =⇒ f



−1

( y)





y + 1



−1 = y

What makes a function invertible? The ﬁxes in the last example can be rephrased succinctly:

Horizontal line test: every horizontal line must intersect the graph exactly once

This unpacks to two conditions, each of which addresses one of the problems seen in the example.

Deﬁnition 1.23. Let f : A → B be a function. We say that f is:

(a) 1–1/one-to-one if distinct inputs x

= x

∈ A have distinct outputs f (x

) = f (x

). Equivalently,

Given x

, x

∈ A, we have f (x

) = f (x

) =⇒ x

= x

If A, B are sets of real numbers, each horizontal line intersects the graph at most once.

(b) Onto if range f = B. Equivalently,

Given y ∈ B, there is some x ∈ A for which y = f (x)

If A, B ⊆ R, the horizontal line through y ∈ B intersects the graph at least once.

Putting these ideas together, a function is both 1–1 and onto precisely when every y ∈ B corresponds

to a unique x ∈ A for which y = f (x). In summary:

Theorem 1.24. f : A → B is invertible if and only if it is both 1–1 and onto. Its inverse is the function

−1

: B → A such that f

−1

( y) = x whenever y = f (x).

Example (1.22.2, mk. II). Consider the two properties in the context of the example f (x) = x

−1:

(a) f (x

) = f (x

) =⇒ x

−1 = x

−1 =⇒ x

= x

=⇒ x

= ±x

To force f to be 1–1, it is enough to restrict the domain so that all x have the same sign: the

obvious choice is dom f = [0, ∞).

(b) range f =



−1 : x ∈ [0, ∞)



= [−1, ∞). We force f to be onto by restricting its codomain to

[−1, ∞).

The inverse function is obtained by solving y = x

−1 for x:

= y + 1 =⇒ x = f

−1

( y) =

y + 1

The non-negative square root is used since x ∈ dom f = [0, ∞).

An algorithm for inverting functions Our discussion provides an algorithmic process for making

a function f : A → B invertible and ﬁnding an inverse.

(a) Check that f is 1–1. If not, restrict the domain until it is.

(b) Check that f is onto. If not, redeﬁne B = range f .

−1

( y).

Since x is typically preferred as an input, it is common to switch x, y at the end of step 3 and write

y = f

−1

(x). If A, B ⊆ R, switching x ↔ y is equivalent to reﬂecting the graph in the line y = x.

Note also that step (a) likely involves a choice; depending on how you restrict the domain, you can

ﬁnd multiple inverse functions! To see this in action, we return once more to our example.

Example (1.22.2, mk. III). Recall that if f (x) = x

−1, then

f (x

) = f (x

) =⇒ x

= ±x

Instead of restricting the domain to [0, ∞), we can instead force f to be 1–1 by taking the other half

of the graph; by choosing dom f = (−∞ , 0]. The range/codomain remains [−1, ∞ ), but the inverse

function is now different:

= y + 1 =⇒ x = −

y + 1 ∈ (−∞, 0] = dom f =⇒ f

−1

(x) = −

√

x + 1

This time the new domain for f forced us to use the negative square root.

−2 2

f (x) = x

−1

dom( f ) = [0, ∞)

dom( f ) = (−∞, 0]

0 3 6 9

−1

(x) =

√

x + 1

3 6 9

−2

−1

(x) = −

√

x + 1

We could choose other domains on which f is 1–1, but these are the most natural choices.

The moral is that you cannot invert a function unless you are precise about its domain and range!

We ﬁnish with an algebraically tougher example: you may feel that more detail is justiﬁed!

Example 1.25. Let y = f (x) =

(x−2)

. Its implied domain consists of all real numbers except 2.

The vertical line test is clearly visible on the graph: every vertical line

x = a, except x = 2, intersects the graph exactly once.

The range is the interval R

= (0, ∞) as can be seen by solving

f (x) = y ⇐⇒

x −2

= ±

√

y ⇐⇒ x = 2 ±

√

Any positive output y may be obtained via y = f



2 +

√



The ±-term shows that f fails the horizontal line test: it isn’t 1–1.

There are two natural choices for an inverse:

(a) Choose dom f = (2, ∞), then ±

√

y =

x−2

is positive. We

take the positive square root and obtain the inverse function

g : (0, ∞) → (2, ∞), g(x) = 2 +

√

(b) Choose dom f = (−∞, 2), then ±

√

y =

x−2

is negative and

we obtain a second inverse function

h : (0, ∞) → (−∞, 2), h(x) = 2 −

√

−1 0 1 2 3 4

−1

1 2 3 4 5 6

y = g(x)

y = h(x)

Exercises 1.5. 1. If dom f = R, check that f (x) = x

+ 8 passes the horizontal line test. Find f

−1

2. Consider f (x) = x

+ 2x −3. Similarly to Example 1.22, ﬁnd two inverses of f .

3. Sketch the graph of the following function

f (x) =











x if 0 ≤ x < 1

x −1 if 1 ≤ x < 2

x −2 if 2 ≤ x < 3

Find three domains on which f is 1–1 and thus compute three distinct inverses.

4. Show that the following function f : R → (

, ∞) is 1–1 and onto, sketch its graph and ﬁnd f

−1

f (x) =

(

3 −

x if x ≤ 2

2 −

if x > 2

5. (Hard) Find the implied domain and range of f (x) =

x+1

. Now ﬁnd an interval on which f

is 1–1 and compute its inverse.

6. An astute student observes that Deﬁnition 1.21 only describes the properties satisﬁed by an

inverse and asks why we keep referring to the inverse. How would you respond?