Math 121B Linear Algebra
Neil Donaldson
Fall 2022
Linear Algebra, Stephen Friedberg, Arnold Insel & Lawrence Spence, 4th Ed 2003, Prentice Hall.
Review from 121A
We begin by recalling a few basic notions and notations.
Vector Spaces Bold-face $\mathbf{v}$ denotes a vector in a vector space $V$ over a field $F$. A vector space is closed under vector addition and scalar multiplication:
$$\forall \mathbf{v}_1, \mathbf{v}_2 \in V,\ \forall \lambda_1, \lambda_2 \in F,\qquad \lambda_1\mathbf{v}_1 + \lambda_2\mathbf{v}_2 \in V$$
Examples. Here are four (families of) vector spaces over the field $\mathbb{R}$.

• $\mathbb{R}^2 = \{x\mathbf{i} + y\mathbf{j} : x, y \in \mathbb{R}\} = \left\{\begin{pmatrix} x \\ y \end{pmatrix} : x, y \in \mathbb{R}\right\}$ is a vector space over the field $\mathbb{R}$.
• $P_n(\mathbb{R})$: polynomials with degree $\le n$ and coefficients in $\mathbb{R}$.
• $P(\mathbb{R})$: polynomials over $\mathbb{R}$ with any degree.
• $C(\mathbb{R})$: continuous functions from $\mathbb{R}$ to $\mathbb{R}$.
Linear Combinations and Spans Let $\beta \subseteq V$ be a subset of a vector space $V$ over $F$. A linear combination of vectors in $\beta$ is any finite sum
$$\lambda_1\mathbf{v}_1 + \cdots + \lambda_n\mathbf{v}_n$$
where $\lambda_j \in F$ and $\mathbf{v}_j \in \beta$. The span of $\beta$ comprises all linear combinations: this is a subspace of $V$.
Bases and Co-ordinates A set $\beta \subseteq V$ is a basis of $V$ if it has two properties:

Linear Independence Any linear combination yielding the zero vector is trivial; for distinct $\mathbf{v}_j \in \beta$,
$$\lambda_1\mathbf{v}_1 + \cdots + \lambda_n\mathbf{v}_n = \mathbf{0} \implies \forall j,\ \lambda_j = 0$$

Spanning Set $V = \operatorname{Span}\beta$; every vector in $V$ is a (finite!) linear combination of elements of $\beta$.

Theorem. $\beta$ is a basis of $V \iff$ every $\mathbf{v} \in V$ is a unique linear combination of elements of $\beta$.

The cardinality of all basis sets is identical; this common value is the dimension $\dim_F V$.
Example. $P_2(\mathbb{R})$ has standard basis $\beta = \{1, x, x^2\}$: every polynomial of degree $\le 2$ is uniquely expressible as a linear combination $p(x) = a + bx + cx^2$, and so $\dim P_2(\mathbb{R}) = 3$. The real numbers $a, b, c$ are the co-ordinates of $p$ with respect to $\beta$; the co-ordinate vector of $p$ is written
$$[p]_\beta = \begin{pmatrix} a \\ b \\ c \end{pmatrix}$$
Linearity and Linear Maps A function $T : V \to W$ between vector spaces $V, W$ over the same field $F$ is ($F$-)linear if it respects the linearity properties of $V, W$:
$$\forall \mathbf{v}_1, \mathbf{v}_2 \in V,\ \forall \lambda_1, \lambda_2 \in F,\qquad T(\lambda_1\mathbf{v}_1 + \lambda_2\mathbf{v}_2) = \lambda_1 T(\mathbf{v}_1) + \lambda_2 T(\mathbf{v}_2)$$
We write $\mathcal{L}(V, W)$ for the set (indeed vector space!) of linear maps from $V$ to $W$: this is shortened to $\mathcal{L}(V)$ if $V = W$. An isomorphism is an invertible/bijective linear map.
Theorem. If $\dim_F V = n$ and $\beta$ is a basis of $V$, then the co-ordinate map $\mathbf{v} \mapsto [\mathbf{v}]_\beta$ is an isomorphism of vector spaces $V \cong F^n$.
Matrices and Linear Maps If $V, W$ are finite-dimensional, then any linear map $T : V \to W$ can be described using matrix multiplication.
Example. If $A = \begin{pmatrix} 2 & -1 \\ 0 & 1 \\ -4 & 3 \end{pmatrix}$, then the linear map $L_A : \mathbb{R}^2 \to \mathbb{R}^3$ (left-multiplication by $A$) is
$$L_A\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 2 & -1 \\ 0 & 1 \\ -4 & 3 \end{pmatrix}\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 2x - y \\ y \\ 3y - 4x \end{pmatrix}$$
The linear map in fact defines the matrix $A$; we recover the columns of the matrix by feeding the standard basis vectors to the linear map:
$$\begin{pmatrix} 2 \\ 0 \\ -4 \end{pmatrix} = L_A\begin{pmatrix} 1 \\ 0 \end{pmatrix},\qquad \begin{pmatrix} -1 \\ 1 \\ 3 \end{pmatrix} = L_A\begin{pmatrix} 0 \\ 1 \end{pmatrix}$$
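This is easy to sanity-check on a computer. Here is a minimal sketch (not part of the original notes, assuming numpy) with the matrix $A$ above:

```python
import numpy as np

# The matrix A from the example above
A = np.array([[ 2, -1],
              [ 0,  1],
              [-4,  3]])

# Feeding the standard basis vectors to L_A recovers the columns of A
e1 = np.array([1, 0])
e2 = np.array([0, 1])
print(A @ e1)  # [ 2  0 -4], the first column
print(A @ e2)  # [-1  1  3], the second column
```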
More generally, if $T \in \mathcal{L}(V, W)$ and $\beta = \{\mathbf{v}_1, \ldots, \mathbf{v}_n\}$ and $\gamma = \{\mathbf{w}_1, \ldots, \mathbf{w}_m\}$ are bases of $V, W$ respectively, then the matrix of $T$ with respect to $\beta$ and $\gamma$ is
$$[T]^\gamma_\beta = \begin{pmatrix} [T(\mathbf{v}_1)]_\gamma & \cdots & [T(\mathbf{v}_n)]_\gamma \end{pmatrix} \in M_{m \times n}(F)$$
whose $j$th column is obtained by feeding the $j$th basis vector of $\beta$ to $T$ and taking its co-ordinate vector with respect to $\gamma$. This fits naturally with the co-ordinate isomorphisms:
$$T(\mathbf{v}) = \mathbf{w} \iff [T]^\gamma_\beta [\mathbf{v}]_\beta = [\mathbf{w}]_\gamma$$
There are two special cases when $V = W$:

• If $\beta = \gamma$, then we simply write $[T]_\beta$ instead of $[T]^\beta_\beta$.
• If $T = I$ is the identity map, then $Q^\gamma_\beta := [I]^\gamma_\beta$ is the change of co-ordinate matrix from $\beta$ to $\gamma$.
Being able to convert linear maps into matrix multiplication is a central skill in linear algebra. Test
your comfort by working through the following; if everything feels familiar, you should consider
yourself in a good place as far as pre-requisites are concerned!
Example. Let $T : P_2(\mathbb{R}) \to P_1(\mathbb{R})$ be the linear map defined by differentiation
$$T(a + bx + cx^2) = b + 2cx \tag{$*$}$$
The standard bases of $P_2(\mathbb{R})$ and $P_1(\mathbb{R})$ are, respectively, $\beta = \{1, x, x^2\}$ and $\gamma = \{1, x\}$. Observe that
$$[T(1)]_\gamma = [0]_\gamma = \begin{pmatrix} 0 \\ 0 \end{pmatrix},\qquad [T(x)]_\gamma = [1]_\gamma = \begin{pmatrix} 1 \\ 0 \end{pmatrix},\qquad [T(x^2)]_\gamma = [2x]_\gamma = \begin{pmatrix} 0 \\ 2 \end{pmatrix}$$
$$\implies [T]^\gamma_\beta = \begin{pmatrix} [T(1)]_\gamma & [T(x)]_\gamma & [T(x^2)]_\gamma \end{pmatrix} = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 2 \end{pmatrix}$$
Written in co-ordinates, we see the original linear map $(*)$:
$$[T(a + bx + cx^2)]_\gamma = [T]^\gamma_\beta\, [a + bx + cx^2]_\beta = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 2 \end{pmatrix}\begin{pmatrix} a \\ b \\ c \end{pmatrix} = \begin{pmatrix} b \\ 2c \end{pmatrix} = [b + 2cx]_\gamma \tag{$\dagger$}$$
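If you like to experiment, here is a hedged sketch of the same computation in Python with sympy; the helper `coords` is our own construction (not from the notes) and assumes a monomial basis.

```python
import sympy as sp

x = sp.symbols('x')
T = lambda p: sp.diff(p, x)      # the differentiation map T
beta = [1, x, x**2]              # standard basis of P_2(R)
gamma = [1, x]                   # standard basis of P_1(R)

def coords(p, basis):
    # coordinate vector w.r.t. a monomial basis {1, x, ..., x^m}
    poly = sp.Poly(p, x)
    return [poly.coeff_monomial(x**j) for j in range(len(basis))]

# the j-th column of [T]^gamma_beta is [T(v_j)]_gamma
cols = [coords(T(v), gamma) for v in beta]
print(sp.Matrix(cols).T)         # Matrix([[0, 1, 0], [0, 0, 2]])
```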
1. $\eta = \{1 + x,\ x + x^2,\ x^2 + 1\}$ is also a basis of $P_2(\mathbb{R})$. Show that
$$[T]^\gamma_\eta = \begin{pmatrix} 1 & 1 & 0 \\ 0 & 2 & 2 \end{pmatrix}$$
2. As in $(\dagger)$ above, the matrix multiplication
$$\begin{pmatrix} 1 & 1 & 0 \\ 0 & 2 & 2 \end{pmatrix}\begin{pmatrix} a \\ b \\ c \end{pmatrix} = \begin{pmatrix} a + b \\ 2b + 2c \end{pmatrix}$$
corresponds to an equation $[T(p)]_\gamma = [T]^\gamma_\eta [p]_\eta$ for some polynomial $p(x)$; what is $p(x)$ in terms of $a, b, c$?
3. Find the change of co-ordinate matrix $Q^\beta_\eta$ and check that the matrices of $T$ are related by
$$[T]^\gamma_\eta = [T]^\gamma_\beta\, Q^\beta_\eta$$
1 Diagonalizability & the Cayley–Hamilton Theorem
1.1 Eigenvalues, Eigenvectors & Diagonalization (Review)
Definition 1.1. Suppose $V$ is a vector space over $F$ and $T \in \mathcal{L}(V)$. A non-zero $\mathbf{v} \in V$ is an eigenvector of $T$ with eigenvalue $\lambda \in F$ (together an eigenpair) if
$$T(\mathbf{v}) = \lambda\mathbf{v}$$
For matrices, the eigenvalues/vectors of $A \in M_n(F)$ are precisely those of $L_A \in \mathcal{L}(F^n)$.
Suppose $\lambda$ is an eigenvalue of $T$:

1. The eigenspace of $\lambda$ is the nullspace $E_\lambda := \mathcal{N}(T - \lambda I)$.
2. The geometric multiplicity of $\lambda$ is the dimension $\dim E_\lambda$.

We say that $T$ is diagonalizable if there exists a basis of eigenvectors; an eigenbasis.
We start by recalling a couple of basic facts, the first of which is easily proved by induction.
Lemma 1.2. If $\mathbf{v}_1, \ldots, \mathbf{v}_k$ are eigenvectors corresponding to distinct eigenvalues, then $\{\mathbf{v}_1, \ldots, \mathbf{v}_k\}$ is linearly independent.

Moreover, if $\dim_F V = n$ and $T \in \mathcal{L}(V)$ has $n$ distinct eigenvalues, then $T$ is diagonalizable.
Eigenvalues and Eigenvectors in finite dimensions

If $\dim_F V = n$ and $\epsilon$ is a basis, then the eigenvector definition is equivalent to a matrix equation
$$[T]_\epsilon [\mathbf{v}]_\epsilon = \lambda [\mathbf{v}]_\epsilon$$
In such a situation, $T$ being diagonalizable means $\exists\beta$ such that $[T]_\beta$ is a diagonal matrix
$$[T]_\beta = \begin{pmatrix} \lambda_1 & 0 & \cdots & 0 \\ 0 & \lambda_2 & \ddots & \vdots \\ \vdots & \ddots & \ddots & 0 \\ 0 & \cdots & 0 & \lambda_n \end{pmatrix}$$
Thankfully there is a systematic way to find eigenvalues and eigenvectors in finite dimensions (a sketch of the recipe in code follows this list):

1. Choose any basis $\epsilon$ of $V$ and compute the matrix $A = [T]_\epsilon \in M_n(F)$.
2. Observe that
$$\lambda \in F \text{ is an eigenvalue} \iff \exists [\mathbf{v}]_\epsilon \in F^n \setminus \{\mathbf{0}\} \text{ such that } A[\mathbf{v}]_\epsilon = \lambda[\mathbf{v}]_\epsilon$$
$$\iff \exists [\mathbf{v}]_\epsilon \in F^n \setminus \{\mathbf{0}\} \text{ such that } (A - \lambda I)[\mathbf{v}]_\epsilon = \mathbf{0}$$
$$\iff \det(A - \lambda I) = 0$$
This last is a degree-$n$ polynomial equation whose roots are the eigenvalues.
3. For each eigenvalue $\lambda_j$, compute the eigenspace $E_{\lambda_j} = \mathcal{N}(T - \lambda_j I)$ to find the eigenvectors. Remember that $E_{\lambda_j}$ is a subspace of the original vector space $V$, so translate back if necessary!
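As an illustration only (not part of the notes), the three steps translate directly into a few lines of sympy; the matrix here is an arbitrary example:

```python
import sympy as sp

A = sp.Matrix([[2, 1],
               [1, 2]])         # step 1: A = [T]_eps in some basis
t = sp.symbols('t')

p = (A - t*sp.eye(2)).det()     # step 2: characteristic polynomial
print(sp.factor(p))             # (t - 1)*(t - 3)

for lam in sp.solve(p, t):      # step 3: eigenspaces as nullspaces
    print(lam, (A - lam*sp.eye(2)).nullspace())
```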
Definition 1.3. The characteristic polynomial of $T \in \mathcal{L}(V)$ is the degree-$n$ polynomial
$$p(t) := \det(T - tI)$$
The eigenvalues of $T$ are precisely the solutions to the characteristic equation $p(t) = 0$.
Examples 1.4. 1. $A = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}$ has characteristic polynomial $p(t) = t^2 + 1 = (t + i)(t - i)$. As a linear map $L_A \in \mathcal{L}(\mathbb{R}^2)$, $A$ has no eigenvalues and no eigenvectors!
As a linear map $L_A \in \mathcal{L}(\mathbb{C}^2)$, we have two eigenvalues $\pm i$. Indeed
$$(A - iI)\mathbf{v} = \begin{pmatrix} -i & -1 \\ 1 & -i \end{pmatrix}\mathbf{v} \implies E_i = \operatorname{Span}\left\{\begin{pmatrix} i \\ 1 \end{pmatrix}\right\}$$
and similarly $E_{-i} = \operatorname{Span}\left\{\begin{pmatrix} -i \\ 1 \end{pmatrix}\right\}$. We therefore have an eigenbasis $\beta = \left\{\begin{pmatrix} i \\ 1 \end{pmatrix}, \begin{pmatrix} -i \\ 1 \end{pmatrix}\right\}$ (of $\mathbb{C}^2$), with respect to which
$$[L_A]_\beta = \begin{pmatrix} i & 0 \\ 0 & -i \end{pmatrix}$$
2. Let $T \in \mathcal{L}(P_2(\mathbb{R}))$ be defined by
$$T(f)(x) = f(x) + (x - 1)f'(x)$$
With respect to the standard basis $\epsilon = \{1, x, x^2\}$, we have the non-diagonal matrix
$$A = [T]_\epsilon = \begin{pmatrix} 1 & -1 & 0 \\ 0 & 2 & -2 \\ 0 & 0 & 3 \end{pmatrix} \implies p(t) = \det(A - tI) = (1 - t)(2 - t)(3 - t)$$
With three distinct eigenvalues, $T$ is diagonalizable. To find the eigenvectors, compute the nullspaces:
$$\lambda_1 = 1:\quad \mathbf{0} = (A - \lambda_1 I)[\mathbf{v}_1]_\epsilon = \begin{pmatrix} 0 & -1 & 0 \\ 0 & 1 & -2 \\ 0 & 0 & 2 \end{pmatrix}[\mathbf{v}_1]_\epsilon \implies [\mathbf{v}_1]_\epsilon \in \operatorname{Span}\left\{\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}\right\} \implies E_1 = \operatorname{Span}\{1\}$$
$$\lambda_2 = 2:\quad A - \lambda_2 I = \begin{pmatrix} -1 & -1 & 0 \\ 0 & 0 & -2 \\ 0 & 0 & 1 \end{pmatrix} \implies [\mathbf{v}_2]_\epsilon \in \operatorname{Span}\left\{\begin{pmatrix} 1 \\ -1 \\ 0 \end{pmatrix}\right\} \implies E_2 = \operatorname{Span}\{1 - x\}$$
$$\lambda_3 = 3:\quad A - \lambda_3 I = \begin{pmatrix} -2 & -1 & 0 \\ 0 & -1 & -2 \\ 0 & 0 & 0 \end{pmatrix} \implies [\mathbf{v}_3]_\epsilon \in \operatorname{Span}\left\{\begin{pmatrix} 1 \\ -2 \\ 1 \end{pmatrix}\right\} \implies E_3 = \operatorname{Span}\{1 - 2x + x^2\}$$
Making a sensible choice of non-zero eigenvectors, we obtain an eigenbasis, with respect to which the linear map is necessarily diagonal:
$$\beta = \{\mathbf{v}_1, \mathbf{v}_2, \mathbf{v}_3\} = \{1,\ 1 - x,\ 1 - 2x + x^2\} = \{1,\ 1 - x,\ (1 - x)^2\}$$
$$T\bigl(a + b(1 - x) + c(1 - x)^2\bigr) = a + 2b(1 - x) + 3c(1 - x)^2 \qquad [T]_\beta = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 3 \end{pmatrix}$$
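Again as a sketch (assuming sympy, and not part of the original notes), we can confirm the matrix computation:

```python
import sympy as sp

A = sp.Matrix([[1, -1,  0],
               [0,  2, -2],
               [0,  0,  3]])    # A = [T]_eps from the example
t = sp.symbols('t')

print(sp.factor((A - t*sp.eye(3)).det()))   # -(t - 1)*(t - 2)*(t - 3)
for val, mult, vecs in A.eigenvects():
    print(val, [list(v) for v in vecs])
# up to scale, the coordinate vectors correspond to 1, 1 - x, 1 - 2x + x^2
```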
Conditions for diagonalizability of finite-dimensional operators
We now borrow a little terminology from the theory of polynomials.
Definition 1.5. Let $F$ be a field and $p(t)$ a polynomial with coefficients in $F$.

1. Let $\lambda \in F$ be a root; $p(\lambda) = 0$. The algebraic multiplicity $\operatorname{mult}(\lambda)$ is the largest power of $\lambda - t$ to divide $p(t)$. Otherwise said, there exists¹ some polynomial $q(t)$ such that
$$p(t) = (\lambda - t)^{\operatorname{mult}(\lambda)} q(t) \quad\text{and}\quad q(\lambda) \neq 0$$
2. We say that $p(t)$ splits over $F$ if it factorizes completely into linear factors; equivalently
$$\exists a, \lambda_1, \ldots, \lambda_k \in F \text{ such that } p(t) = a(\lambda_1 - t)^{m_1} \cdots (\lambda_k - t)^{m_k}$$
When $p(t)$ splits, the algebraic multiplicities sum to the degree $n$ of the polynomial:
$$n = m_1 + \cdots + m_k$$
Of course, we are most interested when $p(t)$ is the characteristic polynomial of a linear map $T \in \mathcal{L}(V)$. If such a polynomial splits, then $a = 1$ and $\lambda_1, \ldots, \lambda_k$ are necessarily the (distinct) eigenvalues of $T$.
Example 1.6. The field matters! For instance $p(t) = t^2 + 1 = (t - i)(t + i) = (-i - t)(i - t)$ splits over $\mathbb{C}$ but not over $\mathbb{R}$. Its roots are plainly $\pm i$.
For the purposes of review, we state the main result; this will be proved in the next section.
Theorem 1.7. Let $V$ be finite-dimensional. A linear map $T \in \mathcal{L}(V)$ is diagonalizable if and only if,

1. Its characteristic polynomial splits over $F$, and,
2. The geometric and algebraic multiplicities of each eigenvalue are equal; $\dim E_{\lambda_j} = \operatorname{mult}(\lambda_j)$.
Example 1.8. The matrix $A = \begin{pmatrix} 3 & 1 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 5 \end{pmatrix}$ is easily seen to have eigenvalues $\lambda_1 = 3$ and $\lambda_2 = 5$. Indeed
$$p(t) = (3 - t)^2(5 - t),\qquad \operatorname{mult}(3) = 2,\quad \operatorname{mult}(5) = 1$$
$$E_3 = \operatorname{Span}\left\{\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}\right\},\qquad E_5 = \operatorname{Span}\left\{\begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}\right\},\qquad \dim E_3 = \dim E_5 = 1$$
This matrix is non-diagonalizable since $\dim E_3 = 1 \neq 2 = \operatorname{mult}(3)$.
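A quick sympy check of this example (a sketch, not part of the notes):

```python
import sympy as sp

A = sp.Matrix([[3, 1, 0],
               [0, 3, 0],
               [0, 0, 5]])
# geometric multiplicity of lambda = 3 is the dimension of N(A - 3I)
print(len((A - 3*sp.eye(3)).nullspace()))   # 1, yet mult(3) = 2
print(A.is_diagonalizable())                # False
```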
Everything prior to this should be review. If it feels very unfamiliar, revisit your notes from 121A,
particularly sections 5.1 and 5.2 of the textbook.
¹The existence follows from Descartes' factor theorem and the division algorithm for polynomials.
Exercises 1.1 1. For each matrix over $\mathbb{R}$, find its characteristic polynomial, its eigenvalues/spaces, and its algebraic and geometric multiplicities; decide if it is diagonalizable.
$$\text{(a) } A = \begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 3 & 1 & 0 \\ 0 & 0 & 3 & 1 \\ 0 & 0 & 0 & 3 \end{pmatrix} \qquad \text{(b) } B = \begin{pmatrix} 1 & 6 & 0 & 0 \\ 2 & 6 & 0 & 0 \\ 0 & 0 & 3 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix}$$
2. Suppose $A$ is a real matrix with eigenpair $(\lambda, \mathbf{v})$. If $\lambda \notin \mathbb{R}$, show that $(\overline{\lambda}, \overline{\mathbf{v}})$ is also an eigenpair.
3. Show that the characteristic polynomial of $A = \begin{pmatrix} 3 & -4 \\ 4 & 3 \end{pmatrix}$ does not split over $\mathbb{R}$. Diagonalize $A$ over $\mathbb{C}$.
4. Give an example of a 2 × 2 matrix whose entries are rational numbers and whose characteristic
polynomial splits over R, but not over Q.
5. Diagonalize $L_C \in \mathcal{L}(\mathbb{C}^2)$ where $C = \begin{pmatrix} 2i & 1 \\ 2 & 0 \end{pmatrix}$.
6. If $p(t)$ splits, explain why
$$\det T = \lambda_1^{\operatorname{mult}(\lambda_1)} \cdots \lambda_k^{\operatorname{mult}(\lambda_k)}$$
where $\lambda_1, \ldots, \lambda_k$ are the distinct eigenvalues of $T$.
7. Suppose $T \in \mathcal{L}(V)$ is invertible with eigenvalue $\lambda$. Prove that $\lambda^{-1}$ is an eigenvalue of $T^{-1}$ with the same eigenspace $E_\lambda$. If $T$ is diagonalizable, prove that $T^{-1}$ is also diagonalizable.
8. If $V$ is finite-dimensional and $T \in \mathcal{L}(V)$, we may define $\det T$ to equal $\det[T]_\beta$, where $\beta$ is any basis of $V$. Explain why the choice of basis does not matter; that is, if $\gamma$ is any other basis of $V$, we have $\det[T]_\gamma = \det[T]_\beta$.
1.2 Invariant Subspaces and the Cayley–Hamilton Theorem
The proof of Theorem 1.7 is facilitated by a new concept, of which eigenspaces are a special case.
Definition 1.9. Suppose $T \in \mathcal{L}(V)$. A subspace $W$ of $V$ is $T$-invariant if $T(W) \subseteq W$. In such a case, the restriction of $T$ to $W$ is the linear map
$$T_W : W \to W : \mathbf{w} \mapsto T(\mathbf{w})$$
Examples 1.10. 1. The trivial subspace $\{\mathbf{0}\}$ and the entire vector space $V$ are invariant for any linear map $T \in \mathcal{L}(V)$.

2. Every eigenspace is invariant; if $\mathbf{v} \in E_\lambda$, then $T(\mathbf{v}) = \lambda\mathbf{v} \in E_\lambda$.

3. Continuing Example 1.8, if $A = \begin{pmatrix} 3 & 1 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 5 \end{pmatrix}$, then $W = \operatorname{Span}\{\mathbf{i}, \mathbf{j}\}$ is an invariant subspace for the linear map $L_A$. Indeed
$$A(x\mathbf{i} + y\mathbf{j}) = (3x + y)\mathbf{i} + 3y\mathbf{j} \in W$$
$W$ is an example of a generalized eigenspace; we'll study these properly at the end of term.
To prove our diagonalization criterion, we need to see how to factorize the characteristic polynomial.
It turns out that factors of p(t) correspond to T-invariant subspaces!
Example 1.11. $W = \operatorname{Span}\{\mathbf{i}, \mathbf{j}\}$ is an invariant subspace of $A = \begin{pmatrix} 1 & 2 & 4 \\ 0 & 3 & 1 \\ 0 & 0 & 2 \end{pmatrix} \in M_3(\mathbb{R})$. With respect to the standard basis, the restriction $(L_A)_W$ has matrix $\begin{pmatrix} 1 & 2 \\ 0 & 3 \end{pmatrix}$. The characteristic polynomial $p_W(t)$ of the restriction is plainly a factor of the whole,
$$p(t) = (1 - t)(2 - t)(3 - t) = (2 - t)\, p_W(t)$$
Theorem 1.12. Suppose $T \in \mathcal{L}(V)$, that $\dim V$ is finite, and that $W$ is a $T$-invariant subspace of $V$. Then the characteristic polynomial of the restriction $T_W$ divides that of $T$.

The proof simply abstracts the approach of the example.

Proof. Extend a basis $\beta_W$ of $W$ to a basis $\beta$ of $V$. Since $T(\mathbf{w}) \in \operatorname{Span}\beta_W$ for each $\mathbf{w} \in W$, we see that the matrix $[T]_\beta$ has block form
$$[T]_\beta = \begin{pmatrix} A & B \\ O & C \end{pmatrix} \implies p(t) = \det(A - tI)\det(C - tI) = p_W(t)\det(C - tI)$$
where $p_W(t)$ is the characteristic polynomial of $A = [T_W]_{\beta_W}$, the matrix of the restriction $T_W$.
Corollary 1.13. If $\lambda$ is an eigenvalue of $T$, then $T_{E_\lambda} = \lambda I_{E_\lambda}$ is a multiple of the identity, whence,

1. The characteristic polynomial of the restriction $T_{E_\lambda}$ is $p_\lambda(t) = (\lambda - t)^{\dim E_\lambda}$.
2. $p_\lambda(t)$ divides the characteristic polynomial of $T$. In particular $\dim E_\lambda \le \operatorname{mult}(\lambda)$.
We are now in a position to state and prove an extended version of Theorem 1.7.

Theorem 1.14. Suppose $\dim_F V = n$ and that $T \in \mathcal{L}(V)$ has distinct eigenvalues $\lambda_1, \ldots, \lambda_k$. The following are equivalent:

1. $T$ is diagonalizable.
2. The characteristic polynomial splits over $F$ and $\dim E_{\lambda_j} = \operatorname{mult}(\lambda_j)$ for each $j$; indeed
$$p(t) = p_{\lambda_1}(t) \cdots p_{\lambda_k}(t) = (\lambda_1 - t)^{\dim E_{\lambda_1}} \cdots (\lambda_k - t)^{\dim E_{\lambda_k}}$$
3. $\sum_{j=1}^k \dim E_{\lambda_j} = n$
4. $V = E_{\lambda_1} \oplus \cdots \oplus E_{\lambda_k}$
Example 1.15. $A = \begin{pmatrix} 7 & 0 & -12 \\ 0 & 1 & 0 \\ 2 & 0 & -3 \end{pmatrix}$ is diagonalizable. Indeed $p(t) = (1 - t)^2(3 - t)$ splits, and we have
$$\begin{array}{c|c|c} \lambda & 1 & 3 \\ \hline \operatorname{mult}(\lambda) & 2 & 1 \\ \hline E_\lambda & \operatorname{Span}\left\{\begin{pmatrix} 2 \\ 0 \\ 1 \end{pmatrix}, \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}\right\} & \operatorname{Span}\left\{\begin{pmatrix} 3 \\ 0 \\ 1 \end{pmatrix}\right\} \\ \hline \dim E_\lambda & 2 & 1 \end{array}$$
and $\mathbb{R}^3 = E_1 \oplus E_3$. With respect to the eigenbasis $\beta = \left\{\begin{pmatrix} 2 \\ 0 \\ 1 \end{pmatrix}, \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}, \begin{pmatrix} 3 \\ 0 \\ 1 \end{pmatrix}\right\}$, the map is diagonal: $[L_A]_\beta = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 3 \end{pmatrix}$
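To double-check a diagonalization like this one, place the eigenbasis in the columns of a matrix $P$ and verify that $P^{-1}AP$ is diagonal. A sketch with sympy (not part of the notes):

```python
import sympy as sp

A = sp.Matrix([[7, 0, -12],
               [0, 1,   0],
               [2, 0,  -3]])
P = sp.Matrix([[2, 0, 3],
               [0, 1, 0],
               [1, 0, 1]])       # columns: the eigenbasis beta
print(P.inv() * A * P)            # Matrix([[1, 0, 0], [0, 1, 0], [0, 0, 3]])
```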
Proof. $(1 \Rightarrow 2)$ If $T$ is diagonalizable with eigenbasis $\beta$, then $[T]_\beta$ is diagonal. But then
$$p(t) = (\lambda_1 - t)^{m_1} \cdots (\lambda_k - t)^{m_k}$$
splits and $\sum \operatorname{mult}(\lambda_i) = n$. The cardinality $n$ of an eigenbasis is at most $\sum \dim E_{\lambda_i}$ since every element is an (independent) eigenvector. By Corollary 1.13 ($\dim E_{\lambda_j} \le \operatorname{mult}(\lambda_j)$) we see that
$$n \le \sum \dim E_{\lambda_j} \le \sum \operatorname{mult}(\lambda_j) = n$$
whence the inequalities are equalities with each pair equal: $\dim E_{\lambda_j} = \operatorname{mult}(\lambda_j)$.

$(2 \Rightarrow 3)$ $p(t)$ splits $\implies n = \sum \operatorname{mult}(\lambda_j) = \sum \dim E_{\lambda_j}$.

$(3 \Rightarrow 4)$ Assume $E_{\lambda_1} \oplus \cdots \oplus E_{\lambda_j}$ exists.² If $(\lambda_{j+1}, \mathbf{v}_{j+1})$ is an eigenpair, then $\mathbf{v}_{j+1} \notin E_{\lambda_1} \oplus \cdots \oplus E_{\lambda_j}$, for otherwise this would contradict Lemma 1.2. By induction, $E_{\lambda_1} \oplus \cdots \oplus E_{\lambda_k}$ exists; by assumption it has dimension $n = \dim V$ and therefore equals $V$.

$(4 \Rightarrow 1)$ For each $j$, choose a basis $\beta_j$ of $E_{\lambda_j}$. Then $\beta := \beta_1 \cup \cdots \cup \beta_k$ is a basis of $V$ consisting of eigenvectors of $T$; an eigenbasis.
²Distinct eigenspaces have trivial intersection: $i_1 \neq i_2 \implies E_{\lambda_{i_1}} \cap E_{\lambda_{i_2}} = \{\mathbf{0}\}$.
T-cyclic Subspaces and the Cayley–Hamilton Theorem
We finish this chapter by introducing a general family of invariant subspaces and using them to prove
a startling result.
Definition 1.16. Let $T \in \mathcal{L}(V)$ and let $\mathbf{v} \in V$. The $T$-cyclic subspace generated by $\mathbf{v}$ is the span
$$\langle \mathbf{v} \rangle = \operatorname{Span}\{\mathbf{v}, T(\mathbf{v}), T^2(\mathbf{v}), \ldots\}$$
Example 1.17. Recalling Example 1.10.3, let $A = \begin{pmatrix} 3 & 1 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 5 \end{pmatrix}$ and $\mathbf{v} = \mathbf{i} + \mathbf{k}$. It is easy to see that
$$A\mathbf{v} = 3\mathbf{i} + 5\mathbf{k},\quad A^2\mathbf{v} = 9\mathbf{i} + 25\mathbf{k},\quad \ldots,\quad A^m\mathbf{v} = 3^m\mathbf{i} + 5^m\mathbf{k}$$
all of which lie in $\operatorname{Span}\{\mathbf{i}, \mathbf{k}\}$. Plainly this is the $L_A$-cyclic subspace $\langle \mathbf{i} + \mathbf{k} \rangle$.
The proof of the following basic result is left as an exercise.

Lemma 1.18. $\langle \mathbf{v} \rangle$ is the smallest $T$-invariant subspace of $V$ containing $\mathbf{v}$, specifically:

1. $\langle \mathbf{v} \rangle$ is $T$-invariant.
2. If $W \le V$ is $T$-invariant and $\mathbf{v} \in W$, then $\langle \mathbf{v} \rangle \subseteq W$.
3. $\dim\langle \mathbf{v} \rangle = 1 \iff \mathbf{v}$ is an eigenvector of $T$.
We were lucky in the example that the general form $A^m\mathbf{v}$ was so clear. It is helpful to develop a more precise test for identifying the dimension and a basis of a $T$-cyclic subspace.

Suppose a $T$-cyclic subspace $\langle \mathbf{v} \rangle = \operatorname{Span}\{\mathbf{v}, T(\mathbf{v}), T^2(\mathbf{v}), \ldots\}$ has finite dimension.³ Let $k \ge 1$ be maximal such that the set
$$\{\mathbf{v}, T(\mathbf{v}), \ldots, T^{k-1}(\mathbf{v})\}$$
is linearly independent.

• If $k$ doesn't exist, the infinite linearly independent set $\{\mathbf{v}, T(\mathbf{v}), \ldots\}$ contradicts $\dim\langle \mathbf{v} \rangle < \infty$.
• By the maximality of $k$, $T^k(\mathbf{v}) \in \operatorname{Span}\{\mathbf{v}, T(\mathbf{v}), \ldots, T^{k-1}(\mathbf{v})\}$; by induction this extends to
$$j \ge k \implies T^j(\mathbf{v}) \in \operatorname{Span}\{\mathbf{v}, T(\mathbf{v}), \ldots, T^{k-1}(\mathbf{v})\}$$

It follows that $\langle \mathbf{v} \rangle = \operatorname{Span}\{\mathbf{v}, T(\mathbf{v}), \ldots, T^{k-1}(\mathbf{v})\}$, and we've proved a useful criterion.
Theorem 1.19. Suppose $\mathbf{v} \neq \mathbf{0}$. Then
$$\dim\langle \mathbf{v} \rangle = k \iff \{\mathbf{v}, T(\mathbf{v}), \ldots, T^{k-1}(\mathbf{v})\} \text{ is a basis of } \langle \mathbf{v} \rangle$$
$$\iff k \text{ is maximal such that } \{\mathbf{v}, T(\mathbf{v}), \ldots, T^{k-1}(\mathbf{v})\} \text{ is linearly independent}$$
$$\iff k \text{ is minimal such that } T^k(\mathbf{v}) \in \operatorname{Span}\{\mathbf{v}, T(\mathbf{v}), \ldots, T^{k-1}(\mathbf{v})\}$$
³Necessarily the situation if $\dim V < \infty$, when we are thinking about characteristic polynomials.
Examples 1.20. 1. According to the Theorem, in Example 1.17 we need only have noticed:

• $\mathbf{v} = \mathbf{i} + \mathbf{k}$ and $A\mathbf{v} = 3\mathbf{i} + 5\mathbf{k}$ are linearly independent;
• That $A^2(\mathbf{i} + \mathbf{k}) = 9\mathbf{i} + 25\mathbf{k} \in \operatorname{Span}\{\mathbf{v}, A\mathbf{v}\}$.

We could then conclude that $\langle \mathbf{v} \rangle = \operatorname{Span}\{\mathbf{v}, A\mathbf{v}\}$ has dimension 2.

2. Let $T(p(x)) = 3p(x) - p''(x)$, viewed as a linear map $T \in \mathcal{L}(P_2(\mathbb{R}))$, and consider the $T$-cyclic subspace generated by the polynomial $p(x) = x^2$:
$$T(x^2) = 3x^2 - 2,\qquad T^2(x^2) = T(3x^2 - 2) = 3(3x^2 - 2) - 6 = 9x^2 - 12,\quad \ldots$$
Observe that $\{x^2, T(x^2)\}$ is linearly independent, but that
$$T^2(x^2) = 9x^2 - 12 = -9x^2 + 6(3x^2 - 2) \in \operatorname{Span}\{x^2, T(x^2)\}$$
We conclude that $\dim\langle x^2 \rangle = 2$. An alternative basis for $\langle x^2 \rangle$ is plainly $\{1, x^2\}$.
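Theorem 1.19 also suggests a mechanical test in coordinates: in $\mathbb{R}^3$, the rank of the matrix with columns $\mathbf{v}, A\mathbf{v}, A^2\mathbf{v}$ equals $\dim\langle \mathbf{v} \rangle$. A sketch (assuming sympy; not part of the notes) for Example 1.17:

```python
import sympy as sp

A = sp.Matrix([[3, 1, 0],
               [0, 3, 0],
               [0, 0, 5]])
v = sp.Matrix([1, 0, 1])                   # v = i + k
K = sp.Matrix.hstack(v, A*v, A**2*v)       # columns v, Av, A^2 v
print(K.rank())                            # 2 = dim <v>
```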
We finish by considering the interaction of a $T$-cyclic subspace with the characteristic polynomial. Continuing the Example, if $W = \langle x^2 \rangle$ and $\beta_W = \{x^2, T(x^2)\} = \{x^2, 3x^2 - 2\}$, then
$$[T_W]_{\beta_W} = \begin{pmatrix} 0 & -9 \\ 1 & 6 \end{pmatrix} \implies p_W(t) = t^2 - 6t + 9$$
Compare this with the linear combination $T^2(x^2) - 6T(x^2) + 9x^2 = 0$ found above: surprisingly, the coefficients of the characteristic polynomial and of the linear combination coincide.
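This coincidence is easy to test numerically. A sketch with sympy (our own check, not from the notes): applying $T^2 - 6T + 9I$ to $x^2$ really does give zero.

```python
import sympy as sp

x = sp.symbols('x')
T = lambda p: sp.expand(3*p - sp.diff(p, x, 2))   # T(p) = 3p - p''

p0 = x**2
print(sp.expand(T(T(p0)) - 6*T(p0) + 9*p0))       # 0, matching p_W(t) = t^2 - 6t + 9
```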
Theorem 1.21. Let $T \in \mathcal{L}(V)$ and suppose $W = \langle \mathbf{w} \rangle$ has $\dim W = k$ with basis
$$\beta_W = \{\mathbf{w}, T(\mathbf{w}), \ldots, T^{k-1}(\mathbf{w})\}$$
in accordance with Theorem 1.19. Then:

1. If $T^k(\mathbf{w}) + a_{k-1}T^{k-1}(\mathbf{w}) + \cdots + a_0\mathbf{w} = \mathbf{0}$, then the characteristic polynomial of $T_W$ is
$$p_W(t) = (-1)^k\bigl(t^k + a_{k-1}t^{k-1} + \cdots + a_1 t + a_0\bigr)$$
2. $p_W(T_W) = 0$ is the zero map on $W$.
Proof. 1. This is an exercise.

2. Write $S \in \mathcal{L}(V)$ for the linear map
$$S := p_W(T) = (-1)^k\bigl(T^k + a_{k-1}T^{k-1} + \cdots + a_0 I\bigr)$$
Part 1 says $S(\mathbf{w}) = \mathbf{0}$. Since $S$ is a polynomial in $T$, it commutes with all powers of $T$:
$$\forall j,\quad S(T^j(\mathbf{w})) = T^j(S(\mathbf{w})) = \mathbf{0}$$
Since $S$ is zero on the basis $\beta_W$ of $W$, we see that $S_W$ is the zero function.
With a little sneakiness, we can drop the W’s in the second part of the Theorem and observe an
intimate relation between a linear map and its characteristic polynomial.
Corollary 1.22 (Cayley–Hamilton). If $V$ is finite-dimensional, then $T \in \mathcal{L}(V)$ satisfies its characteristic polynomial; $p(T) = 0$.

Proof. Let $\mathbf{w} \in V$ and consider the cyclic subspace $W = \langle \mathbf{w} \rangle$ generated by $\mathbf{w}$. By Theorem 1.12,
$$p(t) = q_W(t)\, p_W(t)$$
for some polynomial $q_W$. But the previous result says that $p_W(T)(\mathbf{w}) = \mathbf{0}$, whence
$$p(T)(\mathbf{w}) = q_W(T)\, p_W(T)(\mathbf{w}) = \mathbf{0}$$
Since we may apply this reasoning to any $\mathbf{w} \in V$, we conclude that $p(T)$ is the zero function.
Examples 1.23. 1. $A = \begin{pmatrix} 2 & 1 \\ 3 & 4 \end{pmatrix}$ has $p(t) = t^2 - 6t + 5$ and we confirm:
$$A^2 - 6A = \begin{pmatrix} 7 & 6 \\ 18 & 19 \end{pmatrix} - 6\begin{pmatrix} 2 & 1 \\ 3 & 4 \end{pmatrix} = -5I$$
It may seem like a strange thing to do for this matrix, but the characteristic equation can be used to calculate the inverse of $A$:
$$A^2 - 6A + 5I = 0 \implies A(A - 6I) = -5I \implies A^{-1} = \frac{1}{5}(6I - A) = \frac{1}{5}\begin{pmatrix} 4 & -1 \\ -3 & 2 \end{pmatrix}$$
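A numerical sanity check of both computations (a sketch assuming numpy; not part of the notes):

```python
import numpy as np

A = np.array([[2., 1.],
              [3., 4.]])
I = np.eye(2)

print(A @ A - 6*A + 5*I)          # the zero matrix: A satisfies p(t)
Ainv = (6*I - A) / 5              # inverse from the characteristic equation
print(np.allclose(Ainv, np.linalg.inv(A)))   # True
```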
2. We use the Cayley–Hamilton Theorem to compute $A^4$ when
$$A = \begin{pmatrix} 2 & 1 & \frac{8}{3} \\ 0 & 1 & 6 \\ 0 & 0 & 2 \end{pmatrix}$$
The characteristic polynomial is
$$p(t) = (2 - t)^2(1 - t) = 4 - 8t + 5t^2 - t^3$$
By Cayley–Hamilton, $A^3 = 5A^2 - 8A + 4I$, whence
$$A^4 = AA^3 = A(5A^2 - 8A + 4I) = 5A^3 - 8A^2 + 4A = 5(5A^2 - 8A + 4I) - 8A^2 + 4A$$
$$= 17A^2 - 36A + 20I = 17\begin{pmatrix} 4 & 3 & \frac{50}{3} \\ 0 & 1 & 18 \\ 0 & 0 & 4 \end{pmatrix} - 36\begin{pmatrix} 2 & 1 & \frac{8}{3} \\ 0 & 1 & 6 \\ 0 & 0 & 2 \end{pmatrix} + 20\begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix} = \begin{pmatrix} 16 & 15 & \frac{562}{3} \\ 0 & 1 & 90 \\ 0 & 0 & 16 \end{pmatrix}$$
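Exact arithmetic (sympy's rationals handle the $\frac{8}{3}$ entry) confirms the result; again a sketch, not part of the notes:

```python
import sympy as sp

A = sp.Matrix([[2, 1, sp.Rational(8, 3)],
               [0, 1, 6],
               [0, 0, 2]])
print(A**4 == 17*A**2 - 36*A + 20*sp.eye(3))   # True
print(A**4)   # Matrix([[16, 15, 562/3], [0, 1, 90], [0, 0, 16]])
```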
3. Recall Example 1.4.2, where the linear map $T(f(x)) = f(x) + (x - 1)f'(x)$ had
$$p(t) = (1 - t)(2 - t)(3 - t) = -t^3 + 6t^2 - 11t + 6$$
By Cayley–Hamilton, $T^3 = 6T^2 - 11T + 6I$. You can check this explicitly, after first computing
$$T^2(f(x)) = f(x) + 3(x - 1)f'(x) + (x - 1)^2 f''(x),\quad \text{etc.}$$
Cayley–Hamilton can also be used to simplify higher powers of $T$ and even to compute the inverse!
$$I = \frac{1}{6}(T^3 - 6T^2 + 11T) \implies T^{-1} = \frac{1}{6}(T^2 - 6T + 11I)$$
$$\implies T^{-1}(f(x)) = f(x) - \frac{1}{2}(x - 1)f'(x) + \frac{1}{6}(x - 1)^2 f''(x)$$
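One last sketch (assuming sympy; not part of the notes) checking the formula for $T^{-1}$ on a general quadratic:

```python
import sympy as sp

x, a, b, c = sp.symbols('x a b c')
f = a + b*x + c*x**2

T = lambda g: g + (x - 1)*sp.diff(g, x)
Tinv = lambda g: (g - sp.Rational(1, 2)*(x - 1)*sp.diff(g, x)
                    + sp.Rational(1, 6)*(x - 1)**2*sp.diff(g, x, 2))

print(sp.expand(T(Tinv(f)) - f))   # 0: T(T^{-1}(f)) = f on P_2(R)
```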
Exercises 1.2 1. For the linear map $T = L_A : \mathbb{R}^3 \to \mathbb{R}^3$ where $A = \begin{pmatrix} 3 & 0 & 0 \\ 0 & 2 & 4 \\ 0 & 0 & 2 \end{pmatrix}$, find the $T$-cyclic subspace generated by the standard basis vector $\mathbf{e}_3 = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}$.
2. Let $T = L_A$, where $A = \begin{pmatrix} 1 & 2 & 4 \\ 0 & 3 & 1 \\ 0 & 0 & 2 \end{pmatrix}$, and let $\mathbf{v} = \begin{pmatrix} 0 \\ 1 \\ 1 \end{pmatrix}$. Compute $T(\mathbf{v})$ and $T^2(\mathbf{v})$. Hence describe the $T$-cyclic subspace $\langle \mathbf{v} \rangle$ and its dimension.
3. Given $A = \begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 3 & 1 & 0 \\ 0 & 0 & 3 & 1 \\ 0 & 0 & 0 & 3 \end{pmatrix}$, find two distinct $L_A$-invariant subspaces $W \le \mathbb{R}^4$ such that $\dim W = 3$.
4. Suppose that $W$ and $X$ are $T$-invariant subspaces of $V$. Prove that the sum
$$W + X = \{\mathbf{w} + \mathbf{x} : \mathbf{w} \in W,\ \mathbf{x} \in X\}$$
is also $T$-invariant.
5. Prove Lemma 1.18.
6. Give an example of an infinite-dimensional vector space $V$, a linear map $T \in \mathcal{L}(V)$, and a vector $\mathbf{v}$ such that $\langle \mathbf{v} \rangle = V$.
7. Let $\beta = \{\sin x, \cos x, 2x\sin x, 3x\cos x\}$ and $T = \frac{d}{dx} \in \mathcal{L}(\operatorname{Span}\beta)$. Plainly the subspace $W := \operatorname{Span}\{\sin x, \cos x\}$ is $T$-invariant. Compute the matrices $[T]_\beta$ and $[T_W]_{\beta_W}$ and observe that
$$p(t) = \bigl(p_W(t)\bigr)^2$$
8. Verify explicitly that $A = \begin{pmatrix} 2 & 3 \\ 0 & 2 \end{pmatrix}$ satisfies its characteristic polynomial.
9. Check the details of Example 1.23.3 and evaluate $T^4$ as a linear combination of $I$, $T$ and $T^2$. In particular, check the evaluation of $T^{-1}(f(x))$.
10. Suppose $a, b$ are constants with $a \neq 0$ and define $T(f(x)) = af(x) + bf'(x)$.
(a) Find an expression for the inverse $T^{-1}(f(x))$ if $T \in \mathcal{L}(P_1(\mathbb{R}))$.
(b) Find an expression for the inverse $T^{-1}(f(x))$ if $T \in \mathcal{L}(P_2(\mathbb{R}))$.
Your answers should be written in terms of $f$ and its derivatives.
11. Let $T(f)(x) = f'(x) + \frac{1}{x}\int_0^x f(t)\,dt$ be a linear map $T \in \mathcal{L}(P_2(\mathbb{R}))$.
(a) Find the characteristic polynomial of $T$ and identify its eigenspaces. Is $T$ diagonalizable?
(b) Find $a, b, c \in \mathbb{R}$ such that $T^3 = aT^2 + bT + cI$.
(c) What are $\dim \mathcal{L}(P_2(\mathbb{R}))$ and $\dim\operatorname{Span}\{T^k : k \in \mathbb{N}_0\}$? Explain.
12. If $A = \begin{pmatrix} a & b \\ c & d \end{pmatrix}$ has non-zero determinant, use the Cayley–Hamilton Theorem to obtain the usual expression for $A^{-1}$.
13. Recall Examples 1.10.3, 1.17, and 1.20.1 with $A = \begin{pmatrix} 3 & 1 & 0 \\ 0 & 3 & 0 \\ 0 & 0 & 5 \end{pmatrix}$.
(a) If $\mathbf{v} = \begin{pmatrix} x \\ y \\ z \end{pmatrix}$, show that $\det(\mathbf{v},\ A\mathbf{v},\ A^2\mathbf{v}) = -4y^2 z$.
(b) Hence determine all $L_A$-cyclic subspaces of $\mathbb{R}^3$.
14. (a) Consider Example 1.20.2, where $T \in \mathcal{L}(P_2(\mathbb{R}))$ is defined by $T(p(x)) = 3p(x) - p''(x)$. Prove that all $T$-cyclic subspaces have dimension $\le 2$.
(b) What if we instead consider $S \in \mathcal{L}(P_2(\mathbb{R}))$ defined by $S(p(x)) = 3p(x) - p'(x)$?
15. We prove part 1 of Theorem 1.21.
(a) Explain why the matrix of $T_W$ with respect to the basis $\beta_W$ is
$$[T_W]_{\beta_W} = \begin{pmatrix} 0 & 0 & 0 & \cdots & 0 & -a_0 \\ 1 & 0 & 0 & & 0 & -a_1 \\ 0 & 1 & 0 & & 0 & -a_2 \\ \vdots & & \ddots & & \vdots & \vdots \\ 0 & 0 & 0 & & 0 & -a_{k-2} \\ 0 & 0 & 0 & \cdots & 1 & -a_{k-1} \end{pmatrix} \in M_k(F)$$
(b) Compute the characteristic polynomial $p_W(t) = \det\bigl([T_W]_{\beta_W} - tI_k\bigr)$ by expanding the determinant along the first row.