3 Canonical Forms
3.1 Jordan Forms & Generalized Eigenvectors
Throughout this course we've concerned ourselves with variations of a general question: for a given map T ∈ L(V), find a basis β such that the matrix [T]_β is as close to diagonal as possible. In this chapter we see what is possible when T is non-diagonalizable.
Example 3.1. The matrix A = \begin{pmatrix} -8 & 4 \\ -25 & 12 \end{pmatrix} ∈ M₂(ℝ) has characteristic equation

  p(t) = (−8 − t)(12 − t) + 4·25 = t² − 4t + 4 = (t − 2)²

and thus a single eigenvalue λ = 2. It is non-diagonalizable since the eigenspace is one-dimensional:

  E₂ = N\begin{pmatrix} -10 & 4 \\ -25 & 10 \end{pmatrix} = Span\begin{pmatrix} 2 \\ 5 \end{pmatrix}

However, if we consider a basis β = {v₁, v₂} where v₁ = \begin{pmatrix} 2 \\ 5 \end{pmatrix} is an eigenvector, then [L_A]_β is upper-triangular, which is better than nothing! How simple can we make this matrix? Let v₂ = \begin{pmatrix} x \\ y \end{pmatrix}; then

  Av₂ = \begin{pmatrix} -8x + 4y \\ -25x + 12y \end{pmatrix} = 2\begin{pmatrix} x \\ y \end{pmatrix} + \begin{pmatrix} -10x + 4y \\ -25x + 10y \end{pmatrix} = 2v₂ + (−5x + 2y)v₁   ⟹   [L_A]_β = \begin{pmatrix} 2 & -5x + 2y \\ 0 & 2 \end{pmatrix}

Since v₂ cannot be parallel to v₁, the only thing we cannot have is a diagonal matrix. The next best thing is for the upper right corner to be 1; for instance we could choose

  β = {v₁, v₂} = \left\{ \begin{pmatrix} 2 \\ 5 \end{pmatrix}, \begin{pmatrix} 1 \\ 3 \end{pmatrix} \right\}   ⟹   [L_A]_β = \begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix}
Definition 3.2. A Jordan block is a square matrix of the form

  J = \begin{pmatrix} λ & 1 & & \\ & λ & \ddots & \\ & & \ddots & 1 \\ & & & λ \end{pmatrix}

where all non-indicated entries are zero. Any 1 × 1 matrix is also a Jordan block.
A Jordan canonical form is a block-diagonal matrix diag(J₁, . . . , Jₘ) where each Jₖ is a Jordan block.
A Jordan canonical basis for T ∈ L(V) is a basis β of V such that [T]_β is a Jordan canonical form.
If a map is diagonalizable, then any eigenbasis is Jordan canonical and the corresponding Jordan canonical form is diagonal. What about more generally? Does every non-diagonalizable map have a Jordan canonical basis? If so, how can we find one?
Example 3.3. It can easily be checked that β = {v₁, v₂, v₃} = \left\{ \begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix}, \begin{pmatrix} 1 \\ 2 \\ 0 \end{pmatrix}, \begin{pmatrix} 1 \\ 1 \\ 1 \end{pmatrix} \right\} is a Jordan canonical basis for

  A = \begin{pmatrix} -1 & 2 & 3 \\ -4 & 5 & 4 \\ -2 & 1 & 4 \end{pmatrix}

(really L_A ∈ L(ℝ³)). Indeed

  Av₁ = 2v₁,   Av₂ = 3v₂,   Av₃ = \begin{pmatrix} 4 \\ 5 \\ 3 \end{pmatrix} = \begin{pmatrix} 1 + 3 \\ 2 + 3 \\ 0 + 3 \end{pmatrix} = v₂ + 3v₃   ⟹   [L_A]_β = \begin{pmatrix} 2 & 0 & 0 \\ 0 & 3 & 1 \\ 0 & 0 & 3 \end{pmatrix}
Generalized Eigenvectors
Example 3.3 was easy to check, but how would we go about finding a suitable β if we were merely given A? We brute-forced this in Example 3.1, but such is not a reasonable approach in general. Eigenvectors get us some of the way:
• v₁ is an eigenvector in Example 3.1, but v₂ is not.
• v₁ and v₂ are eigenvectors in Example 3.3, but v₃ is not.
The practical question is how to fill out a Jordan canonical basis once we have a maximal independent set of eigenvectors. We now define the necessary objects.
Definition 3.4. Suppose T ∈ L(V) has an eigenvalue λ. Its generalized eigenspace is

  K_λ := {x ∈ V : (T − λI)ᵏ(x) = 0 for some k ∈ ℕ} = ⋃_{k ∈ ℕ} N(T − λI)ᵏ

A generalized eigenvector is any non-zero v ∈ K_λ.
As with eigenspaces, the generalized eigenspaces of A ∈ Mₙ(F) are those of the map L_A ∈ L(Fⁿ). It is easy to check that our earlier Jordan canonical bases consist of generalized eigenvectors.
• Example 3.1: We have one eigenvalue λ = 2. Since (A − 2I)² = \begin{pmatrix} 0 & 0 \\ 0 & 0 \end{pmatrix} is the zero matrix, every non-zero vector is a generalized eigenvector; plainly K₂ = ℝ².
• Example 3.3: We see that

  (A − 2I)v₁ = 0,   (A − 3I)v₂ = 0,   (A − 3I)²v₃ = (A − 3I)v₂ = 0

whence β is a basis of generalized eigenvectors. Indeed

  K₂ = E₂ = Span{v₁},   K₃ = Span{v₂, v₃}

though verifying this with current technology is a little awkward. . .
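With a computer algebra system the verification is immediate. A quick sketch using sympy (not part of the notes), computing the nullspaces that define the generalized eigenspaces:

```python
import sympy as sp

# The matrix of Example 3.3
A = sp.Matrix([[-1, 2, 3], [-4, 5, 4], [-2, 1, 4]])

K2 = (A - 2*sp.eye(3)).nullspace()        # K_2 = E_2 = N(A - 2I)
K3 = ((A - 3*sp.eye(3))**2).nullspace()   # K_3 = N((A - 3I)^2)
print(len(K2), len(K3))  # 1 2
```

The dimensions 1 and 2 match the algebraic multiplicities of the eigenvalues 2 and 3, as Theorem 3.5 below predicts.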
In order to easily compute generalized eigenspaces, it is useful to invoke the main result of this
section. We postpone the proof for a while due to its meatiness.
Theorem 3.5. Suppose that the characteristic polynomial of T ∈ L(V) splits over F:

  p(t) = (λ₁ − t)^{m₁} ··· (λₖ − t)^{mₖ}

where the λⱼ are the distinct eigenvalues of T with algebraic multiplicities mⱼ. Then:
1. For each eigenvalue λ with multiplicity m: (a) K_λ = N(T − λI)^m and (b) dim K_λ = m.
2. V = K_{λ₁} ⊕ ··· ⊕ K_{λₖ}: there exists a basis of generalized eigenvectors.
Compare this with the statement on diagonalizability from the start of the course.
With regard to part 2, we shall eventually be able to choose this to be a Jordan canonical basis. In conclusion: a map has a Jordan canonical basis if and only if its characteristic polynomial splits.
Examples 3.6. 1. Observe how Example 3.3 works in this language:

  A = \begin{pmatrix} -1 & 2 & 3 \\ -4 & 5 & 4 \\ -2 & 1 & 4 \end{pmatrix}   ⟹   p(t) = (2 − t)¹(3 − t)²

  K₂ = N(A − 2I)¹ = Span\begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix}   ⟹   dim K₂ = 1

  K₃ = N(A − 3I)² = N\begin{pmatrix} 2 & -1 & -1 \\ 0 & 0 & 0 \\ 2 & -1 & -1 \end{pmatrix} = Span\left\{ \begin{pmatrix} 1 \\ 2 \\ 0 \end{pmatrix}, \begin{pmatrix} 1 \\ 1 \\ 1 \end{pmatrix} \right\}   ⟹   dim K₃ = 2

  ℝ³ = K₂ ⊕ K₃
2. We find the generalized eigenspaces of the matrix A = \begin{pmatrix} 5 & -2 & -1 \\ 0 & 0 & 0 \\ 9 & -6 & -1 \end{pmatrix}. The characteristic polynomial is

  p(t) = det(A − tI) = −t \begin{vmatrix} 5 − t & -1 \\ 9 & -1 − t \end{vmatrix} = −t(t² − 5t + t − 5 + 9) = (0 − t)¹(2 − t)²

• λ = 0 has multiplicity 1; indeed K₀ = N(A − 0I)¹ = N(A) = Span\begin{pmatrix} 1 \\ 1 \\ 3 \end{pmatrix} is just the eigenspace E₀.
• λ = 2 has multiplicity 2:

  K₂ = N(A − 2I)² = N\begin{pmatrix} 3 & -2 & -1 \\ 0 & -2 & 0 \\ 9 & -6 & -3 \end{pmatrix}² = N\begin{pmatrix} 0 & 4 & 0 \\ 0 & 4 & 0 \\ 0 & 12 & 0 \end{pmatrix} = Span\left\{ \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix} \right\}

In this case the corresponding eigenspace is one-dimensional, E₂ = Span\begin{pmatrix} 1 \\ 0 \\ 3 \end{pmatrix} ⊊ K₂, and the matrix is non-diagonalizable.
Observe also that ℝ³ = K₀ ⊕ K₂ in accordance with the Theorem.
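The key computation here, the square of A − 2I and its nullspace, can be checked with a short sympy sketch (not part of the notes):

```python
import sympy as sp

# The matrix of Examples 3.6.2
A = sp.Matrix([[5, -2, -1], [0, 0, 0], [9, -6, -1]])
P = (A - 2*sp.eye(3))**2
print(P)                   # Matrix([[0, 4, 0], [0, 4, 0], [0, 12, 0]])
print(len(P.nullspace()))  # 2, so dim K_2 = 2 as claimed
```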
Properties of Generalized Eigenspaces and the Proof of Theorem 3.5
A lot of work is required to justify our main result. Feel free to skip the proofs at first reading.
Lemma 3.7. Let λ be an eigenvalue of T ∈ L(V). Then:
1. E_λ is a subspace of K_λ, which is itself a subspace of V.
2. K_λ is T-invariant.
3. Suppose K_λ is finite-dimensional and µ ≠ λ. Then:
 (a) K_λ is (T − µI)-invariant and the restriction of T − µI to K_λ is an isomorphism.
 (b) If µ is another eigenvalue, then K_λ ∩ K_µ = {0}. In particular K_λ contains no eigenvectors other than those in E_λ.

Proof. 1. These are an easy exercise.
2. Let x ∈ K_λ; then ∃k such that (T − λI)ᵏ(x) = 0. But then

  (T − λI)ᵏ(T(x)) = (T − λI)ᵏ(T(x) − λx + λx) = (T − λI)^{k+1}(x) + λ(T − λI)ᵏ(x) = 0

Otherwise said, T(x) ∈ K_λ.
3. (a) Let x ∈ K_λ. Part 2 tells us that

  (T − µI)(x) = T(x) − µx ∈ K_λ

whence K_λ is (T − µI)-invariant.
Suppose, for a contradiction, that T − µI is not injective on K_λ. Then

  ∃y ∈ K_λ \ {0} such that (T − µI)(y) = 0

Let k ∈ ℕ be minimal such that (T − λI)ᵏ(y) = 0 and let z = (T − λI)^{k−1}(y). Plainly z ≠ 0, for otherwise k is not minimal. Moreover,

  (T − λI)(z) = (T − λI)ᵏ(y) = 0   ⟹   z ∈ E_λ

Since T − µI and T − λI commute, we can also compute the effect of T − µI:

  (T − µI)(z) = (T − µI)(T − λI)^{k−1}(y) = (T − λI)^{k−1}((T − µI)(y)) = 0

which says that z is an eigenvector in E_µ; if µ isn't an eigenvalue, then we already have our contradiction! Even if µ is an eigenvalue, E_µ ∩ E_λ = {0} provides the desired contradiction.
We conclude that (T − µI)|_{K_λ} ∈ L(K_λ) is injective. Since dim K_λ < ∞, the restriction is automatically an isomorphism.
(b) This is another exercise.
Now to prove Theorem 3.5: remember that the characteristic polynomial of T is assumed to split.
Proof. (Part 1(a)) Fix an eigenvalue λ with multiplicity m. By definition, we have N(T − λI)^m ⊆ K_λ.
For the converse, parts 2 and 3 of the Lemma tell us (why?) that the characteristic polynomial of the restriction T|_{K_λ} is

  p_λ(t) = (λ − t)^{dim K_λ}   from which, since p_λ(t) divides p(t),   dim K_λ ≤ m   (∗)

By the Cayley–Hamilton Theorem, T|_{K_λ} satisfies its characteristic polynomial, whence

  ∀x ∈ K_λ,   (λI − T)^{dim K_λ}(x) = 0   ⟹   K_λ ⊆ N(T − λI)^m
(Parts 1(b) and 2) We prove these simultaneously by induction on the number of distinct eigenvalues of T.
(Base case) If T has only one eigenvalue, then p(t) = (λ − t)^m. Another appeal to Cayley–Hamilton says (T − λI)^m(x) = 0 for all x ∈ V. Thus V = K_λ and dim K_λ = m.
(Induction step) Fix k and suppose the results hold for maps with k distinct eigenvalues. Let T have distinct eigenvalues λ₁, . . . , λₖ, µ, with multiplicities m₁, . . . , mₖ, m respectively. Define¹

  W = R(T − µI)^m

The subspace W has the following properties, the first two of which we leave as exercises:
• W is T-invariant.
• W ∩ K_µ = {0}, so that µ is not an eigenvalue of the restriction T|_W.
• Each K_{λⱼ} ⊆ W: since (T − µI)|_{K_{λⱼ}} is an isomorphism (Lemma part 3), we can invert,

  x ∈ K_{λⱼ}   ⟹   x = (T − µI)^m \left( (T − µI)|_{K_{λⱼ}}^{-1} \right)^m (x) ∈ R(T − µI)^m = W

We conclude that λⱼ is an eigenvalue of the restriction T|_W with generalized eigenspace K_{λⱼ}.
Since T|_W has k distinct eigenvalues, the induction hypotheses apply:

  W = K_{λ₁} ⊕ ··· ⊕ K_{λₖ}   and   p_W(t) = (λ₁ − t)^{dim K_{λ₁}} ··· (λₖ − t)^{dim K_{λₖ}}

Since W ∩ K_µ = {0}, it is enough finally to use the rank–nullity theorem and count dimensions:

  dim V = rank(T − µI)^m + null(T − µI)^m = dim W + dim K_µ = Σ_{j=1}^{k} dim K_{λⱼ} + dim K_µ ≤^{(∗)} m₁ + ··· + mₖ + m = deg p(t) = dim V

The inequality is thus an equality; each dim K_{λⱼ} = mⱼ and dim K_µ = m. We conclude that

  V = K_{λ₁} ⊕ ··· ⊕ K_{λₖ} ⊕ K_µ

which completes the induction step and thus the proof. Whew!
¹This is yet another argument where we consider a suitable subspace to which we can apply an induction hypothesis; recall the spectral theorem, Schur's lemma, bilinear form diagonalization, etc. Theorem 3.12 will provide one more!
Cycles of Generalized Eigenvectors
By Theorem 3.5, for every linear map whose characteristic polynomial splits there exists a generalized eigenbasis. This isn't the same as a Jordan canonical basis, but we're very close!
Example 3.8. The matrix A = \begin{pmatrix} 5 & 1 & 0 \\ 0 & 5 & 1 \\ 0 & 0 & 5 \end{pmatrix} ∈ M₃(ℝ) is a single Jordan block, whence there is a single generalized eigenspace K₅ = ℝ³ and the standard basis ϵ = {e₁, e₂, e₃} is Jordan canonical.
The crucial observation for what follows is that one of these vectors, e₃, generates the others via repeated applications of A − 5I:

  e₂ = (A − 5I)e₃,   e₁ = (A − 5I)e₂ = (A − 5I)²e₃
Definition 3.9. A cycle of generalized eigenvectors for a linear operator T is a set

  β_x := \{ (T − λI)^{k−1}(x), . . . , (T − λI)(x), x \}

where the generator x ∈ K_λ is non-zero and k is minimal such that (T − λI)ᵏ(x) = 0; we call k the length of the cycle.
Note that the first element (T − λI)^{k−1}(x) is an eigenvector.
Our goal is to show that K
λ
has a basis consisting of cycles of generalized eigenvectors; putting these
together results in a Jordan canonical basis.
Lemma 3.10. Let β_x be a cycle of generalized eigenvectors of T with length k. Then:
1. β_x is linearly independent and thus a basis of Span β_x.
2. Span β_x is T-invariant. With respect to β_x, the matrix of the restriction of T is the k × k Jordan block

  [T|_{Span β_x}]_{β_x} = \begin{pmatrix} λ & 1 & & \\ & λ & \ddots & \\ & & \ddots & 1 \\ & & & λ \end{pmatrix}

In what follows, it will be useful to consider the linear map U = T − λI. Note the following:
• The nullspace of U is the eigenspace: N(U) = E_λ ≤ K_λ.
• T commutes with U: that is, TU = UT.
• β_x = {U^{k−1}(x), . . . , U(x), x}; that is, Span β_x = ⟨x⟩ is the U-cyclic subspace generated by x.
Proof. 1. Feed the linear combination Σ_{j=0}^{k−1} aⱼUʲ(x) = 0 to U^{k−1} to obtain

  a₀U^{k−1}(x) = 0   ⟹   a₀ = 0

Now feed the same combination to U^{k−2}, etc., to see that all coefficients aⱼ = 0.
2. Since T and U commute, we see that

  T(Uʲ(x)) = Uʲ(T(x)) = Uʲ((U + λI)(x)) = U^{j+1}(x) + λUʲ(x) ∈ Span β_x

This justifies both T-invariance and the Jordan block claim!
The basic approach to finding a Jordan canonical basis is to find the generalized eigenspaces and play
with cycles until you find a basis for each K
λ
. Many choices of canonical basis exist for a given map!
We’ll consider a more systematic method in the next section.
Examples 3.11. 1. The characteristic polynomial of A = \begin{pmatrix} 1 & 0 & 2 \\ 0 & 1 & 6 \\ 6 & -2 & 1 \end{pmatrix} ∈ M₃(ℝ) splits:

  p(t) = (1 − t)\begin{vmatrix} 1 − t & 6 \\ -2 & 1 − t \end{vmatrix} + 2\begin{vmatrix} 0 & 1 − t \\ 6 & -2 \end{vmatrix} = (1 − t)\left( (1 − t)² + 12 − 12 \right) = (1 − t)³

With only one eigenvalue we see that K₁ = ℝ³. Simply choose any vector in ℝ³ and see what U = A − I does to it! For instance, with x = e₁,

  β_x = \left\{ U²\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, U\begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}, \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix} \right\} = \left\{ \begin{pmatrix} 12 \\ 36 \\ 0 \end{pmatrix}, \begin{pmatrix} 0 \\ 0 \\ 6 \end{pmatrix}, \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix} \right\}

provides a Jordan canonical basis of ℝ³. We conclude

  A = QJQ⁻¹ = \begin{pmatrix} 12 & 0 & 1 \\ 36 & 0 & 0 \\ 0 & 6 & 0 \end{pmatrix} \begin{pmatrix} 1 & 1 & 0 \\ 0 & 1 & 1 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 12 & 0 & 1 \\ 36 & 0 & 0 \\ 0 & 6 & 0 \end{pmatrix}^{-1}

In practice, almost any choice of x ∈ ℝ³ will generate a cycle of length three!
2. The matrix B = \begin{pmatrix} 7 & 1 & -4 \\ 0 & 3 & 0 \\ 8 & 1 & -5 \end{pmatrix} ∈ M₃(ℝ) has characteristic equation

  p(t) = (3 − t)(t² − 2t − 3) = −(t + 1)¹(t − 3)²

• dim K₋₁ = 1 ⟹ K₋₁ = E₋₁ = Span\begin{pmatrix} 1 \\ 0 \\ 2 \end{pmatrix}, spanned by a cycle of length one.
• Since dim K₃ = 2, we have

  K₃ = N(B − 3I)² = N\begin{pmatrix} 4 & 1 & -4 \\ 0 & 0 & 0 \\ 8 & 1 & -8 \end{pmatrix}² = N\begin{pmatrix} -16 & 0 & 16 \\ 0 & 0 & 0 \\ -32 & 0 & 32 \end{pmatrix} = Span\left\{ \begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix}, \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix} \right\}

This is spanned by a cycle of length two: \begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix} is an eigenvector and \begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix} = (B − 3I)\begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}
We conclude that β = \left\{ \begin{pmatrix} 1 \\ 0 \\ 2 \end{pmatrix}, \begin{pmatrix} 1 \\ 0 \\ 1 \end{pmatrix}, \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix} \right\} is a Jordan canonical basis for B, and that

  B = QJQ⁻¹ = \begin{pmatrix} 1 & 1 & 0 \\ 0 & 0 & 1 \\ 2 & 1 & 0 \end{pmatrix} \begin{pmatrix} -1 & 0 & 0 \\ 0 & 3 & 1 \\ 0 & 0 & 3 \end{pmatrix} \begin{pmatrix} 1 & 1 & 0 \\ 0 & 0 & 1 \\ 2 & 1 & 0 \end{pmatrix}^{-1}
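Again the factorization can be confirmed with sympy (not part of the notes):

```python
import sympy as sp

B = sp.Matrix([[7, 1, -4], [0, 3, 0], [8, 1, -5]])
Q = sp.Matrix([[1, 1, 0], [0, 0, 1], [2, 1, 0]])  # columns: the basis beta
print(Q.inv() * B * Q)  # Matrix([[-1, 0, 0], [0, 3, 1], [0, 0, 3]])
```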
3. Let T = d/dx on P₃(ℝ). With respect to the standard basis ϵ = {1, x, x², x³},

  A = [T]_ϵ = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \\ 0 & 0 & 0 & 0 \end{pmatrix}

With only one eigenvalue λ = 0, we have a single generalized eigenspace K₀ = P₃(ℝ). It is easy to check that f(x) = x³ generates a cycle of length four and thus a Jordan canonical basis:

  β = {6, 6x, 3x², x³}   ⟹   [T]_β = \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \end{pmatrix} = \begin{pmatrix} 6 & 0 & 0 & 0 \\ 0 & 6 & 0 & 0 \\ 0 & 0 & 3 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}^{-1} \begin{pmatrix} 0 & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 3 \\ 0 & 0 & 0 & 0 \end{pmatrix} \begin{pmatrix} 6 & 0 & 0 & 0 \\ 0 & 6 & 0 & 0 \\ 0 & 0 & 3 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}
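The diagonal change of basis is quick to verify with sympy (not part of the notes):

```python
import sympy as sp

A = sp.Matrix([[0, 1, 0, 0], [0, 0, 2, 0], [0, 0, 0, 3], [0, 0, 0, 0]])
Q = sp.diag(6, 6, 3, 1)   # coordinate vectors of beta = {6, 6x, 3x^2, x^3}
J = Q.inv() * A * Q
print(J)  # the 4x4 Jordan block with lambda = 0
```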
Our final results state that this process works generally.
Theorem 3.12. Let T ∈ L(V) have an eigenvalue λ. If dim K_λ < ∞, then there exists a basis

  β_λ = β_{x₁} ∪ ··· ∪ β_{xₙ}

of K_λ consisting of finitely many linearly independent cycles.

Intuition suggests that we create cycles β_{xⱼ} by starting with a basis of the eigenspace E_λ and extending backwards: for each x, if x = (T − λI)(y), then x ∈ β_y; now repeat until you have a maximum-length cycle. This is essentially what we do, though a sneaky induction is required to make sure we keep track of everything and guarantee that the result really is a basis of K_λ.
Proof. We prove by induction on m = dim K_λ.
(Base case) If m = 1, then K_λ = E_λ = Span{x} for some eigenvector x. Plainly {x} = β_x.
(Induction step) Fix m ≥ 2. Write n = dim E_λ ≤ m and U = (T − λI)|_{K_λ}.
(i) For the induction hypothesis, suppose every generalized eigenspace with dimension < m (for any linear map!) has a basis consisting of independent cycles of generalized eigenvectors.
(ii) Define W = R(U) ∩ E_λ: that is,

  w ∈ W   ⟺   U(w) = 0 and w = U(v) for some v ∈ K_λ

Let k = dim W, choose a complementary subspace X such that E_λ = W ⊕ X, and select a basis {x_{k+1}, . . . , xₙ} of X. If k = 0, the induction step is finished (why?). Otherwise we continue. . .
(iii) The calculation in the proof of Lemma 3.10 (take j = 1) shows that R(U) is T-invariant; it is therefore the single generalized eigenspace K̃_λ of T|_{R(U)}.
(iv) By the rank–nullity theorem,

  dim R(U) = rank U = dim K_λ − null U = m − dim E_λ < m

By the induction hypothesis, R(U) has a basis of independent cycles. Since the last non-zero element in each cycle is an eigenvector, this basis consists of k distinct cycles β_{x̂₁} ∪ ··· ∪ β_{x̂ₖ} whose terminal vectors form a basis of W.
(v) Since each x̂ⱼ ∈ R(U), there exist vectors x₁, . . . , xₖ such that x̂ⱼ = U(xⱼ). Including the length-one cycles generated by the basis of X, the cycles β_{x₁}, . . . , β_{xₙ} now contain

  dim R(U) + k + (n − k) = rank U + null U = m

vectors. We leave as an exercise the verification that these vectors are linearly independent.
Corollary 3.13. Suppose that the characteristic polynomial of T ∈ L(V) splits (necessarily dim V < ∞). Then there exists a Jordan canonical basis, namely the union of the bases β_λ from Theorem 3.12.

Proof. By Theorem 3.5, V is the direct sum of generalized eigenspaces. By the previous result, each K_λ has a basis β_λ consisting of finitely many cycles. By Lemma 3.10, the matrix of T|_{K_λ} has Jordan canonical form with respect to β_λ. It follows that β = ⋃ β_λ is a Jordan canonical basis for T.
Exercises 3.1 1. For each matrix, find the generalized eigenspaces K_λ, find bases consisting of unions of disjoint cycles of generalized eigenvectors, and thus find a Jordan canonical form J and invertible Q so that the matrix may be expressed as QJQ⁻¹.

  (a) A = \begin{pmatrix} 1 & 1 \\ -1 & 3 \end{pmatrix}   (b) B = \begin{pmatrix} 1 & 2 \\ 3 & 2 \end{pmatrix}   (c) C = \begin{pmatrix} 11 & 4 & 5 \\ 21 & 8 & 11 \\ 3 & 1 & 0 \end{pmatrix}   (d) D = \begin{pmatrix} 2 & 1 & 0 & 0 \\ 0 & 2 & 1 & 0 \\ 0 & 0 & 3 & 0 \\ 0 & 1 & 1 & 3 \end{pmatrix}
2. If β = {v₁, . . . , vₙ} is a Jordan canonical basis, what can you say about v₁? Briefly explain why the linear map L_A ∈ L(ℝ²), where A = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}, has no Jordan canonical form.
3. Find a Jordan canonical basis for each linear map T:
(a) T ∈ L(P₂(ℝ)) defined by T(f(x)) = 2f(x) − f′(x)
(b) T(f) = f′ defined on Span{1, t, t², eᵗ, teᵗ}
(c) T(A) = 2A + Aᵀ defined on M₂(ℝ)
4. In Example 3.11.1, suppose x = \begin{pmatrix} a \\ b \\ c \end{pmatrix}. Show that almost any choice of a, b, c produces a Jordan canonical basis β_x.
5. We complete the proof of Lemma 3.7.
(a) Prove part 1: that E_λ ≤ K_λ ≤ V.
(b) Verify that T − µI and T − λI commute.
(c) Prove part 3(b): generalized eigenspaces for distinct eigenvalues have trivial intersection.
6. Consider the induction step in the proof of Theorem 3.5.
(a) Prove that W is T-invariant.
(b) Explain why W ∩ K_µ = {0}.
(c) The assumption p_W(t) = (λ₁ − t)^{dim K_{λ₁}} ··· (λₖ − t)^{dim K_{λₖ}} near the end of the proof is the induction hypothesis for part 1(b). Why can't we also assume that dim K_{λⱼ} = mⱼ and thus tidy the inequality argument near the end of the proof?
7. We finish some of the details of Theorem 3.12.
(a) In step (ii), suppose dim W = k = 0. Explain why {x₁, . . . , xₙ} is in fact a basis of K_λ, so that the rest of the proof is unnecessary.
(b) In step (v), prove that the m vectors in the cycles β_{x₁}, . . . , β_{xₙ} are linearly independent. (Hint: model your argument on part 1 of Lemma 3.10)
3.2 Cycle Patterns and the Dot Diagram
In this section we obtain a useful result that helps us compute Jordan forms more efficiently and
systematically. To give us some clues how to proceed, here is a lengthy example.
Example 3.14. Precisely three Jordan canonical forms A, B, C ∈ M₃(ℝ) correspond to the characteristic polynomial p(t) = (5 − t)³:

  A = \begin{pmatrix} 5 & 0 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 5 \end{pmatrix}   B = \begin{pmatrix} 5 & 1 & 0 \\ 0 & 5 & 0 \\ 0 & 0 & 5 \end{pmatrix}   C = \begin{pmatrix} 5 & 1 & 0 \\ 0 & 5 & 1 \\ 0 & 0 & 5 \end{pmatrix}

In all three cases the standard basis β = {e₁, e₂, e₃} is Jordan canonical, so how do we distinguish things? By considering the number and lengths of the cycles of generalized eigenvectors.
• A has eigenspace E₅ = K₅ = ℝ³. Since (A − 5I)v = 0 for all v ∈ ℝ³, we have maximum cycle-length one. We therefore need three distinct cycles to construct a Jordan basis, e.g.

  β_{e₁} = {e₁},   β_{e₂} = {e₂},   β_{e₃} = {e₃}   ⟹   β = β_{e₁} ∪ β_{e₂} ∪ β_{e₃} = {e₁, e₂, e₃}
• B has eigenspace E₅ = Span{e₁, e₃}. By computing

  v = \begin{pmatrix} a \\ b \\ c \end{pmatrix}   ⟹   (B − 5I)v = \begin{pmatrix} b \\ 0 \\ 0 \end{pmatrix}   ⟹   (B − 5I)²v = 0

we see that β_v is a cycle with maximum length two, provided b ≠ 0 (v ∉ E₅). We therefore need two distinct cycles, of lengths two and one, to construct a Jordan basis, e.g.

  β_{e₂} = \{ (B − 5I)e₂, e₂ \} = {e₁, e₂},   β_{e₃} = {e₃}   ⟹   β = β_{e₂} ∪ β_{e₃} = {e₁, e₂, e₃}
• C has eigenspace E₅ = Span{e₁}. This time

  v = \begin{pmatrix} a \\ b \\ c \end{pmatrix}   ⟹   (C − 5I)v = \begin{pmatrix} b \\ c \\ 0 \end{pmatrix},   (C − 5I)²v = \begin{pmatrix} c \\ 0 \\ 0 \end{pmatrix},   (C − 5I)³v = 0

generates a cycle with maximum length three, provided c ≠ 0. Indeed such a cycle is itself a Jordan basis, so one cycle is all we need:

  β = β_{e₃} = \{ (C − 5I)²e₃, (C − 5I)e₃, e₃ \} = {e₁, e₂, e₃}
Why is the example relevant? Suppose that dim_ℝ V = 3 and that T ∈ L(V) has characteristic polynomial p(t) = (5 − t)³. Theorem 3.12 tells us that T has a Jordan canonical form, and that it is moreover one of the above matrices A, B, C. Our goal is to develop a method whereby the pattern of cycle-lengths can be determined, thus allowing us to discern which Jordan form is correct. As a side-effect, this will also demonstrate that the pattern of cycle lengths for a given T is independent of the Jordan basis so that, up to some reasonable restriction, the Jordan form of T is unique. To aid us in this endeavor, we require some terminology. . .
Definition 3.15. Let V be finite-dimensional and K_λ a generalized eigenspace of T ∈ L(V). Following Theorem 3.12, assume that β_λ = β_{x₁} ∪ ··· ∪ β_{xₙ} is a Jordan canonical basis of T|_{K_λ}, where the cycles are arranged in non-increasing length. That is:
1. β_{xⱼ} = {(T − λI)^{kⱼ−1}(xⱼ), . . . , xⱼ} has length kⱼ, and
2. k₁ ≥ k₂ ≥ ··· ≥ kₙ
The dot diagram of T|_{K_λ} is a representation of the elements of β_λ, one dot for each vector: the jᵗʰ column represents the elements of β_{xⱼ} arranged vertically with xⱼ at the bottom.
Given a linear map, our eventual goal is to identify the dot diagram as an intermediate step in the
computation of a Jordan basis. First, however, we observe how the conversion of dot diagrams to a
Jordan form is essentially trivial.
Example 3.16. Suppose dim V = 14 and that T ∈ L(V) has the following eigenvalues and dot diagrams:

  λ₁ = −4      λ₂ = 7      λ₃ = 12
  • • • •      • •         • • •
  • •          • •
               •

Then the generalized eigenspaces of T satisfy:
• K₋₄ = N(T + 4I)² and dim K₋₄ = 6;
• K₇ = N(T − 7I)³ and dim K₇ = 5;
• K₁₂ = N(T − 12I) = E₁₂ and dim K₁₂ = 3.
T has a Jordan canonical basis β with respect to which its Jordan canonical form is the block-diagonal matrix

  [T]_β = diag\left( \begin{pmatrix} -4 & 1 \\ 0 & -4 \end{pmatrix}, \begin{pmatrix} -4 & 1 \\ 0 & -4 \end{pmatrix}, (-4), (-4), \begin{pmatrix} 7 & 1 & 0 \\ 0 & 7 & 1 \\ 0 & 0 & 7 \end{pmatrix}, \begin{pmatrix} 7 & 1 \\ 0 & 7 \end{pmatrix}, (12), (12), (12) \right)

Note how the sizes of the Jordan blocks are non-increasing within each eigenvalue. For instance, for λ₁ = −4, the sequence of cycle lengths (kⱼ) is 2 ≥ 2 ≥ 1 ≥ 1.
Theorem 3.17. Suppose β_λ is a Jordan canonical basis of T|_{K_λ} as described in Definition 3.15, and suppose the iᵗʰ row of the dot diagram has rᵢ entries. Then:
1. For each r ∈ ℕ, the vectors associated to the dots in the first r rows form a basis of N(T − λI)ʳ.
2. r₁ = null(T − λI) = dim V − rank(T − λI)
3. When i > 1, rᵢ = null(T − λI)ⁱ − null(T − λI)^{i−1} = rank(T − λI)^{i−1} − rank(T − λI)ⁱ
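Part 3 gives a purely rank-based recipe for the row lengths. A minimal sympy sketch (not part of the notes; the helper name `dot_diagram_rows` is ours), valid when we pass in dim K_λ so the loop knows when to stop:

```python
import sympy as sp

def dot_diagram_rows(M, lam, dim_K):
    """Row lengths r_i = rank U^{i-1} - rank U^i, where U = M - lam*I."""
    U = M - lam * sp.eye(M.shape[0])
    rows = []
    while sum(rows) < dim_K:
        i = len(rows) + 1
        rows.append((U**(i - 1)).rank() - (U**i).rank())
    return rows

B = sp.Matrix([[5, 1, 0], [0, 5, 0], [0, 0, 5]])
C = sp.Matrix([[5, 1, 0], [0, 5, 1], [0, 0, 5]])
print(dot_diagram_rows(B, 5, 3))  # [2, 1]
print(dot_diagram_rows(C, 5, 3))  # [1, 1, 1]
```

This reproduces the diagrams of Example 3.14: two cycles (lengths 2 and 1) for B, one cycle of length 3 for C.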
Example (3.14 cont). We describe the dot diagrams of the three matrices A, B, C, along with the corresponding vectors in the Jordan canonical basis β and the values rᵢ.

A:   x₁ x₂ x₃   i.e.   e₁ e₂ e₃

Since A − 5I is the zero matrix, r₁ = 3 − rank(A − 5I) = 3. The dot diagram has one row, corresponding to three independent cycles of length one: β = β_{e₁} ∪ β_{e₂} ∪ β_{e₃}.
B:   (B − 5I)x₁  x₂   i.e.   e₁ e₃
     x₁                      e₂

Row 1: B − 5I = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} ⟹ rank(B − 5I) = 1 and r₁ = 3 − 1 = 2. The first row {e₁, e₃} is a basis of E₅ = N(B − 5I).
Row 2: (B − 5I)² is the zero matrix, whence r₂ = rank(B − 5I) − rank(B − 5I)² = 1 − 0 = 1.
The dot diagram corresponds to β = β_{e₂} ∪ β_{e₃} = {e₁, e₂} ∪ {e₃}.
C:   (C − 5I)²x₁   i.e.   e₁
     (C − 5I)x₁           e₂
     x₁                   e₃

Row 1: C − 5I = \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{pmatrix} ⟹ r₁ = 3 − rank(C − 5I) = 3 − 2 = 1. The first row {e₁} is a basis of E₅ = N(C − 5I).
Row 2: (C − 5I)² = \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix} ⟹ r₂ = rank(C − 5I) − rank(C − 5I)² = 2 − 1 = 1. The first two rows {e₁, e₂} form a basis of N(C − 5I)².
Row 3: (C − 5I)³ is the zero matrix, whence r₃ = rank(C − 5I)² − rank(C − 5I)³ = 1 − 0 = 1.
Proof. As previously, let U = T − λI.
1. Since each dot represents a basis vector Uᵖ(xⱼ), any v ∈ K_λ may be written uniquely as a linear combination of the dots. Applying U simply moves all the dots up a row and sends all dots in the top row to 0. It follows that v ∈ N(Uʳ) ⟺ it lies in the span of the first r rows. Since the dots are linearly independent, they form a basis.
2. By part 1, r₁ = dim N(U) = null(T − λI) = dim V − rank(T − λI).
3. More generally,

  rᵢ = (r₁ + ··· + rᵢ) − (r₁ + ··· + r_{i−1}) = dim N(Uⁱ) − dim N(U^{i−1}) = null(Uⁱ) − null(U^{i−1}) = rank(T − λI)^{i−1} − rank(T − λI)ⁱ
Since the ranks of the maps (T − λI)ⁱ are independent of basis, so also is the dot diagram. . .

Corollary 3.18. For any eigenvalue λ, the dot diagram is uniquely determined by T and λ. If we list the Jordan blocks for each eigenvalue in non-increasing order of size, then the Jordan form of a linear map is unique up to the order of the eigenvalues.
We now have a slightly more systematic method for finding Jordan canonical bases.
Example 3.19. The matrix A = \begin{pmatrix} 6 & 2 & 4 & 6 \\ 0 & 3 & 0 & 0 \\ 0 & 0 & 3 & 0 \\ -2 & -1 & -2 & -1 \end{pmatrix} has characteristic equation

  p(t) = (3 − t)² \begin{vmatrix} 6 − t & 6 \\ -2 & -1 − t \end{vmatrix} = (2 − t)(3 − t)³

We have two generalized eigenspaces:
• K₂ = E₂ = N(A − 2I) = N\begin{pmatrix} 4 & 2 & 4 & 6 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ -2 & -1 & -2 & -3 \end{pmatrix} = Span\begin{pmatrix} 3 \\ 0 \\ 0 \\ -2 \end{pmatrix}. The trivial dot diagram • corresponds to this single eigenvector.
• K₃ = N(A − 3I)³. To find the dot diagram, compute powers of A − 3I:
Row 1: A − 3I = \begin{pmatrix} 3 & 2 & 4 & 6 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ -2 & -1 & -2 & -4 \end{pmatrix} has rank 2, and the first row has r₁ = 4 − 2 = 2 entries.
Row 2: (A − 3I)² = \begin{pmatrix} -3 & 0 & 0 & -6 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 2 & 0 & 0 & 4 \end{pmatrix} has rank 1, and the second row has r₂ = 2 − 1 = 1 entry.
Since we now have three dots (equalling dim K₃), the algorithm terminates and the dot diagram for K₃ is

  • •
  •
For the single dot in the second row, we choose something in N(A − 3I)² which isn't an eigenvector; perhaps the simplest choice is x₁ = e₂, which yields the two-cycle

  β_{x₁} = \{ (A − 3I)x₁, x₁ \} = \left\{ \begin{pmatrix} 2 \\ 0 \\ 0 \\ -1 \end{pmatrix}, \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix} \right\}

To complete the first row, choose any eigenvector to complete the span: for instance x₂ = \begin{pmatrix} 0 \\ -2 \\ 1 \\ 0 \end{pmatrix}.
We now have suitable cycles and a Jordan canonical basis/form:

  β = \left\{ \begin{pmatrix} 3 \\ 0 \\ 0 \\ -2 \end{pmatrix}, \begin{pmatrix} 2 \\ 0 \\ 0 \\ -1 \end{pmatrix}, \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix}, \begin{pmatrix} 0 \\ -2 \\ 1 \\ 0 \end{pmatrix} \right\},   A = QJQ⁻¹ = \begin{pmatrix} 3 & 2 & 0 & 0 \\ 0 & 0 & 1 & -2 \\ 0 & 0 & 0 & 1 \\ -2 & -1 & 0 & 0 \end{pmatrix} \begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 3 & 1 & 0 \\ 0 & 0 & 3 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix} \begin{pmatrix} 3 & 2 & 0 & 0 \\ 0 & 0 & 1 & -2 \\ 0 & 0 & 0 & 1 \\ -2 & -1 & 0 & 0 \end{pmatrix}^{-1}
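As before, the factorization can be double-checked with sympy (not part of the notes):

```python
import sympy as sp

A = sp.Matrix([[6, 2, 4, 6], [0, 3, 0, 0], [0, 0, 3, 0], [-2, -1, -2, -1]])
Q = sp.Matrix([[3, 2, 0, 0], [0, 0, 1, -2], [0, 0, 0, 1], [-2, -1, 0, 0]])
print(Q.inv() * A * Q)  # diag(2, J_2(3), 3)
```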
Other choices are available! For instance, if we'd chosen the two-cycle generated by x₁ = e₃, we'd obtain a different Jordan basis but the same canonical form J:

  β̃ = \left\{ \begin{pmatrix} 3 \\ 0 \\ 0 \\ -2 \end{pmatrix}, \begin{pmatrix} 4 \\ 0 \\ 0 \\ -2 \end{pmatrix}, \begin{pmatrix} 0 \\ 0 \\ 1 \\ 0 \end{pmatrix}, \begin{pmatrix} 0 \\ -2 \\ 1 \\ 0 \end{pmatrix} \right\},   A = \begin{pmatrix} 3 & 4 & 0 & 0 \\ 0 & 0 & 0 & -2 \\ 0 & 0 & 1 & 1 \\ -2 & -2 & 0 & 0 \end{pmatrix} \begin{pmatrix} 2 & 0 & 0 & 0 \\ 0 & 3 & 1 & 0 \\ 0 & 0 & 3 & 0 \\ 0 & 0 & 0 & 3 \end{pmatrix} \begin{pmatrix} 3 & 4 & 0 & 0 \\ 0 & 0 & 0 & -2 \\ 0 & 0 & 1 & 1 \\ -2 & -2 & 0 & 0 \end{pmatrix}^{-1}
We do one final example for a non-matrix map.
Example 3.20. Let ϵ = {1, x, y, x², y², xy} and define T(f(x, y)) = 2 ∂f/∂x − ∂f/∂y as a linear operator on V = Span_ℝ ϵ. The matrix and characteristic polynomial of T are easy to compute:

  [T]_ϵ = \begin{pmatrix} 0 & 2 & -1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 4 & 0 & -1 \\ 0 & 0 & 0 & 0 & -2 & 2 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{pmatrix}   ⟹   p(t) = t⁶,   [T²]_ϵ = \begin{pmatrix} 0 & 0 & 0 & 8 & 2 & -4 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{pmatrix},   [T³]_ϵ = O
There is only one eigenvalue λ = 0 and therefore one generalized eigenspace K₀ = V. We could keep working with matrices, but it is easy to translate the nullspaces of the matrices back to subspaces of V, from which the necessary data can be read off:

  N(T) = Span{1, x + 2y, x² + 4y² + 4xy}   ⟹   null T = 3, rank T = 3, r₁ = 3
  N(T²) = Span{1, x, y, x² + 2xy, 2y² + xy}   ⟹   null T² = 5, rank T² = 1, r₂ = 3 − 1 = 2

We now have five dots; since dim K₀ = 6, the last row has one, and the dot diagram is

  • • •
  • •
  •

Since the first two rows span N(T²), we may choose any f₁ ∉ N(T²) for the final dot: f₁ = xy is suitable, from which the first column of the dot diagram becomes
  T²(xy)        −4
  T(xy)    =    2y − x
  xy            xy

Now choose the second dot on the second row to be anything in N(T²) such that the first two rows span N(T²): this time f₂ = x² − 4y² is suitable, and the diagram becomes:

  T²(xy)   T(x² − 4y²)        −4        4x + 8y
  T(xy)    x² − 4y²      =    2y − x    x² − 4y²
  xy                          xy
The final dot is now chosen so that the first row spans N(T): this time f₃ = x² + 4y² + 4xy works. The result is a Jordan canonical basis and form for T:

  β = \{ −4, 2y − x, xy, 4x + 8y, x² − 4y², x² + 4y² + 4xy \},   J = [T]_β = diag\left( \begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{pmatrix}, \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}, (0) \right)

As previously, many other choices of cycle-generators f₁, f₂, f₃ are available; while these result in different Jordan canonical bases, Corollary 3.18 assures us that we'll always obtain the same canonical form J.
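The cycle computations for this differential operator are easily reproduced symbolically; a sympy sketch (not part of the notes):

```python
import sympy as sp

x, y = sp.symbols('x y')
T = lambda f: 2*sp.diff(f, x) - sp.diff(f, y)   # T = 2 d/dx - d/dy

print(T(x*y))                    # -x + 2*y
print(T(T(x*y)))                 # -4, so xy generates a length-3 cycle
print(T(x**2 - 4*y**2))          # 4*x + 8*y
print(T(x**2 + 4*y**2 + 4*x*y))  # 0, an eigenvector (length-1 cycle)
```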
Exercises 3.2 1. Let T be a linear operator whose characteristic polynomial splits. Suppose the eigenvalues and the dot diagrams for the generalized eigenspaces K_{λᵢ} are as follows:

  λ₁ = 2,   λ₂ = 4,   λ₃ = 3   (dot diagrams not reproduced in this copy)

Find the Jordan form J of T.
2. Suppose T has Jordan canonical form

  J = diag\left( \begin{pmatrix} 2 & 1 & 0 \\ 0 & 2 & 1 \\ 0 & 0 & 2 \end{pmatrix}, \begin{pmatrix} 2 & 1 \\ 0 & 2 \end{pmatrix}, (3), (3) \right)

(a) Find the characteristic polynomial of T.
(b) Find the dot diagram for each eigenvalue.
(c) For each eigenvalue, find the smallest kⱼ such that K_{λⱼ} = N(T − λⱼI)^{kⱼ}.
3. For each matrix A, find a Jordan canonical form and an invertible Q such that A = QJQ⁻¹.

  (a) A = \begin{pmatrix} 3 & 3 & 2 \\ 7 & 6 & 3 \\ 1 & 1 & 2 \end{pmatrix}   (b) A = \begin{pmatrix} 0 & 1 & 1 \\ 4 & 4 & 2 \\ 2 & 1 & 1 \end{pmatrix}   (c) A = \begin{pmatrix} 0 & 3 & 1 & 2 \\ 2 & 1 & 1 & 2 \\ 2 & 1 & 1 & 2 \\ 2 & 3 & 1 & 4 \end{pmatrix}
4. For each linear operator T, find a Jordan canonical form J and basis β:
(a) T(f) = f′ on Span_ℝ{eᵗ, teᵗ, t²eᵗ, e²ᵗ}
(b) T(f(x)) = x f″(x) on P₃(ℝ)
(c) T(f) = a ∂f/∂x + b ∂f/∂y on Span_ℝ{1, x, y, x², y², xy}. How does your answer depend on a, b?
5. (Generalized Eigenvector Method for ODEs) Let A ∈ Mₙ(ℝ) have an eigenvalue λ and suppose β_{v₀} = {v_{k−1}, . . . , v₁, v₀} is a cycle of generalized eigenvectors for this eigenvalue. Show that

  x(t) := e^{λt} Σ_{j=0}^{k−1} bⱼ(t)vⱼ   satisfies   x′(t) = Ax   ⟺   b′₀(t) = 0, and b′ⱼ(t) = b_{j−1}(t) when j ≥ 1

Use this method to solve the system of differential equations

  x′ = \begin{pmatrix} 3 & 1 & 0 & 0 \\ 0 & 3 & 1 & 0 \\ 0 & 0 & 3 & 0 \\ 0 & 0 & 0 & 2 \end{pmatrix} x
3.3 The Rational Canonical Form (non-examinable)
We finish the course with a very quick discussion of what can be done when the characteristic polynomial of a linear map does not split. In such a situation, we may assume that

  p(t) = (−1)ⁿ (ϕ₁(t))^{m₁} ··· (ϕₖ(t))^{mₖ}   (∗)

where each ϕⱼ(t) is an irreducible monic polynomial over the field.
Example 3.21. The following matrix has characteristic equation p(t) = (t² + 1)²(3 − t):

  A = \begin{pmatrix} 0 & -1 & 0 & 0 & 0 \\ 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & -1 & 0 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 3 \end{pmatrix} ∈ M₅(ℝ)

This doesn't split over ℝ since t² + 1 = 0 has no real roots. It is, however, diagonalizable over ℂ.
A couple of basic facts from algebra:
• Every polynomial splits over ℂ: every A ∈ Mₙ(ℂ) therefore has a Jordan form.
• Every polynomial over ℝ factorizes into linear or irreducible quadratic factors.
The question is how to deal with non-linear irreducible factors in the characteristic polynomial.
Definition 3.22. The monic polynomial tᵏ + a_{k−1}t^{k−1} + ··· + a₀ has companion matrix

  \begin{pmatrix} 0 & 0 & 0 & ⋯ & 0 & -a₀ \\ 1 & 0 & 0 & & 0 & -a₁ \\ 0 & 1 & 0 & & 0 & -a₂ \\ ⋮ & & & \ddots & & ⋮ \\ 0 & 0 & 0 & & 0 & -a_{k−2} \\ 0 & 0 & 0 & ⋯ & 1 & -a_{k−1} \end{pmatrix}

(when k = 1, this is the 1 × 1 matrix (−a₀))
If T ∈ L(V) has characteristic polynomial (∗), then a rational canonical basis is a basis β for which

  [T]_β = \begin{pmatrix} C₁ & O & ⋯ & O \\ O & C₂ & & O \\ ⋮ & & \ddots & ⋮ \\ O & O & ⋯ & C_r \end{pmatrix}

where each Cⱼ is the companion matrix of some (ϕᵢ(t))^{sⱼ} with sⱼ ≤ mᵢ. We call [T]_β a rational canonical form of T.
We state the main result without proof:
Theorem 3.23. A rational canonical basis exists for any linear operator T on a finite-dimensional
vector space V. The canonical form is unique up to ordering of companion matrices.
Example (3.21 cont). The matrix A is already in rational canonical form: the standard basis is rational canonical with three companion blocks,

  C₁ = C₂ = \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix},   C₃ = (3)
Example 3.24. Let A = \begin{pmatrix} 4 & 3 \\ -2 & 2 \end{pmatrix} ∈ M₂(ℝ). Its characteristic polynomial

  p(t) = t² − 6t + 14 = (t − 3)² + 5

doesn't split over ℝ and so A has no eigenvalues. Instead simply pick a vector, x = \begin{pmatrix} 1 \\ 0 \end{pmatrix} (say), define y = Ax = \begin{pmatrix} 4 \\ -2 \end{pmatrix}, let β = {x, y} and observe that

  [L_A]_β = \begin{pmatrix} 0 & -14 \\ 1 & 6 \end{pmatrix}

is a rational canonical form. Indeed this works for any x ≠ 0: if β := {x, Ax}, then Cayley–Hamilton forces

  A²x = (6A − 14I)x = −14x + 6Ax   ⟹   [L_A]_β = \begin{pmatrix} 0 & -14 \\ 1 & 6 \end{pmatrix}

whence β is a rational canonical basis and the form [L_A]_β is independent of x!
A systematic approach to finding rational canonical forms is similar to that for Jordan forms: for each irreducible divisor ϕ(t) of p(t), the subspace K_ϕ = N(ϕ(T)^m) plays a role analogous to a generalized eigenspace; indeed K_λ = K_ϕ for the linear irreducible factor ϕ(t) = t − λ!
We finish with two examples; hopefully the approach is intuitive, even without theoretical justification.
Examples 3.25. If the characteristic polynomial of T ∈ L(ℝ⁴) is

  p(t) = (ϕ(t))² = (t² − 2t + 3)² = t⁴ − 4t³ + 10t² − 12t + 9

then there are two possible rational canonical forms; here is an example of each.
1. If A = \begin{pmatrix} 0 & -15 & 0 & 9 \\ 2 & 2 & 3 & 0 \\ 0 & 9 & 0 & -6 \\ 3 & 0 & 5 & 2 \end{pmatrix}, then ϕ(A) = O is the zero matrix, whence N(ϕ(A)) = ℝ⁴. Since ϕ(t) isn't the full characteristic polynomial, we expect there to be two independent cycles of length two in the canonical basis. Start with something simple as a guess:
  x₁ = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix}   ⟹   x₂ = Ax₁ = \begin{pmatrix} 0 \\ 2 \\ 0 \\ 3 \end{pmatrix}   ⟹   Ax₂ = \begin{pmatrix} -3 \\ 4 \\ 0 \\ 6 \end{pmatrix} = −3x₁ + 2x₂

Now make another choice that isn't in the span of {x₁, x₂}:

  x₃ = \begin{pmatrix} 0 \\ 0 \\ 1 \\ 0 \end{pmatrix}   ⟹   x₄ = Ax₃ = \begin{pmatrix} 0 \\ 3 \\ 0 \\ 5 \end{pmatrix}   ⟹   Ax₄ = \begin{pmatrix} 0 \\ 6 \\ -3 \\ 10 \end{pmatrix} = −3x₃ + 2x₄
We therefore have a rational canonical basis β = {x₁, x₂, x₃, x₄} and

  A = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 2 & 0 & 3 \\ 0 & 0 & 1 & 0 \\ 0 & 3 & 0 & 5 \end{pmatrix} \begin{pmatrix} 0 & -3 & 0 & 0 \\ 1 & 2 & 0 & 0 \\ 0 & 0 & 0 & -3 \\ 0 & 0 & 1 & 2 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 2 & 0 & 3 \\ 0 & 0 & 1 & 0 \\ 0 & 3 & 0 & 5 \end{pmatrix}^{-1}

Over ℂ, this example is diagonalizable. Indeed each of the 2 × 2 companion matrices is diagonalizable over ℂ.
2. Let B = \begin{pmatrix} 0 & 0 & 2 & 1 \\ 1 & 1 & -1 & -1 \\ 0 & 1 & -2 & -16 \\ 0 & 0 & 1 & 5 \end{pmatrix}. This time

  ϕ(B) = B² − 2B + 3I = \begin{pmatrix} 3 & 2 & -7 & -29 \\ -1 & 1 & 4 & 13 \\ 1 & -3 & -6 & -17 \\ 0 & 1 & 1 & 2 \end{pmatrix}   ⟹   N(ϕ(B)) = Span\left\{ \begin{pmatrix} 3 \\ -1 \\ 1 \\ 0 \end{pmatrix}, \begin{pmatrix} 11 \\ -2 \\ 0 \\ 1 \end{pmatrix} \right\}
Anything not in this span will suffice as a generator for a single cycle of length four: e.g.,

  x₁ = \begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix},   x₂ = Bx₁ = \begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix},   x₃ = Bx₂ = \begin{pmatrix} 0 \\ 1 \\ 1 \\ 0 \end{pmatrix},   x₄ = Bx₃ = \begin{pmatrix} 2 \\ 0 \\ -1 \\ 1 \end{pmatrix}

  Bx₄ = \begin{pmatrix} -1 \\ 2 \\ -14 \\ 4 \end{pmatrix} = −9\begin{pmatrix} 1 \\ 0 \\ 0 \\ 0 \end{pmatrix} + 12\begin{pmatrix} 0 \\ 1 \\ 0 \\ 0 \end{pmatrix} − 10\begin{pmatrix} 0 \\ 1 \\ 1 \\ 0 \end{pmatrix} + 4\begin{pmatrix} 2 \\ 0 \\ -1 \\ 1 \end{pmatrix}
We therefore have a rational canonical basis β = {x₁, x₂, x₃, x₄} and

  B = \begin{pmatrix} 1 & 0 & 0 & 2 \\ 0 & 1 & 1 & 0 \\ 0 & 0 & 1 & -1 \\ 0 & 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} 0 & 0 & 0 & -9 \\ 1 & 0 & 0 & 12 \\ 0 & 1 & 0 & -10 \\ 0 & 0 & 1 & 4 \end{pmatrix} \begin{pmatrix} 1 & 0 & 0 & 2 \\ 0 & 1 & 1 & 0 \\ 0 & 0 & 1 & -1 \\ 0 & 0 & 0 & 1 \end{pmatrix}^{-1}

In contrast to the first example, B isn't diagonalizable over ℂ. It has Jordan form

  J = \begin{pmatrix} λ & 1 & 0 & 0 \\ 0 & λ & 0 & 0 \\ 0 & 0 & λ̄ & 1 \\ 0 & 0 & 0 & λ̄ \end{pmatrix}   where λ = 1 + i√2