5 Fractal Geometry

5.1 Natural Geometry, Self-similarity and Fractal Dimension

The objects of classical geometry (lines, curves, spheres, etc.) tend to seem ﬂatter and less interesting

as one zooms in: at small scales, every differentiable curve looks like a line segment! By contrast,

real-world objects tend to exhibit greater detail at smaller scales. A seemingly spherical orange is

dimpled on closer inspection: is its surface area that of a sphere, or is the area greater due to the

dimples? What if we zoom in further? Under a microscope, the dimples in the orange are seen to

have minute cracks and ﬁssures. With modern technology, we can ‘see’ almost to the molecular level;

what does surface area even mean at such a scale?

The Length of a Coastline In 1967 Benoit Mandelbrot asked a related question in a now-famous

paper, How Long Is the Coast of Britain? Statistical Self-Similarity and Fractional Dimension. His essential

point was that this question has no simple answer:

Should one measure by walking along the mean

high tide line? But where is this? Do we ‘walk’ round every pebble? Do we skirt every grain of sand?

Every molecule? As the scale of consideration shrinks, the measured length becomes absurdly large.

Here is a sketch of Mandelbrot’s approach.

• Given a ruler of length R, let N be the number required to trace round the coastline when laid

end-to-end.

• Plot log N against log(1/R) for several sizes of ruler. The data suggests a straight line!

log N ≈ log k + D log(1/R) = log(kR

−D

) =⇒ N ≈ kR

−D

The number D is Mandelbrot’s fractal dimension of the coastline.

This notion of fractal dimension is purely empirical, though it does seem to capture something about

the ‘roughness’ of a coastline: the bumpier the coast, the greater its fractal dimension. For mainland

Britain with its smooth east and rugged west coasts D ≈ 1.25. Given its many fjords, Norway has a

far rougher coastline and thus a higher fractal dimension D ≈ 1.52.

Example 5.1. As a sanity check, consider a smooth circular ‘coastline.’

Approximate the circumference using N rulers of length R: clearly

R = 2 sin

As N → ∞, the small angle approximation for sine applies,

R ≈

2π

=⇒ N ≈ 2π R

−1

2π

where the approximation improves as N → ∞. The fractal dimension of a circle is therefore D = 1.

The same analysis applies to any smooth curve (Exercise 3).

The ofﬁcial answer from the Ordnance Survey (the UK government mapping ofﬁce) is, ‘It depends.’ The all-knowing

CIA states 7723 miles, though offers no evidence as to why.

For more detail see the Fractal Foundation’s website. Mandelbrot coined the word fractal, though he didn’t invent the

concept from nothing. Rather he applied earlier ideas of Hausdorff, Minkowski and others, and observed how the natural

world contains many examples of fractal structures.

Our goal is to describe a related notion of fractional dimension for self-similar objects. To help moti-

vate the deﬁnition, recall some of the standard objects of pre-fractal geometry.

Segment A segment can be viewed as N copies of itself each scaled by a

factor r =

Square A square comprises N copies of itself scaled by a factor r =

√

Cube A cube comprises N copies of itself scaled by a factor r =

√

In each case observe that N =





where D is the usual dimension of the

object (1, 2 or 3). Inspired by this, we make a loose deﬁnition.

Deﬁnition 5.2. A geometric ﬁgure is self-similar if it may be subdivided into N similar copies of

itself, each scaled by a magniﬁcation factor r < 1. The fractal dimension of such a ﬁgure is

D := log

1/r

N =

log N

log( 1/r)

= −

log N

log r

Example 5.3. The botanical pictures below offer some evidence for non-integer fractal dimension

and for the idea that self-similarity is a natural phenomenon. The ‘tree’ comprises N = 3 copies of

itself, each scaled by r = 0.4. Its fractal dimension is therefore D = −

log 3

log 0.4

≈ 1.199.

The fern has N = 7 and r = 0.3 for a fractal dimension D = −

log 7

log 0.3

≈ 1.616.

Tree fractal D ≈ 1.199 Fern fractal D ≈ 1.616

With dimensions between 1 and 2, both objects exhibit an intuitive idea of fractal dimension: both

seem to occupy more space than mere lines, but neither has positive area. Moreover, the fern seems

to occupy more space—has higher dimension—than the tree. (The ‘trunk’ and ‘branches’ in the ﬁrst

picture aren’t really part of the fractal and are drawn only to give the picture a skeleton.)

Example 5.4 (Cantor’s Middle-third Set). This famous example dates from the late 1800s.

Starting with the unit interval C

= [0, 1], deﬁne a se-

quence of sets ( C

) where C

n+1

is obtained by deleting the

open ‘middle-third’ of each interval in C

; for instance





∪



, 1



Cantor’s set is essentially the limit of this sequence:

C :=

∞

n=0

Cantor’s set has several strange properties, none of which we establish rigorously.

Zero length The sum of the lengths of the disjoint sub-intervals comprising C

is length(C

) =





since we delete

of the remaining set at each step. It follows that

∀n ∈ N

, length(C) ≤





=⇒ length(C) = 0

We conclude that C contains no subintervals!

Uncountability There exists a bijection between C and the original interval [0, 1]! (This issue is of

limited interest to us, though you’ve likely encountered the notion elsewhere.)

Self-similarity Since C

n+1

consists of two copies of C

, each shrunk by a factor of

and one shifted

to the right, we abuse notation slightly to write

n+1

∪





‘Taking limits,’ Cantor’s set is seen to comprise two shrunken copies of itself:

C =

C ∪



C +



In particular, its fractal dimension is D =

log 2

log 3

≈ 0.631.

The Cantor set has many generalizations:

• Removing different fractions of every interval at each stage produces sets with other fractal

dimensions. For instance, removing the 2

and 4

ﬁfths results in D =

log 3

log 5

≈ 0.683.

• Higher-dimensional analogues include the Sierpi

nski triangle (D =

log 3

log 2

≈ 1.585) and carpet

(Example 5.10.3, D =

log 8

log 3

≈ 1.893), and the Menger sponge (D =

log 20

log 3

≈ 2.727).

Henry Smith discovered this set in 1874 while investigating integrability (the ‘length’ of a set was later formalized using

measure theory). Cantor’s 1883 description focused on topological properties, with self-similarity being less of a concern.

Example 5.5 (The Koch Curve and Snowﬂake). Another generalization of the Cantor set is pro-

duced as the limit of a sequence of curves.

• Let K

be a segment of length 1.

• Replace the middle third of K

with the other two sides of

an equilateral triangle to create K

• Replace the middle third of each segment in K

as before to

create K

• Repeat ad inﬁnitum.

The resulting curve is drawn along with the Koch snowﬂake ob-

tained by arranging three copies around an equilateral triangle.

The relation to the Cantor set should be obvious in the construc-

tion. Indeed if K

= [0, 1], then the intersection of this with the

Koch curve is the Cantor set itself!

The Koch curve is self-similar in that it comprises N = 4 copies

of itself shrunk by a factor of r =

. Its fractal dimension is

therefore

log 4

log 3

≈ 1.2619, between that of a line and an area.

We may also consider the curve’s length. Let s

be the number

of segments in K

, each having length t

, and let ℓ

= t

be the

length of the curve K

. It follows that

= 4

, t

=⇒ ℓ





→ ∞

The Koch curve is inﬁnitely long!

Koch Curve

Koch Snowﬂake

Self-similarity

Exercises 5.1. 1. By removing a constant middle fraction of each interval, construct a fractal analo-

gous to the Cantor set but with dimension

. More generally, if one removes a constant middle

fraction f from each interval, what is the resulting dimension?

2. Prove that the area inside the n

iteration of the construction of the Koch snowﬂake is

√



1 +



1 −







−−−→

n→∞

√

Area(△)

3. Suppose r(t), t ∈ [0, 1] describes a regular (smooth) curve in the plane.

(a) Use the arc-length formula L =

′

(t)

dt together with Riemann sums and the linear

approximation r(t +

) ≈ r(t) +

′

(t) to argue that

L ≈

N−1

∑

k=0





k + 1



−r







(∗)

(b) Parametrizing r such that each segment in ( ∗) has the same length R, prove that L ≈ NR.

(Any regular curve thus has fractal dimension 1 in the sense stated by Mandelbrot (pg. 81))

5.2 Contraction Mappings & Iterated Function Systems

Thus far we have dealt informally with fractals where the whole consists of multiple pieces scaled

by the same factor. In general we can mix up scaling factors. To do this, and to be more rigorous, we

need to borrow some ideas from topology and analysis.

Deﬁnition 5.6. A contraction mapping with scale factor c ∈ [0, 1) is a function S : R

→ R

such that

∀x, y ∈ R

S(x) −S(y)

≤ c

x −y

A contraction mapping moves inputs closer together. It should be clear that every such is continuous

(lim x

= y =⇒ lim S(x

) = S(y)).

Example 5.7. The function S(x) =

x +

is a contraction mapping (on R) with scale factor c =

S(x) −S(y)

x −y

To motivate the key theorem, consider using S to inductively deﬁne a sequence: given any x

, deﬁne

n+1

:= S(x

)

The ﬁrst few terms are

, x

which suggests (geometric series)

+ 2

∑

k=1

2( 3

−1

−3

−n−1

)

1 −3

−1

= 1 +

−1)

This can easily be veriﬁed by induction if you prefer. The striking thing about this sequence is that it

converges to the same limit lim x

= 1 regardless of the initial term x

The example illustrates one of the most powerful and useful theorems in mathematics.

Theorem 5.8 (Banach Fixed Point Theorem). Let S : R → R be a contraction mapping. Then:

1. S has a unique ﬁxed point: some L ∈ R

such that S(L) = L.

2. If x

∈ R is any value, then the sequence deﬁned iteratively by x

n+1

:= S(x

) converges to L.

In fact, as will be crucial momentarily, Banach’s result holds whenever S : H → H is a contraction

mapping on any complete metric space.

The main goal of this section is to use Banach’s result to

generate certain fractals via repeated application of contraction mappings to an initial shape. Our

motivating example already illustrates this. . .

Very loosely, a metric space is a set on which a sensible notion of distance can be deﬁned: on R, for instance, the

distance between two points is d(x, y) =

x −y

. If you’ve done analysis you’ll be familiar with completeness: every Cauchy

sequence converges (in H).

Example (5.4, Cantor Set mk. II). The functions S

, S

: R → R where

(x) =

are contraction mappings with scale factor c =

. More importantly, these functions deﬁne the Cantor

set: at each stage of its construction, we deﬁned

n+1

:= S

) ∪ S

) (∗)

Indeed, the self-similarity of the Cantor set can be expressed in the same manner: C = S

(C).

Amazingly, it barely seems to matter what initial set C

is chosen. Originally we took C

= [0, 1] to

be the unit interval, but we could instead start with the singleton set C

= {0}: iterating (∗) produces





, C





, C





, . . .

The ﬁrst few iterations are drawn in the ﬁrst picture below; it appears as if, in the limit, C

is becoming

the Cantor set. The second picture starts with a very different initial set C

= [0.2, 0.5] ∪ [0.6, 0.7];

iterating this also appears to produce the Cantor set!

It certainly appears as if the Cantor set is generated by the contraction maps S

, S

independently of

the initial data C

. Our main result shows in what sense this is true. Since this requires some heavy

lifting from topology and analysis, we provide only a synopsis.

• A subset K ⊂ R

is compact if it is closed (contains its boundary points) and bounded (K lies

within some ball centered at the origin). For instance, K = [0, 1] is a compact subset of R.

• The set of non-empty compact subsets of R

is a metric space H. This means that the distance

d(X, Y) between X, Y ∈ H may sensibly be deﬁned, though it is a little tricky. . .

The distance function is the Hausdorff metric. Given Y ∈ H, and x ∈ R

, deﬁne d

(x) = inf

y∈Y

x −y

to be the

distance from x to the ‘nearest’ point of Y. Deﬁne d

(y) similarly. The Hausdorff distance between X and Y is then

d(X, Y) := max

(

sup

x∈X

(x), sup

y∈Y

(y)

)

Roughly speaking, ﬁnd x ∈ X which is as far away (d

(x)) as possible from anything in Y, and ﬁnd y ∈ Y similarly; d(X, Y)

is the larger of these distances. Crucially, d(X, Y) = 0 ⇐⇒ X = Y.

• Since H is a metric space, we can discuss convergent sequences (K

) of compact sets

lim

n→∞

= K ⇐⇒ lim

n→∞

d(K

, K) = 0

It also makes sense to speak of Cauchy sequences in H. It may be proved that H is complete:

every Cauchy sequence (K

) ⊆ H converges to some K ∈ H.

• The main result is a corollary of Banach’s result (Theorem 5.8).

Theorem 5.9 (Iterated Function Systems). Let S

, . . . , S

be contraction mappings on R

with scale

factors c

, . . . , c

. Deﬁne

S : H → H by S(K) =

[

i=1

(K)

1. S is a contraction mapping on H with contraction factor c = max(c

, . . . , c

2. S has a unique ﬁxed set F ∈ H given by F = lim

k→∞

) for any non-empty K

∈ H.

Part 1 is not super difﬁcult to prove if you’re willing to work with the deﬁnition of the Hausdorff

metric (try it if you’re comfortable with analysis!). Part 2 is Banach’s theorem.

The upshot is this: repeatedly applying contraction mappings to any non-empty compact set E pro-

duces a compact limit set which is independent of E! We call the limit F for fractal. Such fractals are

sometimes called attractors: being limit-sets, they ‘attract’ data towards themselves.

Examples 5.10. 1. (Example 5.4, mk.III) We revisit the Cantor set one last time.

The contractions S

(x) =

x and S

(x) =

x +

(on R) produce a contraction S : H → H:

S(K) :=



(x), S

(x) : x ∈ K



By Theorem 5.9, if C

⊂ R is non-empty closed and bounded, then C = lim S

). Certainly

all three of our previous choices for C

are such sets: [0, 1], {0} and [0.2, 0.5] ∪[0.6, 0.7].

A nice application of the Theorem allows us to ﬁnd all sorts of interesting points in the Cantor

set. For instance, consider the functions T, U, where

T(x) = S



(x)



and U(y) = S



(y)



These are contractions on R (c =

) whose unique ﬁxed points are t =

and u =

; moreover

(u) = t and S

(t) = u. Now consider the non-empty compact set K = {t, u} ∈ H. Plainly

S(K) =



(t), S

(u), S

(t), S

(u)







⊃ K

(

) =

(

) =

It follows (induction) that K ⊆ lim S

(K) = C: both t =

and u =

lie in the Cantor set!

This seems paradoxical:

does not lie at the end of any deleted interval (all such points have

denominator 3

) but yet the Cantor set contains no intervals. How does

end up in there?!

2. (Example 5.5) The Koch curve arises from four contraction mappings S

: R

→ R

, each with

scale factor c =

Mapping Effect

(x, y) =





Scale

(x, y) =



x −

√

y +

√

x +



Scale

, rotate 60°, translate

(x, y) =



x +

√

y +

√

x −

y +

√



Scale

, rotate −60°, translate

(x, y) =





Scale

, translate

The combined map

S(K) := S

(K) ∪S

(K)

is a contraction on H = {non-empty compact K ⊂ R

Regardless of the initial input K

∈ H, the limit lim S

) is the Koch curve: applied to the

entire curve (drawn), the image of each S

is colored. The picture moreover links to a series of

animated constructions starting with different initial sets K

We can also play a similar game to the previous example to ﬁnd interesting points on the curve.

For instance, the unique ﬁxed point (

146

√

146

) of U = S

◦S

lies on the curve!

3. The Sierpi

nski carpet may be constructed using eight contraction

mappings, each of which scales the whole picture by a (length-scale)

factor of c =

, for a dimension of D =

log 8

log 3

≈ 1.893.

As with the Koch curve, the image links to several alternative con-

structions using different initial sets K

4. This fractal fern is constructed using three contraction mappings:

: Scale by

, rotate 5° clockwise, and translate by (0,

)

: Scale by

, rotate 60° counter-clockwise, and translate by (0,

)

: Scale by

, rotate 60° clockwise, and translate by (0,

)

The linked animation shows the ﬁrst few steps of its contsruction

starting from a single vertical line segment K

Fractal Dimension Revisited

Since Theorem 5.9 permits several different contraction factors, we need a new

approach to fractal dimension. We ask how many disks of a given radius ϵ are

required to cover a set. In the picture, the unit square requires four disks of

radius ε = 0.4. For smaller ε, we plainly need more disks. . .

Deﬁnition 5.11. Let K be a compact subset of R

1. If ε > 0, the closed ε-ball centered at x ∈ K consists of the points at most a distance ε from x:

(x) = {y ∈ R

: d(x, y) ≤ ε}

2. The minimal ε-covering number for K is the smallest number of radius-ϵ balls needed to cover K:

N(K, ε) = min

(

M : ∃x

, . . . , x

∈ K with K ⊆

[

n=1

)

3. The fractal dimension of K is the limit

D = lim

ε→0

log N(K, ε)

log( 1/ε)

Rigorously proving that N and D exist requires a more thorough study of topology, though a simple

example should at least convince us that the deﬁnition is reasonable!

Example 5.12. Let K = [0, 1] be the interval of length 1. It is not hard to check that

ε ≥

⇐⇒ N(K, ε) = 1 and

≤ ε <

⇐⇒ N(K, ε) = 2

etc. More generally, N and ϵ are related via

≤ ϵ <

2(N −1)

The dimension of K (= 1) may therefore be recovered via the squeeze theorem

D = lim

ϵ→0

log N

log( 1/ε)

= 1

Thankfully an easier-to-visualize modiﬁcation is available using boxes.

Theorem 5.13 (Box-counting). Let K ⊂ R

be compact and cover R

by boxes of side length

. Let

(K) be the number of such boxes intersecting K. Then

D = lim

n→∞

log N

(K)

log 2

We ﬁnish with a formula satisﬁed by the dimension of an iterated function system (Theorem 5.9).

Theorem 5.14. Let {S

}

n=1

be an iterated function system with attractor (limiting fractal) F and

where each contraction S

has scale factor c

∈ (0, 1). Under reasonable conditions,

the fractal

dimension is the unique real number D satisfying

∑

n=1

= 1

Examples 5.15. 1. We easily recover Deﬁnition 5.2 when the scale-factors are identical c

= r:

= 1 =⇒ D =

−log M

log r

log M

log( 1/r)

2. The fractal fern (Examples 5.10) is generated by three contraction maps with scale factors

Its dimension is the solution to the equation













= 1 =⇒ D ≈ 1.3267

3. Numerical approximation is usually required to ﬁnd D, though sometimes an exact solution is

possible. For instance, if c

= c

and c

= c

, then





+ 3





= 1

This is quadratic in α =





, whence

2α + 3α

= 1 =⇒ α =

=⇒ D = log

3 ≈ 1.584

Other methods of creating fractals

The contraction mapping approach is one of many ways to cre-

ate fractals. Two other famous examples are the logistic map

(related to numerical approximations to non-linear differential

equations) and the Mandelbrot set (pictured).

The Mandelbrot set arises from a construction in the complex

plane. For a given c ∈ C, we iterate the function

(z) = z

+ c

If f ( f ( f (··· f (c) ···))) remains bounded, no matter how many

times f is applied, then c lies in the Mandelbrot set.

Much better pictures and trippy videos can be found online. . .

−1

−i

Roughly: the outputs of each S

meet only at boundary points; the ‘pieces’ of the fractal cannot overlap too much.

Exercises 5.2. 1. Let S

(x) =

x and S

(x) =

x +

be the contraction mappings deﬁning the

Cantor set and suppose x, y, z ∈ R satisfy

y = S

(x), z = S

(y), x = S

(z)

Show that x, y, z lie in the Cantor set, and ﬁnd their values.

2. (a) As in Example 5.7, illustrate Banach’s theorem for the contraction S(x) =

x + 5.

(b) Repeat part (a) for any linear polynomial S(x) = cx + d where

< 1.

3. Verify the claim in Example 5.10.2 that the point (

146

√

146

) lies on the Koch curve.

4. The construction of a Cantor-type set starts by removing the open intervals (0.1, 0.2) and

(0.6, 0.8) from the unit interval.

(a) Sketch the ﬁrst three iterations of this fractal.

(b) This construction may be described using three contraction mappings; what are they?

package to estimate its value.

5. A variation on the Koch curve is constructed using ﬁve contraction mappings. Each scales the

whole picture by a factor c, then rotates counter-clockwise, before ﬁnally translating.

map scale rotate translate (add (x, y))

0 0

90° (

, 0)

0 (

)

−90° (

)

0 (

, 0)

(a) Suppose your initial set K

is the straight line segment from (0, 0) to (1, 0). Draw the ﬁrst

two iterations of the fractal’s construction.

(b) The dimension of the fractal is the solution D to (

)

+ (

)

+ (

)

+ (

)

+ (

)

= 1.

By observing that





, convert to a quadratic equation in the variable α = (

)

Hence compute the dimension of the fractal.



log 4

log 3



. Explain (infor-

mally) what this means.

6. Verify the details of Example 5.12, including the computation of the limit.

7. Given constants 0 ≤ c

, . . . , c

< 1, use calculus to prove that the function f (x) =

∑

is strictly

decreasing. Hence conclude that the value D in Theorem 5.14 exists and is unique.

8. (If you’ve done analysis) Let S : R → R be a contraction mapping with scale factor c, suppose

∈ R is given, and deﬁne x

n+1

:= S(x

) inductively. Prove:

∀k, n ≥ 0,

n+k

− x

1 − c

− x

Conclude that the sequence (x

) is Cauchy. Hence prove Banach’s Theorem (5.8).