construction

This Talk is Under Construction
Berkeley Math Circle 2001-2002
Kiran Kedlaya

The straightedge and compass operations.

(a): Given two points, we may construct the line through them.
(b): Given two points, we may construct the circle centered at one point passing through the other point.
(c): Given two lines, two circles, or a line and a circle, we may construct their intersection points.

Recall some things you can construct from these steps:

the perpendicular bisector of the segment joining two points;
the angle bisectors of two given lines;
the circle through three given points;
a segment of length $\sqrt{a}$ , given segments of length 1 and ;
the regular hexagon, pentagon and 17-gon (!);

and some things you've probably heard cannot be constructed from these steps:

the trisectors of an arbitrary angle;
a regular heptagon (7-gon) or nonagon (9-gon);
a segment of length $\sqrt[3]{2}$ , given segments of length and .

We call a real number

(straightedge and compass) constructible if, starting from two points at distance 1, we can produce two more points at distance $\vert r\vert$ by a finite sequence of straightedge and compass operations. We call a complex number constructible if its real and imaginary parts are constructible. Then:

the number 1 is constructible;
if and are constructible, then so are , , and ;
if $r \neq 0$ is constructible, so is ;
if is constructible, so are $\pm \sqrt{r}$ ;
a number is constructible if and only if it can be written in terms of rational numbers using only addition, subtraction, multiplication, division and square roots.

In particular, each constructible number is algebraic: it is the root of some polynomial with integer coefficients. A famous theorem of Lindemann states that $\pi$ is transcendental (not algebraic); that's why one cannot ``square the circle''. But there are algebraic numbers that are not constructible, such as $\sqrt[3]{2}$ .

Warning. The next part of the discussion belongs to the subject of abstract algebra, so may involve some ideas you may not have seen before; but I hope the intuition will be clear. (A subset of that is what is called ``linear algebra''; that's pretty much all I'm really using.) Given a finite set of algebraic numbers, the set of complex numbers obtained from and the rational numbers by addition, subtraction, multiplication and division forms what is called a number field. If is a number field, we can always find $z_1, \dots, z_n$ in such that

$\displaystyle K = \{ a_1 z_1 + \cdots + a_n z_n: a_1, \dots, a_n \in \mathbb{Q} \}$

and that representation is not redundant (there's only way to write any element of

this way). Such a set $\{z_1, \dots, z_n\}$ is called a basis of

, and the number

turns out not to depend on the choice of the basis (a fact from linear algebra that I'm not going to prove now). That number is called the degree of the number field. For example, the number field

generated by $\sqrt[3]{2}$ has degree 3, because we can take $z_1 = 1, z_2 = \sqrt[3]{2}, z_3 = \sqrt[3]{4}$ . And if

consists of a single primitive

-th root of unity $e^{2\pi i/n}$ , then the degree of

turns out to be the Euler phi function $\phi(n)$ .

Theorem 1 If

is a finite set of constructible numbers and

is the number field they generate, then the degree of

is a power of 2.

Proof. [Sketch] We prove this by induction on the size of

. It's obvious if

is of size zero, since then $S = \mathbb{Q}$ . Suppose $S = S_1 \cup \{\sqrt{s}\}$ , and assume that the degree of the number field

generated by

is a power of 2 and that $s \in K_1$ . If $\sqrt{s}$ is already in

, then

and we're done. Otherwise, pick a basis $z_1, \dots, z_n$ of

; then I claim that $z_1, \dots, z_n, z_1 \sqrt{s}, \dots, z_n \sqrt{s}$ is a basis of

. Certainly every element of

can be written as

$\displaystyle a_1 z_1 + \cdots + a_n z_n + b_1 z_1 \sqrt{s} + \cdots + b_n z_n \sqrt{s}$

with the

and

all rational. So we only have to check that there aren't two ways to write some number; in fact, we only have to check that there is no way to write 0 besides using all zeroes. If we had a nontrivial way to write zero, we could write $a = a_1 z_1 + \cdots + a_n z_n$ and $b = b_1 z_1 + \cdots + b_n z_n$ , which are both elements of

. Then $a + b \sqrt{s} =0$ ; if

were nonzero, then $\sqrt{s} = -a/b$ would be in

, contradiction. So

and hence

; but since $z_1, \dots, z_n$ is a basis of

, this can only happen if the

and

are all zero. Conclusion: the degree of

is exactly twice that of

, so it's also a power of 2. Why this is only a sketch: You can always arrange for this induction to work at the cost of making

larger. (Given whatever numbers you started with, you have to toss in the ``intermediate'' numbers you needed to construct them.) So you have to check that if the degree of a number field is a power of 2, then the degree of any number field it contains is also a power of 2, which requires a little more linear algebra. $\qedsymbol$

So for example, $\sqrt[3]{2}$ is not constructible, because the number field it generates has degree 3. Also, $e^{2\pi i/7}$ and $e^{2\pi i/9}$ both generate number fields of degree 6, so they're not constructible either; thus the regular 7-gon and 9-gon cannot be constructed with straightedge and compass! It turns out the converse is also true: if an algebraic number generates a number field whose degree is a power of 2, then it is constructible. I won't prove this, but I'll give you an example. A 17-th root of unity $e^{2\pi i/17}$ generates a number field of degree 16, so this converse says that a regular 17-gon is constructible. That is, we can write $e^{2\pi i/17}$ in terms of rational numbers using square roots. We'll do that in the next section. One can do likewise for a 65537-gon, since 65537 is prime and $\phi(65537) = 2^{16}$ ; we won't do this here.

Example: constructing the 17-gon. In case you've never seen the construction of the 17-gon (due to Gauss), it's worth a look, in part because it's a very simple example of a notion from abstract algebra called a ``Galois group''. Here's the idea: let $z = e^{2\pi i/17}$ be a primitive 17th root of unity. Note that is a primitive root modulo 17, so that $1, 3, \dots, 3^{15}$ cover all the nonzero residue classes modulo 17. Now write

$\displaystyle -1 = x_0$	$\displaystyle = z + z^3 + z^{3^2} + \cdots + z^{3^{15}}$
$\displaystyle x_1$	$\displaystyle = z + z^{3^2} + z^{3^4} + \cdots + z^{3^{14}}$
$\displaystyle x_2$	$\displaystyle = z + z^{3^4} + z^{3^8} + z^{3^{12}}$
$\displaystyle x_3$	$\displaystyle = z + z^{3^8}$
$\displaystyle x_4$	$\displaystyle = z.$

Gauss realized that each

can be written as the root of a quadratic polynomial in terms of the previous ones. I'll leave it to you to check my arithmetic:

$\displaystyle x_1(-1-x_1)$	$\displaystyle = -4$
$\displaystyle x_2(x_1-x_2)$	$\displaystyle = -1$
$\displaystyle x_3(x_2-x_3)$	$\displaystyle = \frac{x_1x_2 - x_1 +x_2-3}{2}$
$\displaystyle x_4(x_3-x_4)$	$\displaystyle = 1.$

How do you guess this? I don't know how Gauss did it, but they are predicted by Galois theory. That's the branch of mathematics that tells us that you can't write the roots of a general degree 5 polynomial in terms of the coefficients using addition, subtraction, multiplication, division, and taking

-th roots (the way the quadratic formula does this in degree 2, and analogous formulas are known in degree 3 and 4). Here's how the prediction is made in that language: the Galois group of the number field generated by

is a cyclic group of order 16, generated by the map

that sends $z^{i}$ to $z^{3i}$ . (The point is that this map preserves addition and multiplication, since all it's doing is changing from one 17-th root of unity to another.) The map

fixes only rational numbers; the maps

fix number fields of degree 2, 4, 8, respectively. The image of

under

is precisely

, so the product

is fixed by

and so must be some rational number. Likewise, the image of

under

, so the product

is fixed by

and so must be in the number field generated by

, and so forth.

The marked straightedge operations. In using the marked straightedge to trisect angles, Archimedes assumed an additional axiom: (here is the length marked on the straightedge)

(d): Given a line $\ell$ , a circle , and a point , we can construct all pairs of points and , with on $\ell$ and on , such that the line passes through and the distance is equal to .

But you might as well go a little further, replacing the line $\ell$ by another circle:

(e): Given circles and and a point , we can construct all pairs of points and , with on and on , such that the line passes through and the distance is equal to .

We can define marked straightedge constructible real and complex numbers as you might expect, but the definition will depend on the distance

between the marks on the straightedge. The ``right'' definition is to take

, the same as the distance between the two starting points. (Or if you like, just take one starting point, and use the marked straightedge to produce the second one!)

The ``solid'' operations. The ancient Greeks considered a geometric object to be ``solid'' constructible if if could be obtained using the ordinary straightedge and compass operations plus the ability to draw conic sections. One can phrase this as follows:

(d): Given five points, we may construct the conic section (parabola, hyperbola, ellipse, or degenerate conic) through them.
(e): Given a conic section and a line, circle or another conic, we may construct their intersection points.

Likewise, one can define solid constructible real and complex numbers. It turns out that a complex number

is constructible if and only if:

is algebraic, i.e., for some polynomial with integer coefficients;
for some such , the number field generated by all of the roots of has degree a power of 2 times a power of 3. (It's enough to check this for the polynomial of smallest degree, called the minimal polynomial of .)

Warning: the second condition is stronger than saying that the number field generated by

alone has degree a power of 2 times a power of 3. For example, if

is a ``general'' polynomial of degree 6 with integer coefficients, then the number field generated by

itself will have degree 6 but the number field generated by all of the roots of

will have degree

, which has the bad prime factor 5. For a nice exposition of this, see the article ``Constructions Using a Compass and Twice-Notched Straightedge'' by Arthur Baragar, in the February 2002 American Mathematical Monthly. He also includes a discussion of the marked straightedge case, but a precise characterization of the constructible numbers in that case is still unknown.

The origami operations. The Japanese art of paper folding gives an entirely different approach to constructing geometric figures. Here is one set of axioms for ``origami geometry''; there may be others in the literature. (This one is from an ``origami mathematics'' web site.)

(a): Given two points, we may construct the line through them.
(b): Given two points, we may construct the perpendicular bisector of the segment joining the two points (i.e., we may fold one point onto another).
(c): Given two lines, we may form the angle bisector of either angle between them (i.e, we may fold one line onto another).
(d): Given a line and a point, we may construct the perpendicular to the line through the point.
(e): Given points and a line $\ell$ , we may construct the line $\ell_1$ through such that the reflection of across $\ell_1$ lies on $\ell$ .
(f): Given points and and lines $\ell_1$ and $\ell_2$ , we may construct the line(s) such that the reflection of across lies on $\ell_1$ , and the reflection of across lies on $\ell_2$ .

Challenge problems. Some of these are discussed in the article by Baragar mentioned above, which I again heartily recommend; he also includes many more challenge problems. Warning: the characterizations of constructible numbers involves the notion of a ``Galois group'', which we haven't talked about.

Find a construction of a regular 7-gon using marked straightedge.
You might wonder why I didn't state a version of the marked straightedge axiom (d) with two lines instead of a line and a circle. Prove that actually the resulting construction can already done with straightedge and compass.
Prove that origami constructions (a)-(e) can also be accomplished with straightedge and compass. (The only subtle one is (e).)
Find constructions for trisecting a given angle and doubling a cube using origami. (This will imply that origami construction (f) cannot be accomplished with straightedge and compass.)
Find a regular polygon that cannot be constructed using origami. Even better, find a characterization of origami constructible numbers. (I believe the latter is unsolved!)