Algebraic Equations and Finite Simple Groups:
What I learned from graduate school at University of Michigan, 1964–1967

 

PRELUDE: After three years of Graduate School in Mathematics at University of Michigan (1964-1967), writing a thesis under the direction of Don Lewis, I left for a postdoctoral at the Institute for Advanced Study. There I studied with Goro Shimura. My first year was extended to two years (1967-1969). Then, I went with James Ax to SUNY at Stony Brook. After receiving tenure and a Sloan Fellowship, I left. That bare bones outline of a beginning career tells little mathematically.

 

It has no hint that the work inspired by my time at UM connected resolutely with the simple group classification – through conversations with John Thompson – and with modular curves – interactions with J.P. Serre. Nor, that problems of Andrzej Schinzel and Harold Davenport, (visitors to UM my second year) in papers with Lewis, were the inspiration. Not even that technical tools came from assiduous use of Grothendieck's fiber product approach to algebraic equations. Yet, fulfilling those connections required – no word better – tutoring from many UM-affiliated faculty during my formative years.

 

This short version of the story relates Davenport's Problem, the steps in its solution, and how the connections above came about. A fuller, referenced version is at UMStory. That shows I never lost my youthful enthusiasm for completing programs of Abel, Galois and Riemann, as recorded in "What Gauss told Riemann about Abel's Theorem."

 

DAVENPORT'S PROBLEM AND FIBER PRODUCTS: When number theorists say almost all primes p, they mean all but finitely many. Davenport sought relations between two polynomials f(x) and g(y) with rational coefficients – where no change of the variable in f gives g ­– having the same ranges on integers mod p for almost all p. He liked this style of question, and often used exponential sums to interpret it.


Changing "almost all" to "infinitely many" and taking g(y)=y, restates the hypothesis of Schur's 1921 conjecture. The conclusion of Schur: f must be composed of linear, cycle and Chebychev polynomials. Richard Brauer was a student of Schur, and the advisor of Don Lewis. When I met him (see below) he asked if I knew he had worked on Schur's conjecture. I hadn't.

 

A variables separated algebraic equation looks like f(x)-g(y)=0. Writing this as f(x)-z=0 and g(y)-z=0 opened the territory. Fiber products of f and g over the z-line allowed me to use groups to draw conclusions. I'll use n for the degree of f.  Visiting Assistant Professors Armand Brumer and Richard Bumby guided my mastering Grothendieck's Tohoku paper and pieces of his EGA. There I learned to go between algebraic equations and group theory.

 

Chuck MacCluer's thesis, under Lewis, showed – for special f  – a geometric statement gave Schur's one-one mapping hypothesis.  Later, by extending the Chebotarev density theorem, I formulated a general context including Davenport and Schur. There, pure group theory translated the number theory.

 

Formulations, however, are not conclusions. Could you invert the direction polynomials to groups, as in Schur's Problem?

 

A rational function not composed of lower degree functions is indecomposable.  I state lightly what I found powerful in practice.

 

1. A rational function's covering group is primitive exactly when the function is indecomposable.

2. A polynomial's covering group always contains an n-cycle.

 

From that I learned, that if f was indecomposable, its covering group was either doubly transitive, or f was in the Schur conclusion. That finished the Schur story and deepened the Davenport story.

 

I now knew Davenport's hypothesis on f and g produced a difference set mod n that encoded how zeros of g(y)-z=0 summed to a zero of f(x)-z. Not only did the distinct permutation representations for f and g  have the same degree, their group representations were identical.

 

I had seen Brumer pepper Jack McLaughlin with group theory questions. When Brumer left for Columbia at the start of my 3rd year, I took his place with McLaughlin from whom I learned the distinction between doubly transitive and primitive. Richard Misera, a fellow grad student working with Donald Higman, saw this interaction and gave me a propitious example, coming from projective linear groups. I applied this, modulo something that I learned very much on my own – R(iemann)'s E(xistence) T(heorem) – to produce polynomial pairs having almost simple groups with special projective linear core. The three propitious points were these:

 

1. Without writing equations, I was able to see the Galois action of the cyclotomic field of n-th roots of 1, acted on the difference set relating f and g. The elements that preserved that difference set, up to translation (so-called multipliers), gave the definition field of the pair (f,g). Further, -1 was never a multiplier, so that definition field was never Q.

 

2. Because the covers given by f and g had genus 0, the only possible degrees for f and g were n=7, 11, 13, 15, 21, 31.

 

3. The cases with infinitely many essential pairs (f,g) modulo mobius action on z, x and y appearing in #2, had degree 7, 13 and 15. Further, in these cases those essential parameters formed a genus 0,  upper half-plane quotient, that wasn't a modular curve.

 

Tom Storer, newly at UM, when I visited it from IAS, worked with me on the last statement of #1. This completed Davenport's Problem over Q for indecomposable polynomials f.  There were no nontrivial examples. It used nothing from the simple group classification. The offshoot of that technique became the Branch Cycle Lemma, far and away the most practical tool by which to relate  geometric covering groups and definition fields. 

 

USING THE CLASSIFICATION AND THE GENUS 0 PROBLEM: I conjectured that the only Davenport pairs possible over any number field were those I'd listed above. Ax started me consulting with Walter Feit, who suggested – modulo the simple group classification – a paper by Kantor would give that. It did. The idea is this.

 

A Theorem of A(schbacher)-O('Nan)-S(cott) classifies primitive groups as arising from constructions with either (almost) simple groups at their core, or the group is affine.  Adding that the group came with an n-cycle allowed me to complete the result from Kantor's paper on my own.

 

William Leveque, with whom I had written a "first thesis" in diophantine approximation, had translated a famous paper of Carl Ludwig Siegel into English. His notes opened my understanding of theta functions, started me using techniques of Riemann, and showed me how to interpret Abel's theorem as an arithmetic statement, a la Andre Weil's thesis. That foray into Siegel's powerful works, and a course on the fundamental group taught by Morton Brown, enabled me to read, and immediately use beyond its contents, Springer's book on Riemann Surfaces. From these I had translation techniques extending the Chebotarev theorem, to apply to many problems.

 

The main trick – even with Siegel and Weil – was recognizing many unsolved problems as about variables separated equations. To these my methods on Davenport's Problem applied immediately. Here is the hardest, my favorite.

 

A Hilbert-Siegel Problem: Let f(x) be a polynomial with Q coefficients. Suppose the set of integers z' for which f(x)-z' is reducible, but has no degree 1 factor, is infinite. This happens automatically if f decomposes, but the only others have degree 5 (and then there are examples).

 

The group translation was to what I called a primitive group with a double-degree representation: That is a group with a doubly transitive representation and an n-cycle, which had another representation of twice that degree.

 

A positive came out of this: Monodromy groups of indecomposable polynomials were very special, and likely classifiable. Excluding finitely many exceptional covering groups, they came from well-known permutation representations of cyclic, dihedral, alternating and symmetric groups. The degree 5 special case in the Hilbert-Siegel Problem showed me how actions by the symmetric group on sets of  integer pairs might occur, but even for the alternating and symmetric group, polynomial monodromy was limited.

 

Walking to lunch with John Thompson one day at University of Florida, I gave him my conviction, listing my data above.

 

His response – immediately he confessed to being "seized" by the problem – was that I shouldn't limit it to polynomial covers. Rather, include  indecomposable rational functions (genus 0 covers). Then, a likely statement would have all composition factors cyclic or  alternating. He proposed we work on that together.


My heart was in algebraic equations. I suggested Bob Guralnick far more appropriate for that problem. Here was the upshot.

 

Peter Mueller produced a definitive classification of the polynomial monodromy, including – a la what happened in Davenport's Problem– a list of the polynomial monodromy that arose over Q. Davenport's Problem had already captured the harder elements of that classification. 

 

The more optimistic conjecture I made for polynomials turned out true even for indecomposable rational functions. This addition to Guralnick-Thompson was Guralnick's work with many co-authors. Here, however, it was not possible to be so precise on most of the exceptional "genus 0 groups" in the manner of the exceptional degrees that came out in Davenport's Problem.

 

UM SEMINARS AND MODULAR CURVES:  Double transitivity of a permutation representation is an easily understood property. Primitivity much less so. I immediately, relayed to Lewis' number theory seminar my early discoveries, like #1 and #2 above, and this:

 

3. If f is a polynomial, then it is indecomposable over the definition field exactly when it is indecomposable over the algebraic closure.

 

Serre's Open Image Theorem is a statement on absolute Galois groups acting on systems of modular curve points. It has two parts: Called, respectively, complex multiplication and GL2. I began my work on moduli space problems by translating the  Schur Problem for rational functions  to Serre's Theorem. The GL2 case showed myriad rational functions violate #3. Later Guralnick-Mueller-Saxl showed there were only sporadic other examples. This, too, used the Davenport Problem method.

 

Bob MacRae was a visiting UM assistant professor who attended the Number Theory seminar. He wrote up two of my thesis sections for publication. These related #1-#3 to other general properties of variables separated equations.  These papers are far more quoted than my two (later) Annals papers. MacRae also arranged for me to talk at U. of Chicago through his advisor Irving Kaplansky. Later Richard Brauer, who attended that talk, presented me with a tenure offer during my 2nd post-doctoral year at IAS. Those two sentences are still full of mysteries to me.

 

I saw how the major Diophantine results on modular curves interpret as a statement on regular realizations of Dihedral groups as Galois groups with  the covering ramification given by involutions.

 

Later I generalized this  to any p-perfect group and conjugacy classes prime to p a direction backwards  to Davenport's Problem. It produced a (Modular) Tower of spaces – rarely of modular curves, but often very modular curve-like – to interpret the problem. No rational points at high levels has now been shown in many cases, a result from various incarnations for modular curves by Demjanenko-Manin, Faltings, Mazur, then Mazur-Merel. Modular Towers completely encodes many unknown aspects  of the Inverse Galois Problem: My version of Shimura Varieties.

 

Lewis arranged that I could go to Bowdoin college NSF-funded summers of eight weeks each on Algebraic Number Theory (1966) and Algebraic Geometry (1967).  Both summers I learned everything put in front of me.  I also learned that I would be regarded as an ignoramus for not having the background prevalent at Harvard, MIT or Princeton at the time.

 

Notes of Brumer, following Gunning, and a seminar at UM with Roger Lyndon and I as alternating speakers, prepared me for many aspects of modular curves. I was ready for Princeton, and for getting the most of my interactions with Shimura.

 

THE SIGNIFICANCE OF DAVENPORT'S PROBLEM: The Schur Conjecture and Davenport's Problem have simple statements using Chow motives (which have attached zeta functions). For Davenport, the statement interpret to a zeta function being trivial.

 

It was Ax's idea to consider attaching a zeta function to any similar Diophantine problem.  Yet, there was no way to compute it or find its properties, until my 1976 Annal's paper introduced Galois Stratifications. This was my replacement for Chow motives, which didn't exist then. Denef and Loeser later showed how to make this zeta attachment canonical, using Chow motives. Still, their proof went through Galois Stratification.

 

So, Davenport's Problem was my foray into mathematical objects studied by others that were in the kernel of the linearization of Diophantine problems using Chow motives. My conclusion: This kernel is often what much practical mathematics is about.

 

Further, much practical mathematics on equations has their variables separated. (Think hyperelliptic curves and slight generalizations to see how prevalent that is.) The resolution of the genus 0 problem is an apt tool to figure where exactly variables separated fits among all two variable algebraic relations.

 

Of course, without an aid to help with group theory, you can't use the method. I later took on one more problem in the Davenport range, for which I needed help from group theorists.  That was a version of Schur's problem over a single finite field. Guralnick and  Jan Saxl joined me on in the 3rd section: Going through every step of the A-O-S classification. Though we didn't complete the affine group case, the results were definitive. That included solving an 1897 conjecture of Dixon.

 

I was not a passive purveyor of Guralnick and Saxl. First, I caught the unusual examples of new Schur covers that were slipping by overly-optimistic group assumptions. Second, I carefully showed how using A-O-S worked. McLauglin might have approved. It resembled how he often laid out the steps that allowed him his seemingly-encyclopedic recall in our two-person seminar.


FINAL UM COMMENTS: This story supports those who believe in the essential connectedness of mathematics. My fuller telling (
UMStory) shows the achievements had influence on the work of many. Yet, official success is another story. Taking a geometric approach to number theory doesn't bode well when a totally number-oriented approach– as in the Fermat's Last Theorem/modular curve connection – has gleaned the most attention.

 

Further, making disparate connections in an era of technical specialists cuts the possibilities of readers sharply. Finally, when I look, say, at a Scientific American, or a New York Review of Books, it comes clear that these magazines are so aware they must avoid serious discussion of mathematics, despite how many areas of science build on it.

 

John Thompson once asked me what Davenport thought of my methods. My response: "He didn't like group theory, or Galois Theory, or even algebraic geometry. I got the impression he hated them." John nodded: "That sounds like him."


Still, it was Lewis' role as the algebraist that brought both Davenport and Schinzel into my world. Many people have said they first heard of me from Shinzel's 1970 International congress talk, that alluded to his problems solved by the monodromy method above.

 

It would have helped if other UM students, even slightly related, interacted with me from the hundreds of talks I've given, and the many conferences I've attended and run. There were over 200 grad students at UM with me.

 

The three others who got PhDs in 1967 were all analysts, one much more famous than anyone who might be reading this. That was "The Unabomber, " a no-show at the going away party Paul Halmos gave us. You can find a picture of me from years related here –  opposite the page with Grothendieck – in Halmos' "I have a photographic Memory." I was standing in front of my Schur Conjecture diagram at the end of my 1968 UM lecture on it.

 

I didn't know about that picture until years later. A brief description of the picture is at fried-HalmosBook.html. Still, either I, or the Schur Conjecture, must have been funny. A New Yorker magazine not long afterwards based a cartoon on it.


I have seen only one person from my graduate years more than once after grad school. That was the topologist Bob Edwards who twice sat in on talks of mine at AMS conferences.