History of Solving Polynomial Equations

MATH2001 Development of Mathematical Ideas
29 March 2012
Dr. Tuen Wai Ng, HKU
What is a polynomial ?
$a_n x^n + \cdots + a_1 x + a_0$
This modern notation was more or less developed by René Descartes
(1596-1650) in his book La Géométrie [Geometry] (1637).
Before the 17th century, mathematicians usually did not use any
particular notation.
Before Descartes, François Viète (1540-1603) had already developed
the basic idea of introducing arbitrary parameters into an equation and
distinguishing these from the equation’s variables.
François Viète (1540-1603)
René Descartes (1596-1650)
Viète used consonants (B, C, D, . . .) to denote parameters and vowels
(A, E, I, . . .) to denote variables.
Certainly, letters had been used before Viète, but not in actual
computations; one letter would be used for a certain quantity, another
for its square, and so forth.
Let us give an idea of the notation used in Viète’s book Zététiques
[Introduction to the Analytic Art] (1591): the expression
$$\frac{F \cdot H + F \cdot B}{D + F} = E$$
is written
$$\frac{F \text{ in } H + F \text{ in } B}{D + F} \ \text{æquabitur } E.$$
Viète’s notation for powers of the unknown is very heavy: he writes “A
quadratum” for $A^2$, “A cubus” for $A^3$, “A quadrato-quadratum” for $A^4$,
etc., and “A potestas,” “A gradum” for $A^m$, $A^n$.
His notation was only partly symbolic. For example, an equation in the
unknown A, such as $A^3 + 3B^2 A = 2C^3$, would be expressed by Viète as
A cubus + B plano 3 in A aequari C solido 2.
Note that Viète required “homogeneity” in algebraic expressions: all
terms had to be of the same degree (for example, all terms above are of the
third degree).
The requirement of homogeneity goes back to Greek antiquity, where
geometry reigned supreme. To the Greek way of thinking, the product
$ab$ (say) denoted the area of a rectangle with sides a and b; similarly, $abc$
denoted the volume of a rectangular box with sides a, b and c. An expression
such as $ab + c$ had no meaning since one could not add length to area.
This condition of homogeneity was definitively abandoned only around
the time of Descartes.
Most of the shortcomings of Viète’s notations were overcome by
Descartes in his important book La Géométrie [Geometry], in which he
expounded the basic elements of analytic geometry.
Before Descartes, $3x^2$ was written as $3\overset{2}{\smile}$ by Rafael Bombelli in 1572,
whereas Simon Stevin wrote 3③ + 5② − 4① for $3x^3 + 5x^2 - 4x$ in 1585.
The exponential notation $x^2$, $x^3$, etc., came with Descartes, whose
formulas are actually written in a notation very close to our own. For
example, he used x, y, z, . . . for variables and a, b, c, . . . for parameters.
Most importantly, he introduced an “algebra of segments.” That is, for
any two line segments with lengths a and b, he constructed line segments
with lengths a + b, a − b, a × b, and a/b.
Thus homogeneity of algebraic expressions was no longer needed. For
example, $ab + c$ was now a legitimate expression, namely a line segment.
Algebra no longer needed a geometric interpretation and had become
considerably more abstract. It was well on its way to playing an important
role in mathematics.
Viète’s and Descartes’s notations and ideas proved indispensable in the
crucial developments of the seventeenth century: in analytic geometry,
calculus, and mathematized science.
Remarks on other notations.
The signs + and − were already in use around 1480 (+ was apparently
a deformation of the symbol &), and by the beginning of the 17th century
they were in general use.
Multiplication was written as “M” by Michael Stifel (1545), and as “in” by
Viète (1591); our current notation dates back to William Oughtred (1631)
for the symbol ×, and to Wilhelm Leibniz (1698) for the dot.
The symbol =, used by Robert Recorde (1557), came to replace the
symbol used by Descartes, an α written backward, toward the end of the
17th century, thanks to Leibniz.
Albert Girard (1595-1632) introduced the notation $\sqrt[3]{\phantom{x}}$, which he
substituted for a circled $\tfrac{1}{3}$; he also introduced the abbreviations
for sine and tangent, and used the symbols <, > like Harriot.
The symbol $\sum$ was introduced by Leonhard Euler (1707-1783).
These notations passed into general usage only during the 19th century.
What do we mean by solving an equation ?
Example 1. Solve the equation $x^2 = 1$.
$$\begin{aligned}
x^2 &= 1 \\
x^2 - 1 &= 0 \\
(x - 1)(x + 1) &= 0 \\
x &= 1 \text{ or } x = -1
\end{aligned}$$
• Need to check that in fact $(1)^2 = 1$ and $(-1)^2 = 1$.
Exercise. Solve the equation
$$\sqrt{x} + \sqrt{x - a} = 2$$
where a is a positive real number.
What do we mean by solving a polynomial equation ?
Meaning I:
Solving polynomial equations: finding numbers that make the
polynomial take on the value zero when they replace the variable.
• We have discovered that x, which is something we didn’t know, turns out
to be 1 or −1.
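Meaning I can be checked mechanically. The following is a minimal sketch (the helper name `evaluate` is our own, not part of the lecture): it replaces the variable by candidate numbers and keeps those for which the polynomial takes the value zero.

```python
def evaluate(coeffs, x):
    """Evaluate a_n x^n + ... + a_1 x + a_0 at x by Horner's rule.

    coeffs lists the coefficients from a_n down to a_0.
    """
    value = 0
    for a in coeffs:
        value = value * x + a
    return value


# x^2 - 1 has coefficients [1, 0, -1]
p = [1, 0, -1]
solutions = [x for x in range(-3, 4) if evaluate(p, x) == 0]
print(solutions)  # [-1, 1]
```

Searching a finite list of candidates only works when the solutions are known to lie in that list; it illustrates what "solution" means, not how to find one in general.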
Example 2. Solve the equation $x^2 = 5$.
$$\begin{aligned}
x^2 &= 5 \\
x^2 - 5 &= 0 \\
(x - \sqrt{5})(x + \sqrt{5}) &= 0 \\
x &= \sqrt{5} \text{ or } -\sqrt{5}
\end{aligned}$$
• But what is $\sqrt{5}$ ? Well, $\sqrt{5}$ is the positive real number that squares to 5.
• We have “learned” that the positive solution to the equation $x^2 = 5$ is
the positive real number that squares to 5 !!!
• So there is a sense of circularity in what we have done here.
• The same thing happens when we say that $i$ is a solution of $x^2 = -1$.
What is “solved” when we solve these equations ?
• The equations $x^2 = 5$ and $x^2 = -1$ draw attention to an inadequacy
in a certain number system (it does not contain a solution to the equation).
• One is therefore driven to extend the number system by introducing, or
‘adjoining’, a solution.
• Sometimes, the extended system has the good algebraic properties of the
original one, e.g. addition and multiplication can be defined in a natural
way.
• These extended number systems (e.g. $\mathbb{Q}(\sqrt{5}) = \{a + b\sqrt{5} \mid a, b \in \mathbb{Q}\}$ or
$\mathbb{Q}(i)$) have the added advantage that more equations can be solved.
Consider the equation
$x^2 = x + 1$.
• By completing the square, or by applying the formula, we know that the
solutions are $\frac{1+\sqrt{5}}{2}$ or $\frac{1-\sqrt{5}}{2}$.
• It is certainly not true by definition that $\frac{1+\sqrt{5}}{2}$ is a solution of the
equation.
• What we have done is to take it for granted that we can solve the equation
$x^2 = 5$ (and similar ones) and to use this interesting ability to solve an
equation which is not of such a simple form.
• When we solve the quadratic, we are actually showing that the problem
can be reduced to solving a particularly simple quadratic $x^2 = c$.
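The idea of adjoining a solution of x² = 5 and then calculating with it can be tried out directly. The sketch below is only an illustration: the class name `QSqrt5` is our own, and an element a + b√5 is stored as a pair of rational numbers. Multiplication uses nothing about √5 except that its square is 5, and that is enough to verify exactly that (1 ± √5)/2 satisfy x² = x + 1.

```python
from fractions import Fraction


class QSqrt5:
    """Element a + b*sqrt(5) of Q(sqrt(5)), with a and b rational."""

    def __init__(self, a, b=0):
        self.a = Fraction(a)
        self.b = Fraction(b)

    def __add__(self, other):
        return QSqrt5(self.a + other.a, self.b + other.b)

    def __mul__(self, other):
        # (a + b*s)(c + d*s) = (ac + 5bd) + (ad + bc)*s, using s*s = 5
        return QSqrt5(self.a * other.a + 5 * self.b * other.b,
                      self.a * other.b + self.b * other.a)

    def __eq__(self, other):
        return self.a == other.a and self.b == other.b

    def __repr__(self):
        return f"{self.a} + {self.b}*sqrt(5)"


one = QSqrt5(1)
phi = QSqrt5(Fraction(1, 2), Fraction(1, 2))   # (1 + sqrt(5)) / 2
psi = QSqrt5(Fraction(1, 2), Fraction(-1, 2))  # (1 - sqrt(5)) / 2

print(phi * phi == phi + one)  # True: phi solves x^2 = x + 1
print(psi * psi == psi + one)  # True: so does psi
```

No decimal approximation of √5 is used anywhere, which reflects the point made above: the extended system keeps the algebraic properties of the rationals.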
What do we mean by solving a polynomial equation ?
Meaning II:
Suppose we can solve the equation $x^n = c$ (i.e. take roots). Then try to
express the roots of a degree $n$ polynomial using only the usual algebraic
operations (addition, subtraction, multiplication, division) and the
extraction of roots.
• In this sense, one can solve any polynomial equation of degree 2, 3 or 4, and we
will see later that this is in general impossible for polynomials of degree 5
or above.
• The Babylonians (about 2000 B.C.) knew how to solve specific quadratic
equations.
Babylonian problem text on tablet YBC 4652
In fact, many Babylonian clay tablets have been preserved with lists of
mathematical problems on them.
• The equations were given in the form of “word problems.” Here is a
typical example and its solution:
I have added the area and two-thirds of the side of my square and it is
0;35 [35/60 in sexagesimal notation]. What is the side of my square?
• In modern notation the problem is to solve the equation $x^2 + (2/3)x = 35/60$.
The solution given by the Babylonians is:
You take 1, the coefficient. Two-thirds of 1 is 0;40. Half of this, 0;20,
you multiply by 0;20 and get [the result] 0;6,40. Add 0;6,40 to 0;35 and
[the result] 0;41,40 has 0;50 as its square root. The 0;20, which you
have multiplied by itself, you subtract from 0;50, and 0;30 is [the side
of] the square.
The instructions for finding the solution can be expressed in modern notation as
$$x = \sqrt{[(0;40)/2]^2 + 0;35} - (0;40)/2 = \sqrt{0;6,40 + 0;35} - 0;20 = \sqrt{0;41,40} - 0;20 = 0;50 - 0;20 = 0;30.$$
• These instructions amount to the use of the formula $x = \sqrt{(a/2)^2 + b} - a/2$
to solve the equation $x^2 + ax = b$ !!!
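The steps on the tablet can be replayed with exact fractions. This is a sketch under our own naming (`sexagesimal`, `exact_sqrt` and `babylonian_side` are hypothetical helpers, not historical terms), and it assumes, as in the tablet's problem, that the quantity under the square root is a perfect square.

```python
from fractions import Fraction
from math import isqrt


def sexagesimal(*digits):
    """Value of the base-60 fraction 0;d1,d2,... as an exact rational."""
    return sum(Fraction(d, 60 ** (i + 1)) for i, d in enumerate(digits))


def exact_sqrt(q):
    """Square root of a rational that is a perfect square."""
    n, d = q.numerator, q.denominator
    rn, rd = isqrt(n), isqrt(d)
    if rn * rn != n or rd * rd != d:
        raise ValueError("not a perfect square")
    return Fraction(rn, rd)


def babylonian_side(a, b):
    """Follow the tablet's recipe for x^2 + a*x = b."""
    half = a / 2                     # half of the coefficient
    square = half * half             # multiplied by itself
    total = square + b               # added to the given number
    return exact_sqrt(total) - half  # root taken, then the half subtracted


a = sexagesimal(40)  # 0;40, i.e. two-thirds
b = sexagesimal(35)  # 0;35
x = babylonian_side(a, b)
print(x == sexagesimal(30))  # True: the tablet's answer 0;30
print(x * x + a * x == b)    # True: it does solve the equation
```

The intermediate values agree with the text: `sexagesimal(6, 40)` equals 1/9, the square of 0;20, and `sexagesimal(41, 40)` equals 25/36, whose root is 0;50.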
The following points about Babylonian algebra are important to note:
(i) There was no algebraic notation. All problems and solutions were verbal.
(ii) The problems led to equations with numerical coefficients. In particular,
there was no such thing as a general quadratic equation, $ax^2 + bx + c = 0$,
with a, b, and c arbitrary parameters.
(iii) The solutions were prescriptive: do such and such and you will arrive
at the answer and there was no justification of the procedures. But the
accumulation of example after example of the same type of problem
indicates the existence of some form of justification of Babylonian
mathematical procedures.
(iv) The problems were chosen to yield only positive rational numbers as
solutions. Moreover, only one root was given as a solution of a quadratic
equation. Zero, negative numbers, and irrational numbers were not part
of the Babylonian number system.
(v) The problems were often phrased in geometric language, but they were
not problems in geometry. Nor were they of practical use; they were likely
intended for the training of students. Note, for example, the addition of
the area to 2/3 of the side of a square in the above problem.
(vi) They were also able to solve systems of equations that lead to quadratic
equations, for example $x + y = a$ and $x^2 + y^2 = b$, by methods similar to ours.
[Please read p.205-206 of Chapter 5]
• The Chinese (about 200 BC) and the Indians (about 600 BC) advanced
beyond the Babylonians. For example, they allowed negative coefficients in
their equations (though not negative roots), and admitted two roots for a
quadratic equation.
• They also described procedures for manipulating equations, but had no
notation for, nor justification of, their solutions.
• From The Nine Chapters on the Mathematical Art (Jiuzhang Suanshu)
(100 BC-50 AD), we know the Chinese had methods for approximating
roots of polynomial equations of any degree, and solved systems of linear
equations using “matrices” (rectangular arrays of numbers) well before such
techniques were known in Western Europe.
• A formula for solving quadratic equations was mentioned
in the Bakshali Manuscript written in India between 200 BC and 400 AD.
• The mathematics of the ancient Greeks is very strong in geometry and
number theory but rather weak in algebra.
• Euclid’s Elements (around 300 BC) contains several parts that can be interpreted
as algebraic. These are geometric propositions that, if translated into
algebraic language, yield algebraic results: laws of algebra as well as
solutions of quadratic equations. This work is known as geometric algebra.
For example, Proposition II.4 in the Elements states that
“If a straight line be cut at random, the square on the whole is equal
to the square on the two parts and twice the rectangle contained by the
parts.”
• If a and b denote the parts into which the straight line is cut, the
proposition can be stated algebraically as $(a + b)^2 = a^2 + 2ab + b^2$.
Proposition II.11 states:
“To cut a given straight line so that the rectangle contained by the
whole and one of the segments is equal to the square on the remaining
segment.”
• This asks, in algebraic language, to solve the equation $a(a - x) = x^2$.
• Note that for Greek algebra, homogeneity in algebraic expressions is a
strict requirement; that is, all terms in such expressions must be of the
same degree.
A much more significant Greek algebraic work is Diophantus’
Arithmetica (around 250 AD). Although essentially a book on number
theory, it contains solutions of equations in integers or rational numbers.
Arithmetica introduced a partial algebraic notation - a most important
achievement:
ζ denoted an unknown, Φ negation, ίσ equality, ∆σ the square of the
unknown, Kσ its cube, and M the absence of the unknown (what we would
write as $x^0$).
For example, $x^3 - 2x^2 + 10x - 1 = 5$ would be written as
Kσ α ζ ι Φ ∆σ β M α ίσ M ε
• Note that numbers were denoted by letters, e.g., α stood for 1 and ε for
5; moreover, there was no notation for addition, thus all terms with positive
coefficients were written first, followed by those with negative coefficients.
Diophantus made other remarkable advances in algebra, namely:
(i) He gave two basic rules for working with algebraic expressions: the
transfer of a term from one side of an equation to the other, and the
elimination of like terms from the two sides of an equation.
(ii) He defined negative powers of an unknown and enunciated the law of
exponents, $x^m x^n = x^{m+n}$, for $-6 \le m, n, m + n \le 6$.
(iii) He stated several rules for operating with negative coefficients,
for example: “deficiency multiplied by deficiency yields availability”
((−a)(−b) = ab).
(iv) He did away with such staples of the classical Greek tradition as
(a) giving a geometric interpretation of algebraic expressions,
(b) restricting the product of terms to degree at most three, and
(c) requiring homogeneity in the terms of an algebraic expression.
• During the Dark Ages, Diophantus’s work was forgotten and like many
other mathematical treatises from the classical period, Arithmetica survived
through the Arab tradition.
• However, only six of the original thirteen books of which Arithmetica
consisted have survived.
Islamic mathematicians attained important algebraic accomplishments
between 900 AD and 1500 AD. For example, Muhammad ibn-Musa al-Khwarizmi
(around 780-850 AD) was considered by some “the Euclid of
algebra” because he systematized the subject (as it then existed) and made
it into an independent field of study.
Muhammad al-Khwarizmi (around 780-850 AD)
He did this in his book al-jabr w al-muqabalah.
“Al-jabr” (from which stems our word “algebra”) denotes the moving of
a negative term of an equation to the other side so as to make it positive,
and “al-muqabalah” refers to cancelling equal (positive) terms on both sides
of an equation.
Al-Khwarizmi (from whose name the term “algorithm” is derived) applied
these rules to solve quadratic equations.
He also classified quadratic equations into five types: $ax^2 = bx$, $ax^2 = b$,
$ax^2 + bx = c$, $ax^2 + c = bx$, and $ax^2 = bx + c$.
This classification was necessary as al-Khwarizmi did not admit negative
coefficients or zero. He also had essentially no notation, so that his problems
and solutions were expressed rhetorically.
The following is an example of one of al-Khwarizmi’s problems with
solution:
“What must be the square, which when increased by ten of its roots
amounts to thirty-nine?” (i.e., solve $x^2 + 10x = 39$).
Solution: “You halve the number of roots [the coefficient of x], which in
the present instance yields five. This you multiply by itself; the product is
twenty-five. Add this to thirty nine; the sum is sixty-four. Now take the
root of this, which is eight, and subtract from it half the number of the
roots, which is five; the remainder is three. This is the root of the square
which you sought.”
• Symbolically, the prescription is: $\sqrt{[(1/2) \times 10]^2 + 39} - (1/2) \times 10$.
Al-Khwarizmi gave a geometric justification of his solution procedures:
Construct the gnomon as in Fig.1, and “complete” it to the square in Fig.2
by the addition of the square of side 5.
The resulting square has side length $x + 5$. But it also has side length 8, since
its area is $x^2 + 10x + 5^2 = 39 + 25 = 64$. Hence $x = 3$.
Fig. 1
Fig. 2
The Babylonians were solving quadratic equations by about 2000 BC,
using essentially the equivalent of the quadratic formula. A natural question
is therefore whether cubic equations could be solved using similar formulas.
At least another three thousand years would pass before the answer would
be known.
Girolamo Cardano (1501-1576)
• Based on the work of Scipione del Ferro and Nicolo Tartaglia, Cardano
published a formula for solving cubic equations in his book
Ars Magna [The Great Art] (1545).
Tartaglia had passed his method to Cardano, who promised that he would
not publish it. Cardano published it nevertheless once he discovered that
Scipione del Ferro had found the formula before Tartaglia himself.
What came to be known as Cardano’s formula for the solution of the
cubic $x^3 = ax + b$ was given by
$$x = \sqrt[3]{b/2 + \sqrt{(b/2)^2 - (a/3)^3}} + \sqrt[3]{b/2 - \sqrt{(b/2)^2 - (a/3)^3}}.$$
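The formula can be tried numerically. The sketch below is our own illustration, not Cardano's rhetorical procedure: the function names are hypothetical, it uses floating-point arithmetic, and it covers only the case in which the number under the square root is non-negative.

```python
import math


def real_cbrt(t):
    """Real cube root, valid for negative t as well."""
    return math.copysign(abs(t) ** (1.0 / 3.0), t)


def cardano_root(a, b):
    """One real root of x^3 = a*x + b, assuming (b/2)^2 >= (a/3)^3."""
    disc = (b / 2) ** 2 - (a / 3) ** 3
    if disc < 0:
        raise ValueError("the square root of a negative number is needed")
    s = math.sqrt(disc)
    return real_cbrt(b / 2 + s) + real_cbrt(b / 2 - s)


# The cubic x^3 = 6x + 40 has the root 4, since 64 = 24 + 40.
x = cardano_root(6, 40)
print(math.isclose(x, 4.0))
```

In this example the two cube roots are 2 + √2 and 2 − √2, so the irrational parts cancel in the sum.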
• Comments on Cardano’s book:
(i) Cardano used no symbols, so his “formula” was given rhetorically (and
took up close to half a page). Moreover, the equations he solved all had
numerical coefficients.
(ii) He was usually satisfied with finding a single root of a cubic. In fact, if
a proper choice is made of the cube roots involved, then all three roots
of the cubic can be determined from his formula.
(iii) Negative numbers are found occasionally in his work, but he mistrusted
them, calling them “fictitious.” The coefficients and roots of the cubics
he considered were positive numbers (but he admitted irrationals), so
that he viewed (say) $x^3 = ax + b$ and $x^3 + ax = b$ as distinct, and devoted
a chapter to the solution of each (compare al-Khwarizmi’s classification
of quadratics).
(iv) He gave geometric justifications of his solution procedures for the cubic.
[We shall study it in Section 5.2]
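Comment (ii) can be illustrated with complex arithmetic, which was not available to Cardano. In this sketch (the function name and the use of complex numbers are our assumptions) a root is written as x = u + v with 3uv = a; each of the three complex cube roots u then gives one root of the cubic.

```python
import cmath


def cardano_all_roots(a, b):
    """All three roots of x^3 = a*x + b, computed in complex arithmetic."""
    s = cmath.sqrt((b / 2) ** 2 - (a / 3) ** 3)
    # Take the radicand of larger modulus so that u is not zero.
    u_cubed = max(b / 2 + s, b / 2 - s, key=abs)
    if u_cubed == 0:
        return [0j, 0j, 0j]               # only possible when a = b = 0
    u = u_cubed ** (1 / 3)                # one complex cube root
    omega = cmath.exp(2j * cmath.pi / 3)  # primitive cube root of unity
    roots = []
    for k in range(3):
        uk = u * omega ** k               # the three choices of cube root
        roots.append(uk + a / (3 * uk))   # partner v chosen so that 3uv = a
    return roots


# x^3 = 7x + 6 has the roots 3, -1 and -2.
for r in cardano_all_roots(7, 6):
    print(round(r.real, 9), round(abs(r.imag), 9))
```

For x³ = 7x + 6 the number under the square root is negative, so the formula passes through complex numbers even though all three roots are real.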
• Lodovico Ferrari, a student of Cardano, discovered a formula for solving
quartic equations in 1540 (published later in Ars Magna).
The formulae for the cubic and quartic are complicated, and the methods
to derive them seem ad hoc and not memorable.
The first attempt to unify the solutions of quadratic, cubic and quartic
equations dates back at least to Lagrange’s work in Réflexions sur la
résolution algébrique des équations [Reflections on the algebraic solution
of equations] (1770/1771).
Lagrange also believed (but could not prove) that the general equation
of degree five was not solvable by radicals, meaning that there could be
no formula for the roots that involved only algebraic operations [addition,
subtraction, multiplication, division, raising to a natural power] on the
coefficients of the equation together with radicals of these coefficients.
Lagrange’s analysis characterized the general solutions of the cubic and
quartic cases in terms of permutations of the roots, laying a foundation for
the independent demonstrations by Abel and Galois of the impossibility of
solutions by radicals for general fifth degree or higher equations.
Before Abel and Galois, the Italian Paolo Ruffini (1765-1822) published
a lengthy treatise, Teoria Generale delle Equazioni [General Theory of
Equations], in 1799, in which he claimed to have a proof that the general
equation of degree five was not solvable by radicals.
Due to the less than clear exposition, his arguments were received with
much skepticism (later on a significant gap was found), and were in general
not accepted by the mathematical community at that time.
In 1824, the Norwegian mathematician Niels Henrik Abel (1802-1829)
gave a different proof, published in the first issue of Journal für die Reine
und Angewandte Mathematik in 1826 [1, vol. 1, pp. 66-94]. The paper
contained the following
Abel’s Theorem (1824). The generic algebraic equation of degree five
is not solvable by radicals.
There is some evidence that Abel had similar results for higher-degree
equations of prime degree, as well as results on the roots of equations that
are solvable by radicals.
Possibly only his untimely death at the age of 26 prevented him from
being the one to give a complete solution to the problem of which algebraic
equations are solvable by radicals.
Abel’s Proof
Abel’s idea was that if some finite sequence of rational operations and
root extractions applied to the coefficients produces a root of the equation
$$x^5 + ax^4 + bx^3 - cx^2 + dx + e = 0,$$
the final result must be expressible in the form
$$x = p + R^{1/m} + p_2 R^{2/m} + \cdots + p_{m-1} R^{(m-1)/m},$$
where $p, p_2, \ldots, p_{m-1}$, and $R$ are also formed by rational operations
and root extractions applied to the coefficients, $m$ is a prime number,
and $R^{1/m}$ is not expressible as a rational function of the coefficients
$a, b, c, d, e, p, p_2, \ldots, p_{m-1}$.
By straightforward reasoning on a system of linear equations for the
coefficients $p_j$, he was able to show that $R$ is a symmetric function of the
roots, and hence that $R^{1/m}$ must assume exactly $m$ different values as the
roots are permuted.
Moreover, since there are 5! = 120 permutations of the roots and m is
a prime, it follows that m = 2 or m = 5, the case m = 3 having been ruled
out by Cauchy.
The hypothesis that m = 5 led to a certain equation in which the left-hand
side assumed only five values while the right-hand side assumed 120 values
as the roots were permuted.
Then the hypothesis m = 2 led to a similar equation in which one side
assumed 120 values and the other only 10.
Abel concluded that the hypothesis that there exists a formula for solving
the equation was incorrect.
P. Pesic, Abel’s proof. An essay on the sources and meaning of
mathematical unsolvability. MIT Press, Cambridge, MA, 2003.
C. Houzel, The work of Niels Henrik Abel. The legacy of Niels Henrik
Abel, 21–177, Springer, Berlin, 2004
Geometric proof of Abel’s theorem
V.B. Alekseev, Abel’s theorem in problems and solutions. Based on
the lectures of Professor V. I. Arnold. With a preface and an appendix
by Arnold and an appendix by A. Khovanskii. Kluwer Academic Publishers,
Dordrecht, 2004.
D. Fuchs and S. Tabachnikov, Equation of degree five, Mathematical
omnibus. Thirty lectures on classic mathematics, 79–92, American
Mathematical Society, Providence, RI, 2007.
The Abel Prize
The Niels Henrik Abel Memorial Fund was established on 1 January
2002, to award the Abel Prize for outstanding scientific work in the field of
mathematics.
The prize amount is 6 million NOK (about 750,000 Euro) and was
awarded for the first time on 3 June 2003.
Abel Prize Laureates:
Jean-Pierre Serre (2003), Sir Michael Francis Atiyah and Isadore M.
Singer (2004), Peter D. Lax (2005), Lennart Carleson (2006), Srinivasa S.
R. Varadhan (2007), John G. Thompson and Jacques Tits (2008), Mikhail
L. Gromov (2009), John Tate (2010), John Milnor (2011), Endre Szemerédi
(2012)
Solution of the general quintic by elliptic integrals.
• In 1844, Ferdinand Eisenstein showed that the general quintic equation
could be solved in terms of a function $\chi(\lambda)$ that satisfies the special quintic
equation
$$\big(\chi(\lambda)\big)^5 + \chi(\lambda) = \lambda.$$
This function is in a sense an analog of root extraction, since the square
root function $\varphi$ and the cube root function $\psi$ satisfy the equations
$$\big(\varphi(\lambda)\big)^2 = \lambda, \qquad \big(\psi(\lambda)\big)^3 = \lambda.$$
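For real λ the function χ can at least be evaluated numerically, much as one evaluates a square root. The sketch below is not Eisenstein's method: it is plain bisection, the name `chi` just follows the symbol χ in the text, and it relies only on the fact that t⁵ + t is strictly increasing, so that there is exactly one real solution.

```python
def chi(lam):
    """Real solution t of t^5 + t = lam, found by bisection."""
    # |t^5 + t| >= |t|, so the solution lies in [-bound, bound].
    bound = max(1.0, abs(lam))
    lo, hi = -bound, bound
    for _ in range(200):  # each step halves the bracketing interval
        mid = (lo + hi) / 2
        if mid ** 5 + mid < lam:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


print(abs(chi(2) - 1) < 1e-9)   # True, since 1^5 + 1 = 2
print(abs(chi(34) - 2) < 1e-9)  # True, since 2^5 + 2 = 34
```

Treating χ as a known operation in the same way as √ and ∛ is what lets it play the role of an extra "radical" for the quintic.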
• In 1858 Hermite and Kronecker showed (independently) that the quintic
equation could be solved by using an elliptic modular function.
R. B. King, Beyond the quartic equation. Birkhäuser Boston, Inc.,
Boston, MA, 1996.
The honor of giving a complete solution to the problem of which algebraic
equations are solvable by radicals fell to the French mathematician
Évariste Galois (1811-1832).
At the tender age of 18, Galois communicated to the Academy of
Sciences in Paris some of his results on the theory of equations, through one
of its members, Augustin-Louis Cauchy (1789–1857). Shortly thereafter,
Galois learned that a number of his results had actually been obtained by
Abel, before him.
Two years later, he submitted a rewritten version, Mémoire sur les
Conditions de Resolubilité des Equations par Radicaux [Memoir on the
Conditions for Solvability of Equations by Radicals], which gave a complete
solution of the remaining problem of characterizing those equations that are
solvable by radicals.
Galois’s Theorem. A polynomial is solvable by radicals if and only if its
Galois group is solvable.
The importance of this memoir was not recognized properly until 1843,
when Joseph Liouville prepared Galois’s manuscripts for publication and
announced that Galois had indeed solved this age-old problem.
Galois’s work forms one of the beginnings of abstract algebra, namely
the theory of groups and that of fields.