LECTURE 1
Chee Yap
NUMERICAL ALGEBRAIC COMPUTATION (I)




Introduction



Overview



Preliminary

· Initial algebraic structures: $\mathbb{Z} \subseteq \mathbb{Q} \subseteq \mathbb{R} \subseteq \mathbb{C}$

· Main object of study: polynomials,


$$A(X) = \sum_{i=0}^{m} a_i X^i, \qquad (a_m \neq 0, \quad a_i \in R_0 \subseteq \mathbb{C}) \qquad (1)$$

· What kind of objects are polynomials? Dual nature!

  1. algebraic objects (members of the polynomial ring $R_0[X]$)
  2. analytic objects (functions $A: \mathbb{C} \to \mathbb{C}$, assuming $R_0 \subseteq \mathbb{C}$)



Main Problem



Some Generalizations

 
· $A(X,Y) = 0$: curves
· $A(X,Y,Z) = B(X,Y,Z) = 0$: surfaces
· Multivariate systems of equations
· Ideal-theoretic setting: the ideal membership problem
· Tarski's theory of elementary geometry and algebra



Two Distinct Approaches



Algebraic Numbers


Sylvester Resultants

· Are $\alpha + \beta$ and $\alpha\beta$ algebraic, for algebraic numbers $\alpha, \beta$?

· Two standard approaches: symmetric functions and resultants. We prefer the latter as more constructive.

· The resultant of $A(X)$ and $B(X)$ (of degrees $m$ and $n$) is the determinant of the $(m+n)$-square matrix:


$$S(A,B) := \begin{bmatrix}
a_m & a_{m-1} & \cdots & a_0 & & & \\
 & a_m & a_{m-1} & \cdots & a_0 & & \\
 & & \ddots & & & \ddots & \\
 & & & a_m & a_{m-1} & \cdots & a_0 \\
b_n & b_{n-1} & \cdots & b_1 & b_0 & & \\
 & b_n & b_{n-1} & \cdots & b_1 & b_0 & \\
 & & \ddots & & & \ddots & \\
 & & & b_n & b_{n-1} & \cdots & b_0
\end{bmatrix} \qquad (2)$$

with $n$ shifted rows of $a$'s and $m$ shifted rows of $b$'s (blank entries are zero).
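· To make the construction concrete, here is a minimal sketch (Python with numpy; the helper name sylvester is ours) that builds $S(A,B)$ from coefficient lists and evaluates $\det S(A,B)$ numerically:

    import numpy as np

    def sylvester(a, b):
        """Sylvester matrix of A, B given as coefficient lists, highest degree first."""
        m, n = len(a) - 1, len(b) - 1          # m = deg A, n = deg B
        S = np.zeros((m + n, m + n))
        for i in range(n):                     # n shifted rows of A's coefficients
            S[i, i:i + m + 1] = a
        for i in range(m):                     # m shifted rows of B's coefficients
            S[n + i, i:i + n + 1] = b
        return S

    # A = (X-1)(X-2) and B = (X-1)(X-3) share the root 1, so res(A,B) = 0:
    print(np.linalg.det(sylvester([1, -3, 2], [1, -4, 3])))   # ~ 0.0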



Properties

· Elimination: let $A, B$ be multivariate polynomials, treated as polynomials in $X$. Then in $\mathrm{res}_X(A,B)$, we have ``eliminated'' the variable $X$ from $A, B$.

· This justifies the name ``resultant'':
$$\mathrm{res}(A,B) = 0 \quad\Longleftrightarrow\quad \deg(\gcd(A,B)) > 0.$$

· PROOF:
$$\deg(\gcd(A,B)) > 0 \quad\Longleftrightarrow\quad GA + HB = 0$$
for some non-zero $G, H$ with $\deg G < n$ and $\deg H < m$. [Just choose $G = B/\gcd(A,B)$ and $H = -A/\gcd(A,B)$.]

But $GA + HB = 0$, writing $G = \sum_{i < n} g_i X^i$ and $H = \sum_{i < m} h_i X^i$, is equivalent to:
$$[g_{n-1}, g_{n-2}, \ldots, g_0, h_{m-1}, \ldots, h_0] \cdot S(A,B) \cdot [X^{m+n-1}, X^{m+n-2}, \ldots, X^{m+1}, X^m, X^{m-1}, \ldots, X, 1]^T = 0.$$
But the last equation is equivalent to $[\bar{g}, \bar{h}] \cdot S(A,B) = 0$ for a non-zero vector $[\bar{g}, \bar{h}]$, i.e., $\det(S(A,B)) = 0$. Q.E.D.



Poisson's Formula for Resultants

· Let
$$A = a \prod_{i=1}^{m} (X - \alpha_i), \qquad B = b \prod_{j=1}^{n} (X - \beta_j).$$

· LEMMA: For $a \in \mathbb{C}$,
(a) $\mathrm{res}(a, B) = a^n$.
(b) $\mathrm{res}(A, B) = (-1)^{mn}\, \mathrm{res}(B, A)$.
(c) $\mathrm{res}((X-a) \cdot A, B) = B(a)\, \mathrm{res}(A, B)$.

· THEOREM:
$$\mathrm{res}(A,B) = a^n \prod_{i=1}^{m} B(\alpha_i) = a^n b^m \prod_{i=1}^{m} \prod_{j=1}^{n} (\alpha_i - \beta_j)$$
PROOF: To get the first equality, apply lemma (c) $m$ times, and lemma (a) once, to $\mathrm{res}(a \prod_i (X - \alpha_i), B)$. To get the second equality, note that $B(\alpha_i) = b \prod_j (\alpha_i - \beta_j)$.
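· As a quick sanity check, the product formula can be compared numerically against an exact resultant (a sketch using numpy and sympy; the example polynomials are ours):

    import numpy as np
    from sympy import symbols, resultant

    X = symbols('X')
    A = [2, -3, 1]        # 2X^2 - 3X + 1
    B = [1, 0, -2, 5]     # X^3 - 2X + 5
    a, n = A[0], len(B) - 1
    alphas = np.roots(A)  # numerical roots of A
    poisson = a**n * np.prod([np.polyval(B, al) for al in alphas])
    print(poisson)                                            # ~ 132 numerically
    print(resultant(2*X**2 - 3*X + 1, X**3 - 2*X + 5, X))     # 132 exactly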



Algebraic Numbers form a Field

· THEOREM:
(i) $1/\alpha$ is a root of $X^m A(1/X)$, provided $\alpha \neq 0$.
(ii) $\beta \pm \alpha$ is a root of $C(X) = \mathrm{res}_Y(A(Y), B(X \mp Y))$.
(iii) $\alpha\beta$ is a root of $C(X) = \mathrm{res}_Y(A(Y), Y^n B(X/Y))$.

· PROOF: (i) is immediate. We prove (ii) for $\beta + \alpha$, as (iii) is similar:


$$\mathrm{res}_Y(A(Y), B(X - Y)) = a^n \prod_{i=1}^{m} B(X - \alpha_i) = a^n b^m \prod_{i=1}^{m} \prod_{j=1}^{n} (X - \alpha_i - \beta_j).$$
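· For instance, a defining polynomial of $\sqrt{2} + \sqrt{3}$ can be computed this way (a sketch using sympy's resultant; the symbol names are ours):

    from sympy import symbols, resultant

    X, Y = symbols('X Y')
    A = Y**2 - 2             # defines alpha = sqrt(2)
    B = (X - Y)**2 - 3       # B(X - Y), where B(Z) = Z^2 - 3 defines beta = sqrt(3)
    print(resultant(A, B, Y))   # X**4 - 10*X**2 + 1, which vanishes at sqrt(2)+sqrt(3)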


Degree and Length Bounds

· The above proof yields important information about the degrees and lengths when we add or multiply algebraic numbers.

· Let $\alpha_i$ have degree $\leq d_i$ and length at most $l_i$ ($i = 1, 2$).

· Then the degree and length of $\alpha_1 \alpha_2$ are at most $d_1 d_2$ and
$$l = l_1^{d_2}\, l_2^{d_1}.$$

· The degree and length of $\alpha_1 \pm \alpha_2$ are at most $d_1 d_2$ and
$$l = l_1^{d_2}\, l_2^{d_1}\, 2^{d_1 d_2 + \min\{d_1, d_2\}}.$$



Representing Algebraic Numbers

· How to represent polynomials? Sequence of coefficients.

· Let $A(X)$ be a defining polynomial for $\alpha$.

· ``Symbolic form'': $\alpha \sim (A(X), i)$, $1 \leq i \leq m$, meaning $\alpha$ is the $i$-th root of $A(X)$ in some fixed ordering.

· Isolating Interval: $\alpha \sim (A(X), a, b)$ where $[a,b]$ is an isolating interval for $\alpha$.

· Thom Representation: the sign signature of $r \in \mathbb{R}$ relative to $A(X)$ is the sign sequence of
$$A(r), A'(r), \ldots, A^{(i)}(r), \ldots, A^{(m)}(r).$$

Thom's lemma: the set of numbers $r$ with a given sign sequence is either an isolated point or an open interval. The isolated points are just the roots of $A(X)$. So
$$\alpha \sim (A(X), \mathrm{signature}(\alpha))$$

· Constructive Representation:
- relatively new, but important for us (more below)
- no explicit defining polynomial
- basically DAGs with integers at the leaves and operators at the internal nodes.



Localization of Zeros

· $D(z,r)$ = complex open disk centered at $z \in \mathbb{C}$, radius $r > 0$. Write $D(r)$ if $z = 0$.

· Let $A(X)$ have $k$ distinct roots. Define
$$0 < r_1 \leq r_2 \leq \cdots \leq r_m$$
where $r_i$ is the minimum radius such that $D(r_i)$ contains $\min\{i, k\}$ distinct roots of $A(X)$.

· Some root bound estimations:
- Give an upper bound on $r_m$
- Give an upper bound on $r_1$
- Give a lower bound on $r_1$
- Give a lower bound on $\min_{i=1}^{k-1} |r_{i+1} - r_i|$ (root separation)
- Give upper bounds on $r_m/r_1$ (isolation ratio)


Some Bounds

· (Cauchy)
$$r_1 \geq \frac{|a_0|}{|a_0| + \max\{|a_1|, \ldots, |a_n|\}}, \qquad r_n \leq 1 + \frac{\max\{|a_0|, \ldots, |a_{n-1}|\}}{|a_n|}. \qquad (3)$$
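These two bounds are easy to evaluate; a minimal sketch (the helper name cauchy_bounds is ours):

    import numpy as np

    def cauchy_bounds(coeffs):
        """Cauchy bounds (3). coeffs = [a_n, ..., a_0], highest degree first.
        Returns (lower bound on r_1, upper bound on r_n)."""
        a = np.abs(np.array(coeffs, dtype=float))
        lower = a[-1] / (a[-1] + a[:-1].max())   # max over |a_1|, ..., |a_n|
        upper = 1 + a[1:].max() / a[0]           # max over |a_0|, ..., |a_{n-1}|
        return lower, upper

    # Roots of X^3 - 6X^2 + 11X - 6 are 1, 2, 3:
    print(cauchy_bounds([1, -6, 11, -6]))        # (~0.353, 12.0): brackets [1, 3]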

· (Mahler) For any $k+1$ of the roots of $A(X)$, say $\alpha_0, \ldots, \alpha_k$ ($k = 1, \ldots, m$), reorder them so that
$$|\alpha_0| \geq |\alpha_1| \geq \cdots \geq |\alpha_k|.$$
Then
$$\prod_{i=1}^{k} |\alpha_{i-1} - \alpha_i| > \sqrt{|\mathrm{disc}(A)|} \cdot M(A)^{-m+1} \cdot m^{-m/2} \cdot \left( \frac{\sqrt{3}}{m} \right)^{k}.$$
NOTE: $A(X)$ has multiple roots iff $\mathrm{disc}(A) = 0$. Here $M(A) = |a_m| \prod_i \max\{1, |\alpha_i|\}$ is Mahler's measure. We know that $M(A) \leq \|A\|_2$.

When $k = 1$, we get a root separation bound: if $A$ is square-free with integer coefficients,
$$\mathrm{sep}(A) \geq \|A\|^{-m}\, m^{-(m+2)/2}. \qquad (4)$$


More Bounds

· Let $\lambda_1, \ldots, \lambda_m$ be positive numbers such that
$$\sum_{i=1}^{m} \lambda_i \leq 1.$$

· LEMMA (Henrici) We have $r_1 \geq \beta$ where
$$\beta := \min_{i=1,\ldots,m} \left\{ \left( \left| \frac{a_0}{a_i} \right| \lambda_i \right)^{1/i} \right\} \qquad (5)$$
and the $i$-th term is $\infty$ whenever $a_i = 0$.

· PROOF: Assume $|z| < \beta$. It suffices to prove that $|A(z)| > 0$:
$$|A(z)| = \left| \sum_{i=0}^{m} a_i z^i \right| > |a_0| \left( 1 - \sum_{i=1}^{m} \left| \frac{a_i}{a_0} \right| \beta^i \right) \geq |a_0| \left( 1 - \sum_{i=1}^{m} \lambda_i \right) \qquad \text{(since } \lambda_i \geq \beta^i\, |a_i/a_0| \text{)}$$
$$\geq 0.$$


Applications

· As an application, choose $\lambda_i = 1/2^i$ for $i = 1, \ldots, m$. Then
$$r_1 \geq \frac{1}{2} \min_{i=1}^{m} \left\{ \left| \frac{a_0}{a_i} \right|^{1/i} \right\} \qquad (6)$$

· Another application: if $\beta_1$ is the unique positive solution of the equation
$$|a_0| = \sum_{i=1}^{m} |a_i| X^i$$
then $r_1 \geq \beta_1$.

· PROOF: choose
$$\lambda_i := \left| \frac{a_i}{a_0} \right| \beta_1^i, \qquad (i = 1, \ldots, m).$$
By the definition of $\beta_1$, we have $\lambda_1 + \cdots + \lambda_m = 1$. Also,
$$\beta_1 = \lambda_i^{1/i} \left| \frac{a_0}{a_i} \right|^{1/i}$$
for all $a_i \neq 0$. Thus $\beta_1$ is the same as the $\beta$ defined in the above lemma. Our claim follows from the said lemma.


Related Bounds

· Complementary bounds: upper bounds on $r_m$ and lower bounds on $r_1$ are basically interchangeable.

· This is because $\alpha \neq 0$ is a zero of $A(X)$ iff $1/\alpha$ is a zero of the reversed polynomial $B(X) = X^m A(1/X)$.

· From (6), we immediately get:
$$r_m \leq 2 \max_{i=1}^{m} \left| \frac{a_{m-i}}{a_m} \right|^{1/i}$$

· Similarly, $r_m$ is upper bounded by the unique positive root of the equation
$$|a_m| X^m = \sum_{i=0}^{m-1} |a_i| X^i.$$
This is another bound of Cauchy's.

· Above we considered discs $D(r)$ centered at the origin. We can consider $D(z,r)$ just by shifting the polynomial to $A(X+z)$.


Cauchy Index

· For more precise location of roots, we need Sturm's theory.

· Write $A(X) = a_m \prod_{i=1}^{k} (X - \alpha_i)^{m_i}$ $(1 \leq k \leq m)$

· Key is the behaviour of the logarithmic derivative of $A$,
$$f(X) = \frac{A'(X)}{A(X)} = \sum_{i=1}^{k} \frac{m_i}{X - \alpha_i}$$

· The Cauchy index of a real function $f(x)$ at $x = r$:
$I_f(r) = +1$ if $r$ is a pole of $f$ and $f(x)$ switches from $-\infty$ to $+\infty$ as $x$ increases through $r$.
$I_f(r) = -1$ if the sign changes from $+\infty$ to $-\infty$.
Else, $I_f(r) = 0$.

· For $a \leq b$, let $I_f[a,b]$ be the sum of $I_f(r)$ over all $a \leq r \leq b$. E.g., when $f(X) = A'(X)/A(X)$, $I_f[a,b]$ is the number of distinct real roots of $A(X)$ in $[a,b]$.


Sturm Sequence

· Given a sequence $(x_1, \ldots, x_n)$ of real numbers, define its sign variation $\mathrm{Var}(x_1, \ldots, x_n)$ to be the number of sign changes in the signs of the $x_i$, ignoring zero signs.

· E.g.,
$$\mathrm{Var}(1, 0, -2, 1.3, 3.2, 0, 0, -0.5) = 3.$$

· The Sturm sequence for a pair of real functions $A_0(X), A_1(X)$ over an interval $[a,b]$ is a sequence $\bar{A} = (A_0, A_1, \ldots, A_k)$

of real functions such that
(i) $A_0(a) A_0(b) \neq 0$,
(ii) $A_k(x)$ is non-zero for $x \in [a,b]$,
(iii) for $i = 0, \ldots, k-1$ and $a \leq x \leq b$: if $A_i(x) = 0$, then either $i = 0$ and $A_1(x) \neq 0$, or $A_{i-1}(x) A_{i+1}(x) < 0$.

· For any real $a$, let $V_{\bar{A}}(a)$ be the sign variation of $\bar{A}(a) = (A_0(a), A_1(a), \ldots, A_k(a))$. Omit $\bar{A}$ if understood.

· THEOREM (Sturm)

If $\bar{A}$ is a Sturm sequence for $A_0, A_1$ over the interval $[a,b]$, then $I_f[a,b] = V(a) - V(b)$ where $f = A_1/A_0$.
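· For polynomials, sympy can build the classical Sturm chain $(A, A', \ldots)$ directly; a small sketch counting distinct real roots in an interval (the helper name var_at is ours):

    from sympy import symbols, sturm, Poly

    X = symbols('X')
    chain = sturm(Poly(X**3 - 3*X + 1, X))   # Sturm sequence of A, A'

    def var_at(chain, t):
        """Sign variations of the chain at t, ignoring zeros."""
        signs = [p.eval(t) for p in chain if p.eval(t) != 0]
        return sum(1 for u, v in zip(signs, signs[1:]) if u*v < 0)

    # X^3 - 3X + 1 has two distinct real roots in [0, 2]:
    print(var_at(chain, 0) - var_at(chain, 2))   # 2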


Connection to PRS

· How do we construct Sturm sequences efficiently? Via polynomial remainder sequences (PRS), a very well-studied topic.

· Focus on computing in $D[X]$ where $D$ is a unique factorization domain (UFD). Typically $D = \mathbb{Z}$ or $D = \mathbb{Z}[X_1, \ldots, X_{n-1}]$. There is a big complexity difference between working over $D$ and over its quotient field.

· Let $A, B \in D[X]$. Write $A \sim B$ (similarity) if $aA = bB$ for some non-zero $a, b \in D$.

· The content of $A$ is the GCD of its coefficients. Call $A$ primitive if its content is 1. Gauss' lemma: $A, B$ primitive implies $AB$ primitive.

· Pseudo-remainder of $A(X), B(X) \in D[X]$: $\mathrm{prem}(A,B) = \mathrm{rem}(\mathrm{lead}(B)^{\delta} A, B)$ where $\delta = \deg(A) - \deg(B) + 1$.

· A sequence $(A_0, A_1, \ldots, A_h)$ is called a PRS (over $D$) if $A_{i+1} \sim \mathrm{prem}(A_{i-1}, A_i)$ for $i = 1, \ldots, h-1$, and $\mathrm{prem}(A_{h-1}, A_h) = 0$.

· E.g., pseudo-Euclidean PRS: just let $A_{i+1} = \mathrm{prem}(A_{i-1}, A_i)$. Problem: the coefficients (easily) grow exponentially.

· E.g., primitive PRS: take the primitive part at each step. Problem: a content GCD computation at every step.


Subresultant PRS

· Solution of Collins (1967), with subsequent simplifications by Brown:

· Maintain recursively an element $\beta_i \in D$ and let $A_{i+1} = \mathrm{prem}(A_{i-1}, A_i)/\beta_i$.

· For $i = 0, \ldots, k-1$, we do the following. Assume that $A_{i+1}$ has just been computed. Then set
$$d_i := \deg(A_i) - \deg(A_{i+1}), \qquad a_i := \mathrm{lead}(A_i).$$

Then let
$$\beta_{i+1} := \begin{cases} (-1)^{d_0+1} & \text{if } i = 0, \\ (-1)^{d_i+1} (\psi_i)^{d_i}\, a_i & \text{if } i = 1, \ldots, k-2, \end{cases}$$
where $(\psi_0, \ldots, \psi_{k-1})$ is an auxiliary sequence given by $\psi_0 = 1$ and
$$\psi_{i+1} := \psi_i \left( \frac{a_{i+1}}{\psi_i} \right)^{d_i} = \frac{(a_{i+1})^{d_i}}{(\psi_i)^{d_i - 1}},$$
for $i = 0, \ldots, k-2$.
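· The coefficient growth is easy to observe experimentally. A sketch contrasting the pseudo-Euclidean PRS with sympy's built-in subresultant PRS, on Knuth's classic example (the choice of example is ours):

    from sympy import symbols, prem, subresultants

    X = symbols('X')
    A = X**8 + X**6 - 3*X**4 - 3*X**3 + 8*X**2 + 2*X - 5
    B = 3*X**6 + 5*X**4 - 4*X**2 - 9*X + 21

    # Pseudo-Euclidean PRS: A_{i+1} = prem(A_{i-1}, A_i); coefficients explode.
    seq = [A, B]
    while True:
        R = prem(seq[-2], seq[-1], X)
        if R == 0:
            break
        seq.append(R.expand())
    print([max(abs(c) for c in p.as_poly(X).all_coeffs()) for p in seq])

    # Subresultant PRS: same degree sequence, but coefficients stay small.
    print(subresultants(A, B, X))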


Correctness and Complexity

· Why is this correct?

· Why is this polynomial time?

· Theory of subresultant sequences will explain all these. See book.

· Complexity. It is easy to see that the algorithm takes $O(n^2 \log n)$ operations in $D$.

· Improvement: Schwartz [10] applied the Half-GCD idea to get an $O(n \log^2 n)$ bound, provided we only compute the sequence of partial quotients and similarity coefficients
$$(Q_1, a_1, b_1), \ldots, (Q_{k-1}, a_{k-1}, b_{k-1})$$
where $a_i A_{i+1} = b_i A_{i-1} + A_i Q_i$.


The Half-GCD Technique

· For simplicity, we assume the Euclidean case (so D is actually a field).

Initial idea: if
$$A_{i+1} = A_{i-1} - Q_i A_i, \qquad (i = 1, \ldots, h-1),$$
we should focus on the quotient sequence $(Q_1, \ldots, Q_{h-1})$ instead of the remainders.

JUSTIFICATIONS: (1) We can reconstruct the remainder sequence in $O(n \log^2 n)$ time.
(2) Storing the quotient sequence uses linear, not quadratic, space.

· Write in matrix notation:
$$V_i := \begin{bmatrix} A_{i-1} \\ A_i \end{bmatrix} = \begin{bmatrix} Q_i & 1 \\ 1 & 0 \end{bmatrix} \begin{bmatrix} A_i \\ A_{i+1} \end{bmatrix} =: \begin{bmatrix} Q_i & 1 \\ 1 & 0 \end{bmatrix} V_{i+1}.$$
Note that the matrix here has determinant $-1$. More concisely:
$$V_i \xrightarrow{\ Q_i\ } V_{i+1}.$$

· The HGCD Algorithm, on input $A, B$ ($\deg A > \deg B > 0$), will return the quotient sequence $Q$ so that
$$V = \begin{bmatrix} A \\ B \end{bmatrix} \xrightarrow{\ Q\ } V' = \begin{bmatrix} A' \\ B' \end{bmatrix}$$
such that they straddle the mid-point:
$$\deg(A') \geq \deg(A)/2 > \deg(B').$$

· SPEEDUP IDEA: Suppose we replace $A, B$ by $A' := A \mathbin{\mathrm{div}} X^{m/2}$ and $B' := B \mathbin{\mathrm{div}} X^{m/2}$. Let us recursively call $Q' := \mathrm{HGCD}(A', B')$. What happens if we now apply $Q'$ to reduce $(A, B)$? More precisely, let
$$\begin{bmatrix} A \\ B \end{bmatrix} \xrightarrow{\ Q'\ } \begin{bmatrix} A'' \\ B'' \end{bmatrix}$$
and we ask: what are $\deg(A'')$ and $\deg(B'')$? Intuitively, $\deg(A'') \geq \frac{3}{4}\deg(A) \geq \deg(B'')$. Thus we managed to go down ``a quarter''.

A second recursive call on $\mathrm{HGCD}(A'', B'')$ should get us to the half-way point! Problem: what if we get too lucky?

· If HGCD takes time $O(n \log^2 n)$, then we can reduce the GCD problem to HGCD so that the overall running time remains $O(n \log^2 n)$.

· Brief history of the half-GCD technique: the ideas apply to both polynomial GCD and integer GCD. The latter is harder! Knuth (1971), Schönhage, Moenck, Aho-Hopcroft-Ullman, Brent-Gustavson-Yun, Thull-Yap.


From PRS to Sturm

· It is easy to see: if you negate the remainder at each step of the polynomial remainder computation, you get a Sturm sequence.

· But suppose we are given a PRS, where
$$b_i A_{i+1} = a_i A_{i-1} + Q_i A_i, \qquad i = 1, \ldots, h-1.$$
How can we get the Sturm sequence? E.g., in the subresultant PRS, we know $b_i, a_i$.

· Let $\sigma_i = -\mathrm{sign}(a_i b_i)$.

· Our goal is to compute signs $(s_0, \ldots, s_h)$, with $s_0 = s_1 = 1$, such that $(s_0 A_0, \ldots, s_h A_h)$ is a Sturm sequence.

· At a root $x$ of $A_i$, the PRS relation gives $b_i A_{i+1}(x) = a_i A_{i-1}(x)$; for a Sturm sequence, $(s_{i-1} A_{i-1})(x)$ and $(s_{i+1} A_{i+1})(x)$ must have opposite signs. So we need
$$\mathrm{sign}(a_i b_i\, s_{i-1} s_{i+1}) = -1, \quad \text{i.e.,} \quad s_{i-1} s_{i+1} = \sigma_i.$$

· Multiplying together these equations (each factor below equals 1),
$$(s_0 \sigma_1 s_2)(s_2 \sigma_3 s_4)(s_4 \sigma_5 s_6) \cdots (s_{2j-2} \sigma_{2j-1} s_{2j}) = 1.$$

· Telescoping (using $s_{2i}^2 = 1$ and $s_0 = 1$), we obtain the desired formula for $s_{2j}$:
$$s_{2j} = \prod_{i=1}^{j} \sigma_{2i-1}.$$

Similarly, for the odd indices,
$$s_{2j+1} = \prod_{i=1}^{j} \sigma_{2i}.$$

· Thus the sequence $(s_2, \ldots, s_h)$ of signs splits into two alternating subsequences whose computations depend on two disjoint subsets of $\{\sigma_1, \ldots, \sigma_{h-1}\}$.

· PRS computation is amenable to fast parallel computation. It is nice to observe that the $s_i$'s can be computed in $O(\log n)$ parallel steps, using the parallel prefix algorithm, as in the sketch below.
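· Sequentially, the sign computation is a two-line recurrence; the even- and odd-indexed subsequences are independent prefix products, which is exactly what makes parallel prefix applicable (the helper name sturm_signs is ours):

    def sturm_signs(sigma):
        """sigma[i-1] = -sign(a_i * b_i) for i = 1, ..., h-1 (entries +/-1).
        Returns (s_0, ..., s_h) so that (s_i * A_i) is a Sturm sequence."""
        s = [1, 1]                    # s_0 = s_1 = 1
        for sg in sigma:
            s.append(sg * s[-2])      # s_{i+1} = sigma_i * s_{i-1}
        return s

    print(sturm_signs([-1, 1, -1]))   # [1, 1, -1, 1, 1]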


Domination

· There is one detail we forgot (:-). The last term $A_h$ in the PRS need not be a constant. Recall the definition.

Evaluating a Sturm sequence at a root $r$ of $A_h$ causes all terms to vanish. We call $r$ a degenerate value.

· For instance, if $A_0, A_1$ are relatively prime, then $A_h$ is a constant, so there are no degenerate values. Otherwise, the standard response is: use the depressed sequence $A_i' = A_i/A_h$ for $i = 0, \ldots, h$.

· Actually, it is known that this is unnecessary. But we want to understand this phenomenon in general.

· DEFINITION: let $m(A, r)$ denote the multiplicity of $r$ as a root of $A(X)$. N.B. $m(A, r) = 0$ if $r$ is not a root of $A$.

We say $A$ dominates $B$ if at each root $\alpha$ of $A$, we have
$$m(A, \alpha) \geq m(B, \alpha).$$

· Thus, $A$ dominates $B$ in the following situations:
(a) $B$ is a derivative of $A$.
(b) $A$ and $B$ are relatively prime.
(c) $B$ is square-free.
(d) $B$ divides $A$.


A General Sturm Theorem

· Let us fix $A, B$ and form a Sturm sequence of $A, B$. For any $a < b$, let
$$V_{A,B}[a, b] = V(a) - V(b)$$
be the difference in sign variations at $a$ and at $b$.

· Different forms of ``Sturm theories'' amount to the question: what does $V[a,b]$ count?

· THEOREM: Assume $A$ dominates $B$ and $A(a) A(b) \neq 0$. Then
$$V[a,b] = \sum_{\alpha} \mathrm{sign}\!\left( A^{(r)}(\alpha)\, B^{(s)}(\alpha) \right) \qquad (7)$$
where $\alpha$ ranges over all roots of $A$ in the interval $[a,b]$ for which $r + s$ is odd, with $r = m(A,\alpha)$ and $s = m(B,\alpha)$.

· REMARK: this theorem is only proved for polynomials, not real functions A, B.


Corollaries

· Let V[a,b] be the difference of sign variation for A, B, as in the last theorem.

· 1. STURM'S THEOREM:

If $B = A'$ (derivative), then $V[a,b]$ counts the number of distinct roots of $A$ in $[a,b]$.

PROOF: In the above (7), we have $r + s$ odd for every root $\alpha$ of $A$ in $[a,b]$. Moreover, at each $\alpha$, we have $A^{(r)}(\alpha) = B^{(s)}(\alpha)$, so that each summand is $+1$.

N.B. We have thus justified our ``oversight'' of not dividing by $A_h$.

· 2. THEOREM (Schwartz-Sharir) If $A, B$ are square-free, then
$$V[a,b] = \sum_{\alpha} \mathrm{sign}(A'(\alpha) B(\alpha))$$
where $\alpha$ ranges over the roots of $A(X)$ in $[a,b]$.

PROOF: Let $\alpha$ be a root of $A(X)$ in $[a,b]$. If $B(\alpha) = 0$, then it adds nothing to (7), since $r + s$ is then even. If $B(\alpha) \neq 0$, then $r = 1$ and $s = 0$ in (7). Thus the sum in this theorem is equal to the sum in (7).

· 3. THEOREM (Sylvester, also: Ben-Or, Kozen, Reif)

Let $B = A'C$ for some polynomial $C$, with $A, C$ relatively prime. Then
$$V[a,b] = \sum_{\alpha} \mathrm{sign}(C(\alpha))$$
where $\alpha$ ranges over the roots of $A$ in $[a,b]$.

PROOF: Note that $A$ dominates $B$. Then $V[a,b]$ is the same sum as in the previous theorem. However, the summands $\mathrm{sign}(A'(\alpha) B(\alpha))$ are now equal to
$$\mathrm{sign}(A'(\alpha) A'(\alpha) C(\alpha)) = \mathrm{sign}(C(\alpha)).$$

N.B. This result is used in the famous BKR algorithm.

· 4. THEOREM (Cauchy Index)

Let $f(X) = B(X)/A(X)$ where $A, B$ are relatively prime. Then $V[a,b]$ equals the Cauchy index $I_f[a,b]$.

PROOF: In the summation of $V[a,b]$, we must have $s = 0$ and $r$ odd. This means
$$\mathrm{sign}(A^{(r)}(\alpha)) = \frac{\mathrm{sign}(A(\alpha^+)) - \mathrm{sign}(A(\alpha^-))}{2},$$
$$\mathrm{sign}(A^{(r)}(\alpha) B^{(0)}(\alpha)) = \frac{\mathrm{sign}(A(\alpha^+) B(\alpha^+)) - \mathrm{sign}(A(\alpha^-) B(\alpha^-))}{2} = \frac{\mathrm{sign}(f(\alpha^+)) - \mathrm{sign}(f(\alpha^-))}{2},$$
since $\mathrm{sign}(AB) = \mathrm{sign}(B/A)$. Summing the last equation over all $\alpha$, the left-hand side equals $V[a,b]$, and the right-hand side equals $I_f[a,b]$, since the poles of $f$ are exactly the roots of $A$.


Sign Determination of Algebraic Numbers

· Problem A:

How do we compare two algebraic numbers $\alpha : \beta$?

· Assume $\alpha \sim (A, a, b)$ and $\beta \sim (B, a, b)$ are two numbers represented by isolating intervals. It is assumed that $A, B$ are square-free.

N.B. It is wlog to assume that they have the same interval.

· Consider the function $B(X)$ in the interval $[a,b]$. We have:
$$\alpha \geq \beta \quad\Longleftrightarrow\quad B(\alpha) \cdot B'(\beta) \geq 0.$$
Note that equality is attained on both sides at the same time.

· But the sign of $B'(\beta)$ is that of
$$B(b) - B(a)$$
and the sign of $B(\alpha)$ is given by the theorem of Sylvester (and Ben-Or, Kozen, Reif):
$$V_{A, A'B}[a,b].$$

· Problem B:

Suppose $\alpha \sim (A, a, b)$ as above. We want to compute in the field $\mathbb{Q}(\alpha)$. Suppose $\beta = B(\alpha)$ where $B$ is any polynomial. What is the sign of $\beta$?

· By the theorem of Schwartz-Sharir, we first compute
$$V_{A,B}[a,b] = \mathrm{sign}(A'(\alpha) B(\alpha)).$$
But $\mathrm{sign}(A'(\alpha)) = \mathrm{sign}(A(b) - A(a))$. Hence $\mathrm{sign}(B(\alpha))$ can be determined.
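· The Sylvester query itself is just a sign-variation difference over a generalized Sturm sequence. A sketch determining the sign of $B(\alpha)$ for $\alpha = \sqrt{2}$ (the helper names neg_prs, var are ours):

    from sympy import symbols, rem, degree, Rational

    X = symbols('X')

    def neg_prs(A0, A1):
        """Generalized Sturm sequence: negate the remainder at each step."""
        chain = [A0.expand(), A1.expand()]
        while degree(chain[-1], X) > 0:
            r = rem(chain[-2], chain[-1], X)
            if r == 0:
                break
            chain.append((-r).expand())
        return chain

    def var(chain, t):
        signs = [c.subs(X, t) for c in chain]
        signs = [s for s in signs if s != 0]
        return sum(1 for u, v in zip(signs, signs[1:]) if u * v < 0)

    # alpha = sqrt(2), isolated by A = X^2 - 2 on [1, 2]; B = X - 7/5.
    A = X**2 - 2
    B = X - Rational(7, 5)
    chain = neg_prs(A, (A.diff(X) * B).expand())    # query V_{A, A'B}[1, 2]
    print(var(chain, 1) - var(chain, 2))            # +1: sqrt(2) > 7/5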


Theorem of Routh-Hurwitz

· PROBLEM: Given $F(Z) \in \mathbb{C}[Z]$, we want to count the number of its complex roots that lie in the upper half of the complex plane.

· Let $F(Z) = F_0(Z) + i F_1(Z)$ where $F_0, F_1 \in \mathbb{R}[Z]$.
- We may assume $F_0 F_1 \neq 0$. Otherwise $F$ or $iF$ is a real polynomial, and this case is easy.
- $\alpha$ is a real root of $F(Z)$ iff it is a root of $\gcd(F_0, F_1)$.
- We may assume $F_0, F_1$ are relatively prime (so $F$ has no real roots).
- We may assume $\deg(F_0) \geq \deg(F_1)$ (otherwise, replace $F$ by $iF$).

· THEOREM (Routh-Hurwitz): The number of roots of $F(Z)$ in the upper half plane is given by
$$\frac{1}{2} \left( \deg(F) - V_{F_0, F_1}[-\infty, +\infty] \right).$$
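· The sign variations at $\pm\infty$ are read off from leading coefficients, so the count is effectively computable. A sketch (helper names are ours; the negated-remainder sequence is as in the PRS sections):

    from sympy import symbols, rem, degree, LC, I, re, im, expand

    X = symbols('X', real=True)

    def neg_prs(A0, A1):
        chain = [expand(A0), expand(A1)]
        while degree(chain[-1], X) > 0:
            r = rem(chain[-2], chain[-1], X)
            if r == 0:
                break
            chain.append(expand(-r))
        return chain

    def V_inf(chain, sgn):
        # sign of A_i at +oo is sign(LC); at -oo it is sign(LC)*(-1)^deg
        signs = [LC(c, X) * sgn**degree(c, X) for c in chain]
        signs = [s for s in signs if s != 0]
        return sum(1 for u, v in zip(signs, signs[1:]) if u * v < 0)

    # F(Z) = (Z - i)(Z - 1 - i): both roots lie in the upper half-plane.
    F = expand((X - I) * (X - 1 - I))
    F0, F1 = re(F), im(F)                    # F = F0 + i*F1 with F0, F1 real
    chain = neg_prs(F0, F1)
    V = V_inf(chain, -1) - V_inf(chain, 1)   # V_{F0,F1}[-oo, +oo]
    print((degree(F, X) - V) / 2)            # 2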


Complex Root Isolation Algorithm

· We show how to count roots in any box (i.e., axes-aligned rectangle). This is basically Pinkert's approach (1970).

Name the 4 quadrants of the complex plane (I), (II), (III), (IV).

· By simple transformations of $F(Z)$ we can also solve the following problems:
(1) Counting roots in the lower half-plane: use the above procedure on $G(Z) := F(-Z)$.
(2) Counting roots to the right of the imaginary axis: $\alpha$ is to the right of the imaginary axis iff $i\alpha$ is above the real axis. So, use $G(Z) := F(Z/i) = F(-iZ)$.
(3) Counting roots in either (I) or (III): $\alpha$ is in (I) or (III) iff $\alpha^2$ is in the upper half plane. The polynomial $f(Z) := F(Z) F(-Z)$ has no odd-degree terms: $f(Z) = G(Z^2)$ for some $G(Z)$, and the roots of this $G(Z)$ are precisely the squares of the roots of $F(Z)$. Use this $G(Z)$!
$$G(Z) := f(\sqrt{Z}) = F(\sqrt{Z})\, F(-\sqrt{Z}) \qquad (8)$$

(4) Similarly, we can count roots in either (II) or (IV).

(5) Counting roots in (I): This is given by
$$\#(I) = \frac{1}{2} \left[ \#(I{+}II) + \#(I{+}IV) - \#(II{+}IV) \right].$$

(6) Counting roots in a translated quadrant $t + (I)$: $\alpha$ is in $t + (I)$ iff $\alpha - t$ is in (I). So let $G(Z) = F(Z + t)$. Then $F(\alpha) = 0$ iff $G(\alpha - t) = 0$. We can count such $\alpha - t$'s by applying (5) to $G(Z)$.
(7) Counting in any finite box $B$: translate (I) to each corner of the box, and call the translated quadrants NW, SW, NE, SE. Then we have
$$\#(B) = \#(SW) - \#(NW) - \#(SE) + \#(NE).$$

· Once we can count in a box, we can use any bisection method (or quadtree subdivision) to converge to every root.


Multidimensional Sturm

· Hermite (1852, 1853, 1880) gave the generalization to 2-D, with a method for computing the relevant sequences using symmetric functions.

· Pedersen [8] (NYU thesis, 1991) gave the generalization to all dimensions. He provided sequences for counting zeros in boxes, simplices and arbitrary shapes.

· Philip Milne [5] (Bath thesis, 1990) independently gave another formulation based on a volume function, which does not fall under Pedersen's class of sequences. Although simple, this computational tool has not been fully developed.


Survey of Zero Finding Algorithms

· Follow Henrici [3] (with updates) by citing a list of attributes.


Variations and Complexity Model

Formulations of the root finding problem:

· Complex Root Location: given $P(X)$ of degree $n$ and a precision $b > 1$, determine $m$ discs $D_1, \ldots, D_m$ such that the diameter of each $D_i$ is at most $2^{-b}$, and there is a bijection $f$ from the $m$ roots of $P$ to the $D_i$'s such that $f(z_j) = D_i$ implies $z_j \in D_i$.

REMARK: the discs can overlap (even coincide).

· Complex Root Isolation: like the above, except that the disks are either disjoint or identical.

· Complex Root Factorization: (Schönhage) compute $n$ linear factors $L_i(X) := a_i X + b_i$ ($a_i, b_i \in \mathbb{C}$) such that $|P - L_1 \cdots L_n| < 2^{-b}$. Here $|\cdot|$ can be any polynomial norm.

· Complexity Model: the Algebraic Model, where we just count the number of algebraic operations (typically $+, -, \times, \div$ and possibly radical extraction).

The Boolean Model, where we reduce complexity to counting bit operations. This can be uniform (e.g., Turing machines) or non-uniform (Boolean circuits).

· The complexity of root finding is a function $T(n, b)$ in an appropriate model. Focus of theoretical research: bound $T(n,b)$ tightly. DISCUSSION: another parameter may be appropriate.

· The algebraic model is preferred by most papers (it is simplest). We can generally transform a bound in the algebraic model into a bit-model bound by multiplying by $M(n(n+b))$, where $M(n) = O(n \log n \log\log n)$ is the complexity of multiplication in the Boolean model.


Weyl's Exclusion Algorithm

· Weyl's (1924) constructive proof of the Fundamental Theorem of Algebra is considered the first ``algorithm'' for complex root finding [9]. [Also L.E.J. Brouwer (1924).]

· Pan [7] observed that several reworkings of Weyl's algorithm, by Henrici and Gargantini (1969), Renegar (1987) and Pan (1987), each implied new record bounds for the complexity $T(n,b)$.

· Weyl's method is based on the notion of Exclusion Tests relative to a polynomial $F(Z)$: given a region $R \subseteq \mathbb{C}$ of a suitable type, the region either passes or fails the test.

· Exclusion property: if $R$ passes the test, then $R$ contains no zeros.

· Typical regions are disks or boxes (actually squares). Each region has a center. Assume there is a natural way to cover each region R by a fixed number of smaller regions. E.g., for boxes, use a quadtree subdivision.

· The ``Weyl scheme'' for complex root finding. Stage 0: begin with a region containing all roots. Stage $i+1$: let $S_i$ be all the regions that failed the test at the previous stage. For each $R \in S_i$, apply the test to each subregion spawned from $R$. Halt when the regions are sufficiently small.

· Simple exclusion tests: we know (see (3) or (6)) how to compute lower bounds $d$ with $d \leq r_1$. Given a box $B$ with center $z$, we determine $d$ such that the disk $D(z,d)$ contains no zeros. The box passes the test iff $B \subseteq D(z,d)$, as in the sketch below.
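· A minimal sketch of this scheme in Python/numpy, assuming the simple exclusion test above: the exclusion radius comes from applying (6) to the shifted polynomial $A(X+z)$, whose coefficients are the Taylor coefficients $A^{(i)}(z)/i!$ (helper names exclusion_radius, weyl are ours):

    import numpy as np

    def exclusion_radius(p, z):
        """Lower bound (6) on the distance from z to the nearest root of p."""
        a0 = p(z)                       # constant Taylor coefficient at z
        best = np.inf
        q, fact = p, 1.0
        for i in range(1, p.order + 1):
            q = q.deriv()
            fact *= i
            ai = q(z) / fact            # i-th Taylor coefficient at z
            if ai != 0:
                best = min(best, abs(a0 / ai) ** (1.0 / i))
        return 0.5 * best

    def weyl(p, z, w, eps):
        """Quadtree subdivision of the square with center z, half-width w."""
        if w * np.sqrt(2) <= exclusion_radius(p, z):
            return []                   # box passes: it contains no root
        if 2 * w <= eps:
            return [(z, w)]             # small enough: report as candidate
        out = []
        for dx in (-w/2, w/2):
            for dy in (-w/2, w/2):
                out += weyl(p, z + complex(dx, dy), w/2, eps)
        return out

    p = np.poly1d([1, 0, 0, -1])        # Z^3 - 1
    # Initial box from Cauchy's bound (3): all roots have modulus <= 2.
    for z, w in weyl(p, 0j, 2.0, 0.1):
        print(z)                        # centers cluster near the cube roots of 1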


Improvements: Proximity Tests

· Proximity Tests are exclusion tests with an additional property: for some $\rho > 1$, if a region $B$ fails the test then the ``expanded region'' $\rho B$ must contain a root.

REMARK: $\rho B$ denotes the expansion (or contraction) of $B$ about its center. Call $\rho$ the ratio or relative error of the test.

· Let the roots of $F(Z) = \sum_{i=0}^{m} a_i Z^i$ be $\alpha_1, \ldots, \alpha_m$.

· Following van der Sluis [2], we shift the origin to the center of gravity $C = -a_{m-1}/(m\, a_m)$ of the roots, and consider $G_0(Z) = F(Z + C) = \sum_{i=0}^{m} b_i Z^i$. Then
$$b^* \sqrt{2/m} \;\leq\; \max_i |\alpha_i - C| \;\leq\; 1.63\, b^*, \qquad \left( b^* = \max_{i < m} |b_i/b_m|^{1/(m-i)} \right).$$
This gives us a proximity test with ratio $\rho = 1.63 \sqrt{m/2}$. Using arguments from above, we get the weaker $\rho = 2m$.

· Recall from (8) that we have a transformation $F(Z) \to G(Z)$ (Graeffe's root-squaring) such that the zeros of $G(Z)$ are the squares of the zeros of $F(Z)$. Let $G_k(Z)$ denote the $k$-th iterate of this transformation applied to $G_0$.

· Applying the proximity test to $G_k(Z)$, we obtain ratio $\rho = (1.63 \sqrt{m/2})^{1/2^k}$. Choose $k = \Theta(\log\log m)$ to get a ratio $\rho \leq 2$, say.

· P. Turán (1968, 1975, 1984) gave non-trivial bounds on the roots based on power sums of the roots. To apply these bounds, we compute the power sums of the roots of $G_k$; this can be done using Newton's identities in $O(m \log m)$ steps. This leads to further improvement of Weyl's method.


Eigenvalue Approach

· Theoretical versus Practical Methods: this classification is almost co-extensive with classifying algorithms into ``recursive versus iterative methods''. The asymptotically best algorithms do not work in practice.

· Bini and Fiorentino (1999) implemented a very fast solver, mpsolve, based on Aberth's method, a Newton-like simultaneous iteration.

· Another approach to iterative algorithms is via eigenvalue computation. Fortune (2000) implemented an eigenvalue-based algorithm, eigensolve, that is very competitive with mpsolve.

· Recall: $\lambda$ is an eigenvalue of a (square) matrix $M$ with associated eigenvector $v$ if $Mv = \lambda v$. The characteristic polynomial of $M$ is $p(x) = \det(xI - M)$. So, the $\lambda$'s are the zeros of $p(x)$.

· Goal: given $p(x)$, find an associated $M$. Thus, we reduce root finding to matrix eigenvalues.

· Let $D = \mathrm{diag}(d_1, \ldots, d_n)$ be a diagonal matrix, and $R = (r_{ij})$ be a rank 1 matrix (i.e., $R = u^T v$ where $u, v$ are $n$-vectors).


Reduction to Eigensolving

· LEMMA:
$$\det(D + R) = \prod_{i=1}^{n} d_i + \sum_{i=1}^{n} r_{ii} \prod_{j \neq i} d_j.$$

PROOF: by induction on $n$, and
$$\det(D + R) = \det(D[1] \mid (D+R)[2{:}n]) + \det(R[1] \mid (D+R)[2{:}n])$$
where $M[i{:}j]$ is the submatrix with columns $i$ to $j$, and $M[i] := M[i{:}i]$.

· COROLLARY (with $\mathrm{char}(M) = \det(XI - M)$ as above; apply the lemma to $(XI - D) + (-R)$):
$$\mathrm{char}(D + R) = \prod_{i=1}^{n} (X - d_i) - \sum_{i=1}^{n} r_{ii} \prod_{j \neq i} (X - d_j).$$

· Let $S = (s_1, \ldots, s_n)$ be distinct complex numbers, and $P(X)$ a monic polynomial of degree $n$. Also let $Q(X) = Q_S(X) := \prod_{i=1}^{n} (X - s_i)$. Define the Lagrange coefficients
$$l_i := \frac{P(s_i)}{Q_S'(s_i)}, \qquad (i = 1, \ldots, n).$$

· The generalized companion matrix $C(P,S)$ is
$$C(P,S) := \mathrm{diag}(s_1, \ldots, s_n) - \begin{bmatrix}
l_1 & l_2 & \cdots & l_n \\
l_1 & l_2 & \cdots & l_n \\
\vdots & \vdots & & \vdots \\
l_1 & l_2 & \cdots & l_n
\end{bmatrix}.$$


Root Solving via Eigenvalue

· THEOREM (Smith, 1970): The eigenvalues of the matrix $C(P,S)$ are precisely the roots of $P$.

PROOF: This is equivalent to saying that $\mathrm{char}(C(P,S)) = P$. By the COROLLARY (applied with $D = \mathrm{diag}(s_1, \ldots, s_n)$ and $r_{ii} = -l_i$),
$$\mathrm{char}(C(P,S)) = \prod_{i=1}^{n} (X - s_i) + \sum_{i=1}^{n} l_i \prod_{j \neq i} (X - s_j).$$
The second term on the RHS is the Lagrange interpolating polynomial of degree $\leq n-1$ with value $P(s_i)$ at each $s_i$ (since $l_i\, Q_S'(s_i) = P(s_i)$). Thus the LHS agrees with $P$ at each $s_i$. Since both are monic of degree $n$, they must be equal.

· Let C(P) be the usual companion matrix of P. We assume the existence of a floating point eigenvalue subroutine called eig.

· Here is Fortune's algorithm:

    Procedure eigensolve(P)
    1. S = eig(fl(C(P)));        // fl converts to floating point
    2. while (!converged(S))
    3.     S = eig(fl(C(P,S)));
    4. return S;
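· A minimal numerical sketch of one such refinement loop, using numpy's eigensolver (the helper gen_companion and the use of np.roots for the initial estimates are ours):

    import numpy as np

    def gen_companion(P, S):
        """Generalized companion matrix C(P,S); P monic, coeffs high to low."""
        lam = np.array([np.polyval(P, s) / np.prod(s - np.delete(S, i))
                        for i, s in enumerate(S)])     # l_i = P(s_i)/Q'(s_i)
        return np.diag(S) - np.tile(lam, (len(S), 1))  # every row is (l_1..l_n)

    P = np.array([1.0, -6, 11, -6])                    # (X-1)(X-2)(X-3)
    S = np.roots(P) + 0.1                              # crude initial estimates
    for _ in range(3):
        S = np.linalg.eigvals(gen_companion(P, S))
    print(np.sort_complex(S))                          # ~ [1, 2, 3]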

· It is an open problem to prove convergence of this algorithm in general. Fortune has proved conditional convergence.


Durand-Kerner Method

· Durand-Kerner Method. Recent implementation by Shankar Krishnan. Works quite well in practice.

· Let $F(X)$ be a monic polynomial of degree $n$. Let $X^{(k)} = (x_1^{(k)}, x_2^{(k)}, \ldots, x_n^{(k)})$ be the simultaneous approximation to all the roots at the $k$-th iteration. Then:
$$x_i^{(k+1)} = x_i^{(k)} - \frac{F(x_i^{(k)})}{\prod_{j \neq i} \left( x_i^{(k)} - x_j^{(k)} \right)}$$

· Local convergence is known. The initial guess $X^{(0)}$ is obtained from $n$ equidistributed points on a circle about the origin; improved initial guesses are used by Krishnan. See the sketch below.
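· A compact sketch of the iteration (the helper name durand_kerner and the classic starting points $(0.4 + 0.9i)^k$ are conventional choices, not from the text):

    import numpy as np

    def durand_kerner(coeffs, iters=100, tol=1e-12):
        """coeffs = [1, a_{n-1}, ..., a_0] of a monic polynomial, high to low."""
        n = len(coeffs) - 1
        x = (0.4 + 0.9j) ** np.arange(n)     # distinct points on a spiral
        for _ in range(iters):
            xn = np.array([xi - np.polyval(coeffs, xi)
                           / np.prod(xi - np.delete(x, i))
                           for i, xi in enumerate(x)])
            if np.max(np.abs(xn - x)) < tol:
                return xn
            x = xn
        return x

    print(np.sort_complex(durand_kerner([1, -6, 11, -6])))   # ~ [1, 2, 3]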

· OTHER METHODS: Bairstow's method is applicable to real polynomials when we want to avoid complex arithmetic. It extracts only quadratic or linear factors with real coefficients.


Newton Iteration

· Newton iteration is still a remarkably lively subject. Several papers of Brent (ca. 1976) give the definitive complexity-theoretic treatment. Recent work of Smale investigates the average behavior.

· Well-known: Newton is locally quadratically convergent. The Newton basin of a root is the region where Newton's method converges to that root. Friedman (1989) gave a lower bound on the radius of the basin.

· We ask another question: how close must one be to a root before the quadratic convergence behavior of Newton appears? (Call this the ``quadratic Newton basin''.) Smale gives implicit conditions for this, but it is desirable to know a priori bounds.

· THEOREM: If $A(X) \in \mathbb{Z}[X]$ is square-free then Newton will converge quadratically when the initial point is within distance
$$\delta = m^{-3m-9} \left( 2 + \|A\|_\infty \right)^{-6m}$$
of a root.

PROOF: See book.
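· The quadratic regime is easy to see numerically: the error is roughly squared at each step (a tiny illustration on $A(X) = X^2 - 2$; the example is ours):

    import numpy as np

    A = np.poly1d([1, 0, -2])       # X^2 - 2
    dA = A.deriv()
    x = 1.5
    for _ in range(5):
        x = x - A(x) / dA(x)        # Newton step
        print(abs(x - np.sqrt(2)))  # errors ~2e-3, 2e-6, 2e-12, then float precision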


A New Computational Mode

· Traditional applications of algebraic computation: as seen in Maple or Mathematica.

· We are increasingly seeing a new mode of algebraic computation that does not fit this model. What is it?

· Consider the problem of the Euclidean shortest path (ESP) between two points, amidst a collection of polygonal obstacles. Assume the input numbers are integers.

· We can use Dijkstra's algorithm, which is worst-case quadratic. Hershberger and Suri improved this to $O(n \log n)$.

· Regardless, all algorithms must make comparisons of the form $\lambda : 0$ where
$$\lambda = \sum_{i=1}^{n} n_i \sqrt{a_i}, \qquad (n_i \in \mathbb{Z},\ a_i \in \mathbb{N})$$
· Note that $\deg(\lambda) = 2^n$, and $n$ may be linear in the number of input corners. Is ESP impractical?

· Yes, because of two main characteristics of the said computational mode:
(1) Constructive: $\lambda$ is a ``constructive algebraic number'', not given as the root of a polynomial.
(2) Incremental: the numerical precision needed should be adaptive.


Requirements of the Constructive Algebraic Mode

· (1) ``Traditional'' modes of algebraic computation are monolithic, e.g., solving the Stewart platform configuration. The new mode is small and incremental (extract one square root at a time), but the ultimate buildup may be non-trivial, e.g., $\lambda$. A platform like Maple is suitable for the monolithic problems, but unsuitable for the latter.

· Most geometric algorithms that have significant algebraic components will fall under this mode. E.g., Voronoi diagrams of a set of lines in 3-space, or computing the arrangement of a set of surfaces.

Other examples are incremental geometric constructions (ruler-and-compass type). E.g., the Cinderella system.

· (2) The algebraic computation is intermixed with other, non-algebraic computation (e.g., constructing the shortest path graph). Thus, we need general programming language support, including compilers.

· (3) The precision needed for each step cannot be easily predicted in advance. Thus, the new mode of computation has to dynamically increase the precision of numbers.

· (4) Achieving exact comparison is a critical part of the correctness of our computation. This poses the problem of Constant Sign Determination (see next).

· (5) We need facilities to handle constructive algebraic numbers. There are currently 2 systems that handle constructive algebraic numbers: LEDA's real class, and our Core Library.


The Constant Sign Problem

· To support the above mode of computation, we need to solve a class of computational problems.

· Let $\Omega$ be a set of algebraic operators over $\mathbb{C}$.

NOTE: constants and variables are operators too. E.g., $\Omega = \{+, -, \sin(), 1, \pi, X_i : i \in \mathbb{N}\}$.

· $\mathrm{Expr}(\Omega)$ is the set of expressions constructed from the operators in $\Omega$.

· Assume $\Omega$ has no variables. Each $E \in \mathrm{Expr}(\Omega)$ has a unique interpretation, $I(E) \in \mathbb{C}$ or $I(E) = \uparrow$ (undefined).

· The Constant Zero Determination (CZD) problem for $\Omega$ is: given $E \in \mathrm{Expr}(\Omega)$, determine if $I(E) = 0$ (or if it is undefined).

· If the operators of $\Omega$ are real, we also have the Constant Sign Determination (CSD) problem for $\Omega$: given $E \in \mathrm{Expr}(\Omega)$, determine the sign of $I(E)$ (or if it is undefined).

· FACT: If CSD($\Omega$) is decidable, so is CZD($\Omega$).

· Expressions are assumed to be given as a DAG or a straight-line program.


Algebraic and Beyond

· Consider the following sets of operators:
$$\Omega_0 = \{+, -, \times, 1\}$$
$$\Omega_1 = \Omega_0 \cup \{\div\}$$
$$\Omega_2 = \Omega_1 \cup \{\sqrt[k]{\cdot} : k \geq 2\}$$
$$\Omega_3 = \Omega_2 \cup \{\mathrm{Root}_{k,i}(\cdot) : 1 \leq i \leq k,\ k \in \mathbb{N}\}$$
$$\Omega_4 = \Omega_0 \cup \{\pi, \sin()\}$$

· THEOREM: The problem CZD($\Omega_3$) is decidable in exponential time. When viewed as real operators, CSD($\Omega_3$) is also decidable in exponential time.

· The theorem is a consequence of constructive root bounds (see Mehlhorn's lecture in this workshop).

· OPEN PROBLEM: Is CZD($\Omega_4$) decidable?

· Discussion: Exact Geometric Computation (EGC) is possible for algebraic computation, but this is unknown for all non-algebraic cases. We could replace $\sin()$ by any other elementary function.


Incremental Complexity of Algebraic Numbers

· Since root refinement is a critical problem in the constructive mode of computation, we pose two problems in this area.

· The first problem is practical: Brent's work has shown that the asymptotic complexity of approximating algebraic numbers and of computing transcendental functions is $O(M(n) \log n)$. This work remains to be made practical.

· The second problem is more theoretical: how much does it cost to get one additional bit of information about $\alpha$?

· More precisely, given that $\alpha$ is the $i$-th largest real root in $[a,b]$, how hard is it to halve the range of uncertainty?

· Three ranges of complexity:
(1) Sturm range: we perform one ``Sturm query''. Should the cost of computing the Sturm sequence be counted? It can be amortized over this range. This stage goes from $b - a = r_n$ down to the root separation bound.
(2) Bisection range: this occurs when $[a,b]$ is an isolating interval, i.e., when $b - a$ is smaller than the root separation bound.
(3) Newton range: when $b - a$ is smaller than the radius of the ``quadratic Newton basin'' we are in this range. This is amortized complexity.

· OPEN QUESTION: How inherent are these observations?


Conclusions

· The Constructive Algebraic Mode of computation is represented by two current systems: LEDA's real number type, and our Core Library. See Project Home.

· Traditional concerns of computer algebra do not adequately address the needs of this mode.

· What are the fundamental questions to be investigated in this setting?

· How can we deal with non-algebraic computations in the face of the open question?


References

[1]
H. Cohen. A Course in Computational Algebraic Number Theory. Springer-Verlag, 1993.

[2]
A. van der Sluis. Upper bounds for roots of polynomials. Numer. Math., 15:250-262, 1970.

[3]
P. Henrici. Applied and Computational Complex Analysis. John Wiley & Sons, New York, 1974.

[4]
A. S. Householder. Numerical Treatment of a Single Non-Linear Equation. McGraw-Hill, New York, 1970.

[5]
P. S. Milne. On the algorithms and implementation of a geometric algebra system. PhD thesis, University of Bath, 1990.

[6]
V. Y. Pan. Sequential and parallel complexity of approximate evaluation of polynomial zeros. Comput. Math. Applic., 14(8):591-622, 1987.

[7]
V. Y. Pan. Solving a polynomial equation: some history and recent progress. SIAM Review, 39(2):187-220, 1997.

[8]
P. Pedersen. Counting Real Zeros. PhD thesis, Courant Institute, New York University, 1991.

[9]
A. Schönhage. Equation solving in terms of computational complexity. Proc. International Congress of Mathematicians, pages 131-153, 1986. Berkeley, California.

[10]
J. T. Schwartz and M. Sharir. On the piano movers' problem: II. General techniques for computing topological properties of real algebraic manifolds. Advances in Appl. Math., 4:298-351, 1983.

[11]
C. K. Yap. Fundamental Problems in Algorithmic Algebra. Oxford University Press, 2000. A preliminary version is available at URL ftp://cs.nyu.edu/pub/local/yap/algebra-bk.



