1. Introduction
Finding zeros of polynomial systems is a fundamental problem in scientific computing. Newton’s method is widely used to solve this problem. For a fixed approximate solution of a system, we can use the
-theory [
1,
2,
3], the interval methods or the optimization methods [
4,
5,
6,
7,
8,
9] to completely determine whether it is related to a zero of the system. However, the
-theory or the interval methods focuses mainly on a simple zero of a square system, that is, a system with
n equations and
n unknowns.
Some special certifications of a rational solution of rational polynomials with certified sum of squares decompositions are considered [
10,
11,
12,
13,
14,
15,
16].
What about singular zeros of a well-constrained polynomial system? Usually, an over-determined system which contains the same zero as a simple one is constructed by introducing new equations. The basic idea is the deflation techniques [
17,
18,
19,
20,
21,
22,
23,
24]. In some studies [
25,
26,
27,
28,
29,
30], new variables are also included. Moreover, some authors verify that a perturbed system possesses an isolated singular solution within a narrow and computed error bound. The multiplicity structures of singular zeros of a polynomial system are also studied [
18,
21,
29]. Although it is in a theoretical sense and global sense, the method in [
17] provides a sufficient condition that a zero is exactly a zero of a zero-dimensional polynomial system with rational coefficients.
For the deflation methods mentioned above, on the one hand, to be a zero of the perturbed systems does not mean being a zero of the input system considering the difference between the two systems; on the other hand, although the over-determined systems without introducing new variables have the same zeros as the input systems, the verification methods, such as the -theory or the interval methods, could not be used directly on the over-determined systems in general.
In [
31], the authors extended the
-theory from well-constrained systems to over-determined systems. A main result about Newton’s method given in their paper is their Theorem 4 [
31], which says that under the condition of
, where
,
is the Moore–Penrose inverse of the Jacobian matrix
of
g and
is an attractive fixed point for Newton’s method and, simultaneously, a strict local minimum for . However, as they stated, whether the attracting fixed points for Newton’s method are always local minima of , or the zeros of the input system, is unknown.
In this paper, we consider the problem of certifying the simple real zeros of an over-determined polynomial system. Given , we construct a new square system with . After transforming the input over-determined system into a square one, we can use both the -theory and the interval methods to certify its simple zeros. In this paper, we only consider using the interval methods to certify the simple real zeros of the over-determined system. We prove that the simple real zeros of the input system are local minima of sum of squares of the input polynomials. We also give the condition that the local minimum is a simple zero of the input system.
Let be the field of real numbers. Denote as the polynomial ring. Let be a polynomial system. Let .
The following theorem is the main result of this paper.
Theorem 1. Let () and . Then, we have:
- 1.
If is an isolated simple real zero of Σ, is a local minimum of f.
- 2.
is a simple real zero of Σ if and only if is a simple real zero of the square system , , where and r is a new variable.
In the above theorem, we get a necessary and sufficient condition to certify the simple real zeros of the input system by certifying the simple real zeros of the square system . Therefore, to certify that is a simple real zero of , the key point is verifying that .
However, it is difficult to decide numerically if a point is a zero of a polynomial. Thus, we cannot use the necessary and sufficient condition to certify the simple real zeros of by certifying the simple real zeros of .
As an alternative, we refine and certify the simple real zeros of by refining and certifying a new square system , with the interval methods and get a verified inclusion , which contains a unique simple real zero of . In fact, is a local minimum of f, which also is a necessary condition for the certification. On the one hand, if , by Theorem 1, is a simple real zero of , and then is a simple real zero of . Thus, we certified the input system . On the other hand, if , we can only assert that has a unique zero in the verified inclusion , which means we certified the system .
A big difference between this paper and our pervious work [
32] is that we do not merely consider certifying simple zeros of over-determined polynomial systems, but also consider the certification of the general isolated zeros. Specifically, as an application of our method, we give a heuristic method for certifying not only the isolated singular zeros of polynomial systems, but also the multiplicity structures of the isolated singular zeros of polynomial systems.
This paper is an extended version of the CASC’17 conference paper [
32].
The paper is organized as follows. We introduce some notations and preliminaries in the next section. In
Section 3, we give a method to show how to transform an over-determined system into a square one. The interval verification method on the obtained square system is considered in
Section 4. We give two applications of our method in
Section 5 and draw conclusions in
Section 6.
2. Preliminaries
Let be the field of complex numbers. Denote as the polynomial ring. Let be a polynomial system. Let . denote that is a zero of .
Let A be a matrix. Denote as the transpose of A and as the rank of A. Let denote the matrix whose i-th row j-th column element is .
Let
be a polynomial system. Denote
as the Jacobian matrix of
. That is,
For a polynomial , let denote , and . Denote , with .
We denote the value of a function matrix at a point as . Let denote the value of a function matrix at a point , similarly for .
Definition 1. Anisolated solutionof is a point which satisfies: Definition 2. We call an isolated solution of asingular solutionif and only if Otherwise, we call asimple solution.
Definition 3. Astationary pointof a polynomial function is a point , which satisfies: We can find the following lemma in many undergraduate textbooks about linear algebra (see Example 7 on page 224 in [
33] for example).
Lemma 1. Let be a real matrix with and . Then, the ranks of A and B are the same, especially for the case that A is of full rank.
In the following, we consider the real zeros of the systems with real coefficients. It is reasonable since, for a system (m equations and n unknowns) with complex coefficients, we can rewrite the system into a new one with equations and unknowns by splitting the unknowns and equations , where , , , and find the complex zeros of the original system by finding out the real zeros of the new system.
3. Transforming Over-determined Polynomial Systems into Square Ones
In this section, we show how to transform an over-determined polynomial system into a square one with their zeros having a one-to-one correspondence, especially for the simple zeros.
By Definition 3, we have the following lemma:
Lemma 2. Given a polynomial system (). Let and . If is an isolated real zero of , then is a stationary point of f.
Lemma 3. Let (), with . If is an isolated real zero of Σ, then we have:
- 1.
is an isolated real zero of .
- 2.
.
Proof. It is clear that is an isolated real zero of providing that is an isolated real zero of , since .
To prove the second part of this lemma, we rewrite
as follows.
where
is the transpose of a vector or a matrix
. Then,
Then, the Jacobian matrix of
is
We rewrite
where
is an
matrix which is exactly the Jacobian matrix of
, that is,
. Then, we have
By Lemma 1, the second part of the lemma is true. This ends the proof. □
Remark 1. In our construction of f and , the degrees of the polynomials almost be doubled compared to the original one. However, to evaluate the Jacobian matrix of , we evaluate the Jacobian matrix of the original system with numerical products. One can find it from Equation (4) in the above proof. In fact, to get , we only need to compute , which does not increase our actual computing degree. As a byproduct, thanks to the doubled degree of the polynomials, our final certified accuracy is also improved in Lemma 4.
The following is the proof of Theorem 1:
Proof. In fact, by fixing the real zero as an isolated simple zero in Lemma 3, we have is an isolated simple real zero of . Since is an isolated simple zero of , is a column full rank matrix. Therefore, it is easy to verify that is a positive definite matrix. Thus, is a local minimum of f and the first part of the theorem is true. Now, we consider the second part.
First, it is easy to verify that
is the real zero of
if and only if
is the real zero of
. Notice that
. Thus, with the same method as proving Lemma 3, we can compute easily that
which implies that
which means that
is of full rank if and only if
is of full rank. Thus,
is an isolated simple zero of
if and only if
is an isolated simple zero of
. The second part is true. We have finished the proof. □
From Theorem 1, we know that the simple real zeros of and are in one-to-one correspondence with the constraint that the value of the sum of squares of the polynomials in at the simple real zeros is identically zero. Thus, we can transform an over-determined polynomial system into a square system .
We show a simple example to illustrate the theorem below.
Example 1. The simple zero of the over-determined system corresponds to a simple zero of a square system , where with We can verify simply that is a simple zero of .
Although the simple real zeros of and have a one-to-one correspondence, it cannot be used directly to do certification of the simple zeros of since we cannot certify numerically. However, we can certify the zeros of as an alternative, which is a necessary condition for the certification.
We discuss it in next section.
4. Certifying Simple Zeros of Over-determined Systems
In this section, we consider certifying the over-determined system with the interval methods. We prove the same local minimum result as [
31].
The classical interval verification methods are based on the following theorem:
Theorem 2 ([
5,
6,
8,
30]).
Let be a polynomial system, , real interval vector with and real matrix be given. Let an interval matrix be given whose i-th row satisfiesDenote by I the identity matrix and assumewhere denotes the interior of X. Then, there is a unique with . Moreover, every matrix is nonsingular. In particular, the Jacobian is nonsingular. About interval matrices, there is an important property in the following theorem.
Theorem 3 ([
34]).
A symmetric interval matrix is positive definite if and only if it is regular and contains at least one positive definite matrix. Given an over-determined polynomial system
with an isolated simple real zero, we can compute a related square system
Based on Lemma 3, a simple zero of is a simple zero of . Thus, we can compute the approximate simple zero of by computing the approximate simple zero of . Using Newton’s method, we can refine these approximate simple zeros with quadratic convergence to a relative higher accuracy. Then, we can certify them with the interval method mentioned before and get a verified inclusion , which possesses a unique certified simple zero of the system by Theorem 2, denoting as .
However, even though we get a certified zero of the system , considering Lemma 2, we cannot say is a zero of the input system . Because the certified zero is just a stationary point of f. Considering Theorem 1 and the difference between and , we have the following theorem.
Theorem 4. Let Σ, , , f, and the interval be given as above. Then, we have:
- 1.
is a local minimum of f.
- 2.
There exists a verified inclusion that possesses a unique simple zero of the system . Especially, if , the verified inclusion possesses a unique simple zero of the input system Σ.
Proof. First, it is easy to see that computing the value of the matrix at the interval will give a symmetric interval matrix, denoting as . By Theorem 2, we know that, for every matrix , M is nonsingular. Therefore, the interval matrix is regular. Especially, the matrix , which is the Hessian matrix of f, is full rank and, therefore, is positive definite. Thus, is a local minimum of f. By Theorem 1, we know that is positive definite. Thus, for every point , is a positive definite matrix. Considering Theorem 2, it is trivial that, for the verified inclusion , there exists a unique simple zero of the system . If , by Theorem 1, the verified inclusion of the system is a verified inclusion of the original system . □
Remark 2. 1. In the above proof, we know that, for every point , is a positive definite matrix.
2. By Theorem 2, we know that there is a unique with . However, we could not know what the exact is. According to the usual practice, in actual computation, we take the midpoint of the inclusion as and verify whether . Considering the uniqueness of in , therefore, if , we are sure that the verified inclusion possesses a unique simple zero of the input system Σ. If , we can only claim that there is a local minimum of f in the inclusion and is a verified inclusion for the system .
Considering the expression of and f and for the midpoint of , we have a trivial result below.
Lemma 4. Denote . Under the conditions of Theorem 4, we have .
Based on the above idea, we give an algorithm below. In the verification steps, we apply algorithm
in INTLAB [
30], which is based on Theorem 2, to compute a verified inclusion
for the related square system
. For simplicity, denote the interval
and the midpoint of
as
.
The correctness and the termination of the algorithm is obvious by the above analysis.
We give two examples to illustrate our algorithm.
Example 2. Continue Example 1. Given an approximate zero , using Newton’s method, we get a higher accuracy approximate zero Compute and . After applying algorithm on , we have a verified inclusion: Based on Theorem 2, we know that there exists a unique , such that .
Let . By Theorem 1, we can certify the simple zero of Σ by certifying the simple zero of theoretically. Considering the difference between and , we check first whether the value of f at some point in the interval is zero. According to the usual practice, we consider the midpoint of , which equals and, further, is zero. Therefore, we are sure that there exists a unique , s.t. and, then, there exists a unique simple zero of the input system Σ, which means we certified the input system Σ.
Example 3. Let be an over-determined system. Consider an approximate zero Using Newton’s method, we get a higher accuracy zero After applying algorithm on , we have a verified inclusion: ).
Similarly, based on Theorem 2, we know that there exists a unique , such that .
Proceeding as in the above example, we consider the midpoint of and compute . Thus, by Theorem 4, we get a verified inclusion , which contains a unique simple zero of the system . It means that may contain a zero of Σ. Even if does not contain a zero of Σ, it contains a local minimum of f, which has a minimum value no larger than .
5. Two Applications
As an application, we consider certifying isolated singular zeros of over-determined systems heuristically. Generally, dealing with the multiple zeros of polynomial systems directly is difficult. The classical method to deal with the isolated singular zeros of polynomial systems is the deflation technique, which constructs a new system owning the same singular zero as an isolated simple one. Although the deflation method can be used to refine or verify the isolated zero of the original system, it is a pity that the multiplicity information of the isolated zero is missed. In this section, as an application of the method of converting an over-determined system into a square system in previous section, we give a heuristic method for certifying isolated singular zeros of polynomial systems and their multiplicity structures.
5.1. Certifying Isolated Singular Zeros of Polynomial Systems
Recently, Cheng et al. [
35] proposed a new deflation method to reduce the multiplicity of an isolated singular zero of a polynomial system to get a final system, which owns the isolated singular zero of the input system as a simple one. Different from the previous deflation methods, they considered the deflation of isolated singular zeros of polynomial systems from the perspective of linear combination.
In this section, we first give a brief introduction of their deflation method and, then, show how our method is applied to certify the isolated singular zeros of the input system in a heuristic way.
Definition 4. Let , and a tolerance , s.t. . We say f is-singularat if Otherwise, we say f is-regularat .
Let
be a polynomial system.
is an approximate isolated zero of
. Consider a tolerance
. First, we can compute the polynomials of all
, which is
-regular at the approximate zero
. That is, we compute a polynomial set
Then, put
and
together and compute a subsystem
, whose Jacobian matrix at
has a maximal rank
s. If
, we get the final system
. Otherwise, we choose a new polynomial
and compute
where
are new introduced variables. Next, we check if
If Equation (
6) holds, we get the final system
. Otherwise, let
and repeat again until Equation (
6) holds.
Now, we give an example to illustrate the above idea.
Example 4. Consider a polynomial system . Consider an approximate singular zeroof and the tolerance . First, we have the Taylor expansion of at : Consider the tolerance . Sincewe get a polynomialwhich is θ-regular at . Similarly, by the Taylor expansion of at , we have that are all θ-regular at . We can choosefrom . To , let By solving a Least Square problem:we get an approximate value: Then, computeand we get a polynomial setwhich satisfies Thus, we get the final system .
In the above example, given a polynomial system with an isolated singular zero , by computing the derivatives of the input polynomials directly or the linear combinations of the related polynomials, we compute a new system , which has a simple zero. However, generally, the final system does not contain all . Thus, to ensure that the simple zero or parts of the simple zero of the square system really correspond to the isolated singular zero of the original system, we put and together and consider certifying the over-determined system in the following.
Example 5. Continue with Example 4. we put and together and get the over-determined system . According to our method in Section 4, let After applying algorithm on at , we have a verified inclusion: By Theorem 2, we affirm that there is a unique isolated simple zero , s.t. .
Next, as in Examples 2 and 3, we consider the midpoint of and compute . Thus, by Theorem 4, we get a verified inclusion , which contains a unique simple zero of the system . It means that may contain a zero of Σ. Even if does not contain a zero of Σ, it contains a local minimum of f, which has a minimum value no larger than .
In the above example, we get the verified inclusion of the system . Noticing that , according to Theorem 4, we are not sure if the verified inclusion contains a unique simple zero of the system . While, considering the value of is very small, under certain numerical tolerance condition(for example ), we can deem that the verified inclusion contains a simple zero of the system . That is, we certified the over-determined system and further certified the original system .
5.2. Certifying the Multiplicity Structures of Isolated Singular Zeros of Polynomial Systems
In recent years, Mourrain et al. [
21,
36] proposed a new deflation method, which can be used to refine the accuracy of an isolated singular zero and the parameters introduced simultaneously and, moreover, the parameters can describe the multiplicity structure at the zero. They also proved that the number of equations and variables in this deflation method depend polynomially on the number of variables and equations of the input system and the multiplicity of the singular zero. However, although they also showed that the isolated simple zeros of the extended polynomial system correspond to zeros of the input system, the extended system is usually an over-determined system. Therefore, the problem of knowing the multiplicity structure of the isolated singular zero exactly becomes the problem of solving or certifying the isolated simple zero of the over-determined system.
In this section, we first give a brief introduction of their deflation method and, then, show how our method is applied to certify the multiplicity structure of the isolated singular zero of the input system heuristically.
Let . Let be an isolated multiple zero of . Let , be the maximal ideal at and Q be the primary component of I at so that .
Consider the ring of power series
and we use the notation for
:
The deflation method based on the orthogonal primal-dual pairs of bases for the space and its dual , which is illustrated in the following lemma.
Lemma 5. Let be as in the above and δ be the multiplicity of at . Then, there exists a primal-dual basis pair of the local ring with the following properties:
- 1.
(a) The primal basis of the local ring has the form We can assume that and that the monomials in B are connected to 1. Define the set of exponents in B - 2.
The unique dual basis orthogonal to B has the form:
The above lemma says that, once given a primal basis B of the local ring , there exists a unique dual basis , which can be used to determine the multiplicity structure of in and further the multiplicity of , orthogonal to B. Based on the known primal basis B, Mourrain et.al constructed the following parametric multiplication matrices, which can be used to determine the coefficients of the dual basis .
Definition 5. Let B as defined in Lemma 5 and denote the exponents in B by as above. Letwith and we denote . We define an array μ of length consisting of 0s, 1s and the variables as follows: for all and the corresponding entry is The parametric multiplication matrices corresponding to E are defined for by Definition 6. (Parametric normal form). Let be a field. We definewhere is the coefficient vector of 1 in the basis B. Based on the above lemma and definitions, the multiplicity structure are characterized by polynomial equations in the following theorem.
Theorem 5 ([
36]).
Let be any field, , and let be an isolated zero of . Let Q be the primary ideal at and assume that B is a basis for satisfying the conditions of Lemma 5. Let be as in Lemma 5 and for be the parametric multiplication matrices corresponding to E as in Definition 5 and be the parametric form as in Definition 6. Then, is an isolated zero with multiplicity one of the polynomial system in : The second part of Equation (
7) gives a pairwise commutation relationship of the parametric multiplication matrices. Moreover, Theorem 5 makes sure that Equation (
7) has an isolated zero
of multiplicity one. Thus, it can be used to deflate the isolated zero
of the input system
and simultaneously determine the multiplicity structure of
.
Now, we show an example to illustrate how their method works.
Example 6. Let be a polynomial system with a three-fold isolated zero . Given the primal basis , which satisfies the properties of Lemma 5, we can compute the parametric multiplication matrices: Thus, Equation (7) generates the following polynomials: - 1.
gives the polynomials .
- 2.
gives the polynomials .
- 3.
gives the polynomial .
Furthermore, Theorem 5 promises that is an isolated zero with multiplicity one of the system .
On the one hand, from the above example, we can see that given a polynomial system
with an isolated zero
, by Theorem 5, we will get an extended system
, which owns an isolated zero
with multiplicity one. Moreover, by Lemma 5, we have the dual basis
which corresponds to the primal basis
.
On the other hand, it is not hard to see that Equation (
7) defined in Theorem 5 usually gives an over-determined extended system
. Once given an approximate zero
, similar to in Corollary 4.12 in [
29], we can use random linear combinations of the polynomials in
to produce a square system, which has a simple zero at
with high probability. Furthermore, Newton’s method can be used on this square system to refine
to a higher accuracy. However, this operation can only return an approximate multiplicity structure of the input system
with a higher accuracy. Next, we consider employing our certification method to certify the multiplicity structure of
.
Example 7. Continue to consider Example 6. Let . Given an approximate zero By Algorithm 1, with Newton’s method, we get a higher accuracy zero After applying algorithm on at , we have a verified inclusion: Based on Theorem 2, we know that there exists a unique , s.t. .
Similarly, as in Examples 2 and 3, we consider the midpoint of and compute . Thus, by Theorem 4, we are sure that there exists a unique simple zero of the input system Σ in the interval , which means we certified the input system Σ.
According to the analysis in the above example, we know that, after applying our Algorithm 1 on the extended system , we get a verified inclusion , which possesses a unique simple zero of . Noticing that the values of the variables in determine the coefficients of the dual basis , thus, certifying the extended system means certifying the multiplicity structure of the input system at . Thus, by Theorem 4, as long as , we are sure that we certified not only the isolated singular zero of the input system , but also its multiplicity structure.
Algorithm 1: verifying a simple zero of a polynomial system. |
Input: an over-determined polynomial system and an approximate simple zero . Output: a verified inclusion and a small non-negative number.
- 1:
Compute f and ; - 2:
Compute ; - 3:
Compute and ; - 4:
if, then - 5:
return ; - 6:
else - 7:
return . - 8:
end if
|
6. Conclusions
In this paper, we introduce the following two main contributions. First, we consider certifying the simple zeros of over-determined systems. By transforming the given over-determined system into a square one, we prove a necessary and sufficient condition to certify the simple real zeros of the over-determined system by certifying the simple real zeros of the square system . However, noting that deciding numerically if a point is a zero of a polynomial is difficult, we refine and certify the simple real zeros of by refining and certifying a new square system with the interval methods and get a verified inclusion , which contains a unique simple real zero of . In fact, is a local minimum of f, which is also a necessary condition for the certification. By the necessary and sufficient condition in Theorem 1, we know that, as long as , we can say that is a simple real zero of and we certified the input system .
Second, based on our work [
35] and the work of Mourrain et al. [
21,
36], as an application of our method, we give a heuristic method for certifying not only the isolated singular zeros of polynomial systems, but also the multiplicity structures of the isolated singular zeros of polynomial systems.
In the future, for the certified zero , trying to give a sufficient condition for the certification is the direction of our effort.