The Nearest Zero Eigenvector of a Weakly Symmetric Tensor from a Given Point

Pearson, Kelly; Zhang, Tan

doi:10.3390/math12050705


by

Kelly Pearson

and

Tan Zhang

^*

Department of Mathematics and Statistics, Murray State University, Murray, KY 42071, USA

^*

Author to whom correspondence should be addressed.

Mathematics 2024, 12(5), 705; https://doi.org/10.3390/math12050705

Submission received: 13 December 2023 / Revised: 12 February 2024 / Accepted: 26 February 2024 / Published: 28 February 2024

(This article belongs to the Special Issue Spectral Theory of Tensors, Tensor (Rank) Decompositions, and Their Applications)


Abstract

We begin with a degree m real homogeneous polynomial in n indeterminates and bound the distance from a given n-dimensional real vector to the real vanishing of the homogeneous polynomial. We then apply these bounds to the real homogeneous polynomial associated with a nonzero m-order n-dimensional weakly symmetric tensor which has zero as an eigenvalue. We provide “nested spheres” conditions to bound the distance from a given n-dimensional real vector to the nearest zero eigenvector.

Keywords:

tensor eigenvalues; higher order tensor

MSC:

15A18; 15A69

1. Introduction

Let

R

be the real field, and we consider an m-order n-dimensional tensor

A

consisting of

n^{m}

entries in

R

:

A = (a_{i_{1} \dots i_{m}}), a_{i_{1} \dots i_{m}} \in R, 1 \leq i_{1}, \dots, i_{m} \leq n .

We denote the space of all m-order n-dimensional real tensors by

R^{[m, n]}

.

To an n-vector

x = (x_{1}, \dots, x_{n})

, real or complex, we define the n-vector:

A x^{m - 1} : = {(\sum_{i_{2}, \dots, i_{m} = 1}^{n} a_{i i_{2} \dots i_{m}} x_{i_{2}} \dots x_{i_{m}})}_{1 \leq i \leq n} .

We denote the n-vector

x^{[m - 1]} : = (x_{1}^{m - 1}, \dots, x_{n}^{m - 1})

.

The following were first introduced and studied by Qi and Lim [1,2,3,4].

Definition 1.

Let

A \in R^{[m, n]}

. A pair

(λ, x) \in C \times (C^{n} ∖ {0})

is called an eigenvalue–eigenvector (or simply eigenpair) of

A

if they satisfy the equation

A x^{m - 1} = λ x^{[m - 1]} .

(1)

We call

(λ, x)

an H-eigenpair if they are both real.

Definition 2.

Let

A \in R^{[m, n]}

. A pair

(λ, x) \in C \times (C^{n} ∖ {0})

is called an E-eigenvalue and E-eigenvector (or simply E-eigenpair) of

A

if they satisfy the equation

\begin{cases} A x^{m - 1} = λ x, \\ x^{⊤} x = 1. \end{cases}

(2)

We call

(λ, x)

a Z-eigenpair if they are both real.

The notion of weakly symmetric tensors was first introduced in [5].

Definition 3.

A \in R^{[m, n]}

is called weakly symmetric if the associated homogeneous polynomial

f_{A} (x) : = \sum_{i_{1}, i_{2}, \dots, i_{m} = 1}^{n} a_{i_{1} i_{2} \dots i_{m}} x_{i_{1}} x_{i_{2}} \dots x_{i_{m}}

(3)

satisfies

\nabla f_{A} (x) = m A x^{m - 1}

. In the tensor notation, according to [2], the homogeneous polynomial

f_{A} (x)

is also denoted by

A x^{m}

.

Although this definition is not as intuitive as symmetric tensors, it nevertheless provides the same desired variational (extremal) property as symmetric tensors. It should also be noted that, for

m = 2

, symmetric matrices and weakly symmetric matrices coincide. However, it is shown in [5] that a symmetric tensor is necessarily weakly symmetric for

m > 2

, but the converse is not true in general. Furthermore, if

A \in R^{[m, n]}

is weakly symmetric, by homogeneity, it satisfies the familiar Euler’s identity:

A x^{m} = f_{A} (x) = \frac{1}{m} 〈 \nabla f_{A} (x), x 〉 = 〈 A x^{m - 1}, x 〉,

(4)

where

〈 \cdot, \cdot 〉

denotes the standard inner product on

R^{n}

.
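The defining relation of weak symmetry can be checked numerically on small examples. The following is a minimal sketch, not part of the original development: it stores a tensor as a dictionary from 0-based index tuples to entries (a storage choice assumed here), and the helper names `tensor_apply`, `grad_f`, `symmetrize`, and `is_weakly_symmetric_at` are hypothetical. It tests the identity \nabla f_{A} (x) = m A x^{m - 1} at a single point only, so a passing check is evidence rather than a proof.

```python
from fractions import Fraction
from itertools import permutations


def tensor_apply(A, x):
    """Return the vector A x^{m-1} for a tensor stored as {index tuple: entry}."""
    out = [0] * len(x)
    for idx, a in A.items():
        term = a
        for j in idx[1:]:
            term *= x[j]
        out[idx[0]] += term
    return out


def grad_f(A, x):
    """Return the exact gradient of f_A(x) = sum a_{i1...im} x_{i1} ... x_{im}."""
    g = [0] * len(x)
    for idx, a in A.items():
        for pos in range(len(idx)):      # product rule: drop one factor at a time
            term = a
            for q, j in enumerate(idx):
                if q != pos:
                    term *= x[j]
            g[idx[pos]] += term
    return g


def symmetrize(entries):
    """Spread each given coefficient evenly over all orderings of its index tuple."""
    A = {}
    for idx, a in entries.items():
        orderings = set(permutations(idx))
        for p in orderings:
            A[p] = Fraction(a, len(orderings))
    return A


def is_weakly_symmetric_at(A, x):
    """Check grad f_A(x) == m * A x^{m-1} at the single point x."""
    m = len(next(iter(A)))
    return grad_f(A, x) == [m * v for v in tensor_apply(A, x)]


# symmetric tensor of f(x0, x1) = x0^3 + 6 x0^2 x1 - 3 x0 x1^2 + 2 x1^3
A_sym = symmetrize({(0, 0, 0): 1, (0, 0, 1): 6, (0, 1, 1): -3, (1, 1, 1): 2})
# a tensor with the single nonzero entry a_{001} = 1, which is not weakly symmetric
A_bad = {(0, 0, 1): 1}
print(is_weakly_symmetric_at(A_sym, [2, -1]), is_weakly_symmetric_at(A_bad, [1, 1]))
```

Exact rational arithmetic is used so that the comparison of the two vectors is not affected by rounding.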

Both H-eigenvalues and Z-eigenvalues of a given tensor have found numerous applications in numerical multilinear algebra, image processing, higher order Markov chains, and spectral hypergraph theory. In particular, it is a well-known fact (e.g., [2]) that the extremal Z-eigenvalues correspond to the constrained extremal values of

f_{A} (x)

on the unit sphere

S^{n - 1}

. However, most theoretical and numerical developments [6,7,8,9,10,11] have been dedicated to finding the extremal eigenvalues and eigenvectors, and very little attention has been given to the zero eigenvalue and its eigenvectors. The zero eigenvectors should not be overlooked, since positive semi-definite (PSD) tensors must take on zero eigenpairs. From a practical standpoint, for large values of m or n, finding real solutions to a high-degree multivariate polynomial system may not be feasible. In particular, as the degree m increases, even solving a single multivariate polynomial equation becomes both time-consuming and costly. With this in mind, we endeavor to provide reasonable upper and lower bounds on the distance from a given initial point

e \in R^{n} ∖ {0}

with

f_{A} (e) \neq 0

to the nearest zero eigenvector of

A

without using high-power computer software.

Throughout the paper, we shall always assume our tensor is nonzero and weakly symmetric. Our paper is organized as follows. In Section 2, we begin by considering a more general problem concerning the real vanishing

V_{R} (f) = f^{- 1} (0)

of a degree m real homogeneous polynomial f in n indeterminates. We provide a lower bound on the distance from a given point e outside

V_{R} (f)

to

V_{R} (f)

. This lower bound is completely determined by the combinatorial nature of the coefficients of f itself. In Section 3, we establish an upper bound, based on the analytic and algebraic nature of f, on the distance from a given initial point e outside

V_{R} (f)

to

V_{R} (f)

. In Section 4, we establish the connection between the real zeros of the associated homogeneous polynomial

f_{A}

and the zero eigenvectors of a nonzero m-order n-dimensional weakly symmetric tensor

A

. We first examine the basic topological structure of

V_{R} (f_{A})

as well as the critical point set

Z (f_{A})

. We then provide both upper and lower bounds on the distance from a given initial point e with

f_{A} (e) \neq 0

to the nearest zero Z-eigenvector. In Section 5, we give a variety of examples to demonstrate how the upper and lower bounds work.

2. Lower Bound for the Distance to the Real Vanishing

For simplicity, we shall only work with real homogeneous polynomials. We first establish some notational convention, which will be used throughout the rest of this paper. We denote the standard Euclidean norm on

R^{n}

by

\| \cdot \|_{2}

, the standard unit ball in

R^{n}

by

D^{n}

, and the standard unit sphere by

S^{n - 1}

, i.e.,

D^{n} : = \{x \in R^{n} : \| x \|_{2} \leq 1\} \quad \text{and} \quad S^{n - 1} : = \{x \in R^{n} : \| x \|_{2} = 1\} .

Let

d \geq 1

be an integer. We denote by

R {[x_{1}, \dots, x_{n}]}_{d}

the set of all real homogeneous polynomials of degree d in the indeterminates

x = (x_{1}, \dots, x_{n})

. Let

f \in R {[x_{1}, \dots, x_{n}]}_{d}

. Since

f : R^{n} \to R

is continuous, we denote the uniform norm of f on

S^{n - 1}

by

\| f \|_{\infty} : = \max_{x \in S^{n - 1}} | f (x) |

. Furthermore, we denote

V_{R} (f) : = f^{- 1} (0) = {x \in R^{n} : f (x) = 0}

to be the real vanishing of f, which is always a closed subset of

R^{n}

. The goal of this section is to bound the distance from a point outside

V_{R} (f)

to

V_{R} (f)

from below.

Lemma 1.

Let

f (x) \in R {[x_{1}, \dots, x_{n}]}_{d}

with

d \geq 1

. Then, there exists a constant

N (f) > 0

such that

| f (x) | \leq (\binom{d + n - 1}{n - 1}) \cdot N (f), \forall x \in D^{n} .

Namely,

{| | f | |}_{\infty} \leq (\binom{d + n - 1}{n - 1}) \cdot N (f)

.

Proof.

Let

ν = (ν_{1}, \dots, ν_{n})

with

0 \leq ν_{i} \leq d

be a multi-index such that

| ν | = ν_{1} + \dots + ν_{n} = d

. We write

f (x) = \sum_{ν = (ν_{1}, \dots, ν_{n}), | ν | = d} A_{ν} x_{1}^{ν_{1}} \dots x_{n}^{ν_{n}},

in terms of different monomials. Let

N (f) : = max_{ν, | ν | = d} | A_{ν} |

be the largest monomial coefficient in absolute value. Since there are at most

(\binom{d + n - 1}{n - 1})

distinct monomials of degree d in

f (x)

, each of which has absolute value at most 1 on D^{n}, our assertion follows. □
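The bound of Lemma 1 is straightforward to evaluate from the coefficients alone. The sketch below is an illustration under assumptions that are not in the original: a polynomial is stored as a dictionary from exponent tuples to coefficients, the helper names `eval_poly` and `lemma1_bound` are hypothetical, and the sup norm on the unit circle is only estimated by sampling.

```python
from math import comb, cos, sin, pi


def eval_poly(coeffs, x):
    """Evaluate a polynomial stored as {exponent tuple: coefficient} at x."""
    total = 0.0
    for nu, a in coeffs.items():
        term = a
        for xi, k in zip(x, nu):
            term *= xi ** k
        total += term
    return total


def lemma1_bound(coeffs):
    """Return binom(d+n-1, n-1) * N(f), with N(f) the largest |coefficient|."""
    nu = next(iter(coeffs))
    n, d = len(nu), sum(nu)
    N = max(abs(a) for a in coeffs.values())
    return comb(d + n - 1, n - 1) * N


# f(x1, x2) = x1^4 - 2 x1^2 x2^2 + x2^4, chosen here purely for illustration
f = {(4, 0): 1, (2, 2): -2, (0, 4): 1}
bound = lemma1_bound(f)

# crude estimate of the sup norm on the unit circle by sampling 3600 angles
sampled_max = max(abs(eval_poly(f, (cos(t), sin(t))))
                  for t in (2 * pi * k / 3600 for k in range(3600)))
print(bound, sampled_max)
```

For this polynomial the coefficient bound equals 10 while the sampled maximum is close to 1, which suggests that the estimate of Lemma 1 can be far from tight.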

Lemma 2.

Let

f (x) \in R {[x_{1}, \dots, x_{n}]}_{d}

with

d \geq 1

. Then, there exists a constant

C_{d, n} (f) > 0

such that for all x,

y \in D^{n}

,

| f (x) - f (y) | \leq C_{d, n} (f) \cdot {| | x - y | |}_{2} .

Proof.

Let

x, y \in D^{n}

. Since

D^{n}

is convex, by the mean value theorem, there exists a point c along the line segment

t x + (1 - t) y

for

0 \leq t \leq 1

, joining x and y such that

f (x) - f (y) = 〈 \nabla f (c), x - y 〉 .

By the Cauchy–Schwarz inequality, we have:

| f (x) - f (y) | \leq {| | \nabla f (c) | |}_{2} \cdot {| | x - y | |}_{2} .

We now compute:

\begin{aligned} | \frac{\partial}{\partial x_{i}} f (x) | & = | \sum_{ν = (ν_{1}, \dots, ν_{n}), | ν | = d} A_{ν} (\frac{\partial}{\partial x_{i}} x_{1}^{ν_{1}} \dots x_{n}^{ν_{n}}) | \\ & \leq \sum_{ν = (ν_{1}, \dots, ν_{n}), | ν | = d} d \cdot N (f) \\ & = d \cdot \binom{d + n - 1}{n - 1} \cdot N (f) . \end{aligned}

Setting

C_{d, n} (f) : = d \cdot \sqrt{n} \cdot \binom{d + n - 1}{n - 1} \cdot N (f)

, it follows that for all

x \in D^{n}

,

\| \nabla f (x) \|_{2} \leq C_{d, n} (f)

. Since the point c lies in D^{n}, our assertion now follows. □

Let

f (x) \in R {[x_{1}, \dots, x_{n}]}_{d}

. Let

e \in S^{n - 1}

be such that

f (e) \neq 0

. Suppose

V_{R} (f) \neq {0}

. Then, for any

y \in S^{n - 1} \cap V_{R} (f)

, we have f (y) = 0, so Lemma 2 yields that

\| e - y \|_{2} \geq \frac{| f (e) |}{C_{d, n} (f)} = \frac{| f (e) |}{d \cdot \sqrt{n} \cdot \binom{d + n - 1}{n - 1} \cdot N (f)} .

Since

S^{n - 1} \cap V_{R} (f)

is closed and bounded, it is compact; hence, we can define

d (e, V_{R} (f)) : = \min_{y \in S^{n - 1} \cap V_{R} (f)} \| e - y \|_{2}

as the Euclidean distance from e to the unit-length points of

V_{R} (f)

. Setting

d_{2} (e) : = \frac{| f (e) |}{d \cdot \sqrt{n} \cdot \binom{d + n - 1}{n - 1} \cdot N (f)}

, we have:

Theorem 1.

Let

f (x) \in R {[x_{1}, \dots, x_{n}]}_{d}

. Let

e \in S^{n - 1}

be such that

f (e) \neq 0

. Assume that

V_{R} (f) \neq {0}

, then

d (e, V_{R} (f)) \geq d_{2} (e)

.
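Since the lower bound of Theorem 1 depends only on the degree, the number of variables, the largest coefficient, and the value f (e), it can be evaluated with a few lines of code. The following sketch assumes a dictionary representation of f (exponent tuple to coefficient) and uses the hypothetical helper names `eval_poly` and `lower_bound_d2`; the test polynomial is the one that appears later in Example 8.

```python
from math import comb, sqrt


def eval_poly(coeffs, x):
    """Evaluate a polynomial stored as {exponent tuple: coefficient} at x."""
    total = 0.0
    for nu, a in coeffs.items():
        term = a
        for xi, k in zip(x, nu):
            term *= xi ** k
        total += term
    return total


def lower_bound_d2(coeffs, e):
    """d_2(e) = |f(e)| / (d * sqrt(n) * binom(d+n-1, n-1) * N(f))."""
    nu = next(iter(coeffs))
    n, d = len(nu), sum(nu)
    N = max(abs(a) for a in coeffs.values())   # largest |coefficient|
    return abs(eval_poly(coeffs, e)) / (d * sqrt(n) * comb(d + n - 1, n - 1) * N)


# f(x1, x2) = (x1^2 - x2^2)^2 with e = e_1 = (1, 0)
f = {(4, 0): 1, (2, 2): -2, (0, 4): 1}
d2 = lower_bound_d2(f, (1.0, 0.0))            # equals 1 / (40 * sqrt(2)), about 0.0177

# distance from e_1 to the unit zero (1/sqrt(2), 1/sqrt(2)) of f, for comparison
chord = sqrt((1 - 1 / sqrt(2)) ** 2 + (1 / sqrt(2)) ** 2)
print(d2, chord)
```

The comparison with the actual chord length (about 0.7654) shows that the bound holds with considerable slack in this instance.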

3. Upper Bound for the Distance to the Real Vanishing

In this section, we endeavor to establish a nontrivial upper bound for the distance to

V_{R} (f)

from a given point. Fix

e \in S^{n - 1} ∖ V_{R} (f)

and let

x \in V_{R} (f) \cap S^{n - 1}

. Since

x \neq \pm e \in S^{n - 1}

, there exists a unique geodesic (a great circle

S^{1}

) on

S^{n - 1}

joining e and x on

S^{n - 1}

. From differential geometry, we know a geodesic is distance minimizing from a given point e until reaching its conjugate point, which in this case, is the antipodal point

- e \neq x

. This means that the arc length of the great circle joining x to e is the spherical distance between them. Consequently, the two distinct lines

[e]

and

[x]

can be at most

π / 2

-spherical distance apart, so projectively speaking, the Euclidean distance between

[e]

and

[x]

is at most

\sqrt{2}

, which is a trivial upper bound.

Establishing a nontrivial upper bound is, perhaps surprisingly, a much more challenging task. It requires additional tools from a special class of polynomials, known as hyperbolic polynomials. The existing literature on both real stable and hyperbolic polynomials is vast. For a more in-depth reading on this topic, we refer the interested reader to [12,13]. To keep the exposition self-contained, we introduce the following definitions.

Definition 4.

A nonzero polynomial

p (x) \in R [x_{1}, \dots, x_{n}]

is called real stable if it has no zeros in

H^{n} : = {z \in C : Im (z) > 0}^{n}

, i.e.,

\forall i, Im (x_{i}) > 0 ⟹ p (x_{1}, \dots, x_{n}) \neq 0 .

It is a well-known fact that a nonzero polynomial

p (x) \in R [x_{1}, \dots, x_{n}]

is real stable if and only if, for all

x \in R^{n}

and

e \in R_{> 0}^{n}

, the polynomial

p (x + t e) \in R [t]

is real rooted.

Definition 5.

A degree d homogeneous polynomial

p (x) \in R {[x_{1}, \dots, x_{n}]}_{d}

is called hyperbolic in direction

e \in R^{n}

if

p (e) \neq 0

and the univariate polynomial

t \mapsto p (x + t e) \in R [t]

for every

x \in R^{n}

is real rooted, i.e., it has only real zeros.

The study of hyperbolic polynomials dates back to the time of Gårding and Hurwitz and has since played a vital role in hyperbolic programming [12,13]. To help visualize the notion of hyperbolicity, by considering the restriction

p (x + t e)

of

p (x)

on the line originating from x parallel to the fixed direction e, we insist that

p (x + t e) \in R [t]

is real rooted.

Some of the most noteworthy examples of hyperbolic polynomials are as follows:

Example 1.

The Lorentzian quadratic

p (x_{1}, x_{2}, \dots, x_{n}) : = x_{1}^{2} - x_{2}^{2} - \dots - x_{n}^{2}

is hyperbolic in direction

e = (1, 0, \dots, 0)

.
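For a quadratic form, Definition 5 reduces to a discriminant condition on the restriction t ↦ p (x + t e), which makes Example 1 easy to probe numerically. The sketch below is only an illustration: the function names `lorentzian` and `restriction_discriminant` are hypothetical, integer test points are assumed so that the arithmetic is exact, and a finite random sample can refute hyperbolicity but cannot prove it.

```python
import random


def lorentzian(x):
    """p(x) = x1^2 - x2^2 - ... - xn^2."""
    return x[0] ** 2 - sum(xi ** 2 for xi in x[1:])


def restriction_discriminant(p, x, e):
    """Discriminant of the quadratic t -> p(x + t e), recovered from three values.

    Assumes p is a quadratic form and that x, e have integer entries, so the
    integer divisions below are exact.
    """
    g = lambda t: p([xi + t * ei for xi, ei in zip(x, e)])
    c = g(0)                          # constant coefficient
    a = (g(1) + g(-1)) // 2 - c       # leading coefficient
    b = (g(1) - g(-1)) // 2           # linear coefficient
    return b * b - 4 * a * c


random.seed(0)
points = [[random.randint(-5, 5) for _ in range(3)] for _ in range(200)]

# direction e_1: every sampled restriction has a nonnegative discriminant
all_real_e1 = all(restriction_discriminant(lorentzian, x, [1, 0, 0]) >= 0
                  for x in points)

# direction e_2: the restriction through x = (0, 0, 1) is -t^2 - 1, with no real root
disc_e2 = restriction_discriminant(lorentzian, [0, 0, 1], [0, 1, 0])
print(all_real_e1, disc_e2)
```

The second check shows that the choice of direction matters: the same quadratic fails the real-rootedness requirement in direction e_2 even though p (e_2) is nonzero.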

Example 2.

Let

1 \leq k \leq n

. The degree k elementary symmetric polynomial

σ_{k} (x_{1}, \dots, x_{n}) : = \sum_{1 \leq i_{1} < \dots < i_{k} \leq n} x_{i_{1}} \dots x_{i_{k}}

is hyperbolic in direction

e = (1, \dots, 1)

.

Example 3.

Let

ℓ_{i} (x) \in R {[x_{1}, \dots, x_{n}]}_{1}

for

1 \leq i \leq d

be linear forms. Then their product

p (x) = ℓ_{1} (x) \dots ℓ_{d} (x)

is hyperbolic in direction e as long as e is not a zero of any

ℓ_{i} (x)

.

Example 4.

Let

{Sym}_{n} (R)

denote the real vector space of all

n \times n

real symmetric matrices. The determinant function

det : {Sym}_{n} (R) \to R

is hyperbolic in direction

e = I_{n}

, the

n \times n

identity matrix.

Example 5.

Let

G = (V, E)

be a finite graph, then the matching polynomial of G is real stable, and hence hyperbolic in any direction

e \neq 0

.

A very important property of hyperbolic polynomials states that, if

p, q \in R [x_{1}, \dots, x_{n}]

are both hyperbolic in direction e, then so is their product

p \cdot q

. Moreover, an algorithm in polynomial time, based on Newton’s identities, can be used to check the real rootedness of a given polynomial due to the following result:

Theorem (Hermite-Sylvester). A polynomial

p (t) = \prod_{k = 1}^{n} (t - λ_{k})

is real rooted if and only if the

n \times n

Hermitian matrix H with

H_{i j} = \sum_{k = 1}^{n} λ_{k}^{i + j - 2}

is a positive semi-definite (PSD) matrix.
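The Hermite-Sylvester criterion can be applied without computing any roots, because the power sums of the roots follow from the coefficients via Newton's identities. The following is a minimal sketch under stated assumptions: coefficients are given from the leading one down, exact rational arithmetic is used, and positive semi-definiteness is tested through all principal minors, which is practical only for small degree. The function names `power_sums`, `det`, and `is_real_rooted` are hypothetical and this is not the polynomial-time algorithm referred to above.

```python
from fractions import Fraction
from itertools import combinations


def power_sums(coeffs, count):
    """Power sums p_0, ..., p_{count-1} of the roots, via Newton's identities."""
    lead = Fraction(coeffs[0])
    c = [Fraction(a) / lead for a in coeffs]      # monic normalization
    n = len(c) - 1
    p = [Fraction(n)]                             # p_0 = number of roots
    for k in range(1, count):
        s = sum(c[i] * p[k - i] for i in range(1, min(k - 1, n) + 1))
        if k <= n:
            s += k * c[k]
        p.append(-s)
    return p


def det(M):
    """Determinant by exact Gaussian elimination over the rationals."""
    M = [row[:] for row in M]
    size, sign, result = len(M), 1, Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if M[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            M[col], M[pivot] = M[pivot], M[col]
            sign = -sign
        result *= M[col][col]
        for r in range(col + 1, size):
            factor = M[r][col] / M[col][col]
            for k in range(col, size):
                M[r][k] -= factor * M[col][k]
    return sign * result


def is_real_rooted(coeffs):
    """Hermite-Sylvester test: H_ij = p_{i+j-2} must be positive semi-definite."""
    n = len(coeffs) - 1
    p = power_sums(coeffs, 2 * n - 1)
    H = [[p[i + j] for j in range(n)] for i in range(n)]   # 0-based indices
    # a symmetric matrix is PSD iff every principal minor is nonnegative
    return all(det([[H[r][c] for c in S] for r in S]) >= 0
               for size in range(1, n + 1)
               for S in combinations(range(n), size))


print(is_real_rooted([1, -6, 11, -6]))   # (t - 1)(t - 2)(t - 3)
print(is_real_rooted([1, 0, 1]))         # t^2 + 1
```

Checking every principal minor costs exponentially many determinants, so for larger degree an eigenvalue or factorization based semi-definiteness test would be the more natural design choice.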

We now return to the upper bound estimate. In [14], a similar upper bound was found by M. Shub for complex homogeneous polynomials. However, since we are only concerned with the real zeros of a homogeneous polynomial, the original argument must be accordingly modified to suit the needs of real solutions.

Theorem 2.

Let

f (x) \in R {[x_{1}, \dots, x_{n}]}_{d}

. Assume

V_{R} (f) \neq {0}

. Let

e \in S^{n - 1}

be such that

f (e) \neq 0

. If

f (x)

is hyperbolic in the direction e, then the Euclidean distance from the nearest zero of f on the unit sphere to e is at most

d^{*} (e)

, where

d^{*} (e) : = \sqrt{2 - 2 \sqrt{1 - {(\frac{| f (e) |}{{| | f | |}_{\infty}})}^{2 / d}}} .

Proof.

Without loss of generality, by rotation if necessary, we may assume

e = e_{1} = (1, 0, \dots, 0) \in S^{n - 1}

and

| f (e) | > 0

. For any

x \in R^{n} = R \times R^{n - 1}

, we can write

x = (x_{1}, y)

for

x_{1} \in R

and

y \in R^{n - 1}

. Then, the homogeneous polynomial

f (x)

takes on the form

f (x) = f (x_{1}, y) = H_{0} x_{1}^{d} + \sum_{i = 1}^{d} H_{i} (y) x_{1}^{d - i},

where

H_{i} (y) \in R {[x_{2}, \dots, x_{n}]}_{i}

is of homogeneous degree i for

0 \leq i \leq d

in the remaining indeterminates. Clearly,

f (e) = f (1, 0) = H_{0} \neq 0

, so we may instead consider the monic polynomial

F (x_{1}, y) = \frac{f (x)}{f (e)} = x_{1}^{d} + \sum_{i = 1}^{d} {\hat{H}}_{i} (y) x_{1}^{d - i},

where

{\hat{H}}_{i} (y) \in R {[x_{2}, \dots, x_{n}]}_{i}

for

1 \leq i \leq d

.

Let

z_{0} \in V_{R} (f) \cap S^{n - 1}

be the nearest zero of f to e. Let s be the arc length of the geodesic (great circle) joining e and

z_{0}

; then,

F (x_{1}, y)

has no real zeros inside the double cone

K_{e}

, whose central symmetry axis is in the direction e and which has radius

tan s

in the hyperplane defined by

x_{1} = 1

, as illustrated in the figure below.

Fix

0 \neq y \in D^{n - 1}

and let

x = (x_{1}, y) \in D^{1} \times D^{n - 1} ≅ D^{n}

. We now study the univariate polynomial

g (x_{1}) : = F (x_{1}, y), \forall x_{1} \in D^{1} = [- 1, 1] .

By assumption, since

f (x)

is hyperbolic in direction e,

g (x_{1}) = F (x_{1}, y)

is real rooted with all real zeros inside the double cone of radius

r = (\cot s) \cdot \| (0, y) \|_{2}

, whose central symmetry axis is in the direction

(0, y)

. According to Vieta’s formulas, the coefficient

{\hat{H}}_{i} (y)

is, up to sign, the ith elementary symmetric function of the roots. Since all roots are real and lie inside the cone of radius r, and hence have absolute value at most r, we have

| {\hat{H}}_{i} (y) | \leq \binom{d}{i} r^{i} .

It follows that

\begin{aligned} | F (x) | = \frac{| f (x) |}{| f (e) |} & \leq | x_{1} |^{d} + \sum_{i = 1}^{d} \binom{d}{i} r^{i} | x_{1} |^{d - i} \\ & = (| x_{1} | + r)^{d} \\ & = (| x_{1} | + (\cot s) \cdot \| (0, y) \|_{2})^{d} . \end{aligned}

If

x = (x_{1}, y) \in S^{n - 1}

, then

| F (x) | \leq {(| x_{1} | + (cot s) \sqrt{1 - | x_{1} |^{2}})}^{d} .

It is a straightforward calculus exercise to see that

max_{| x_{1} | \leq 1} (| x_{1} | + (cot s) \sqrt{1 - | x_{1} |^{2}}) = \sqrt{1 + {cot}^{2} s} = csc s, 0 < s \leq \frac{π}{2} .

This implies

\| f \|_{\infty} \leq | f (e) | (\csc s)^{d} \quad \text{or} \quad \sin s \leq (\frac{| f (e) |}{\| f \|_{\infty}})^{1 / d} .

We now return to the Euclidean distance

\| e - z_{0} \|_{2}

between e and

z_{0}

. Note that

\| e - z_{0} \|_{2}

is precisely the length of the chord connecting e and

z_{0}

with the prescribed arc length s, and it is therefore easy to see

\| e - z_{0} \|_{2}^{2} = 4 \sin^{2} (\frac{s}{2}) = 2 (1 - \cos s),

which implies

(1 - \frac{1}{2} \| e - z_{0} \|_{2}^{2})^{2} = \cos^{2} s = 1 - \sin^{2} s \geq 1 - (\frac{| f (e) |}{\| f \|_{\infty}})^{2 / d},

or equivalently,

\| e - z_{0} \|_{2} \leq d^{*} (e) = \sqrt{2 - 2 \sqrt{1 - (\frac{| f (e) |}{\| f \|_{\infty}})^{2 / d}}} .

This completes the proof. □
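The maximization step used in the proof can also be verified without differentiation. The following worked computation is a sketch based on the Cauchy–Schwarz inequality; the auxiliary symbols a and b are introduced here only for illustration.

```latex
\begin{aligned}
a &:= |x_1|, \qquad b := \sqrt{1 - |x_1|^2}, \qquad a^2 + b^2 = 1, \\
a + (\cot s)\, b &= \langle (a, b), (1, \cot s) \rangle
  \leq \sqrt{a^2 + b^2}\, \sqrt{1 + \cot^2 s} = \csc s, \\
\text{with equality at } (a, b) &= (\sin s, \cos s), \qquad 0 < s \leq \tfrac{\pi}{2}.
\end{aligned}
```

Equality requires (a, b) to be parallel to (1, \cot s), and the restriction s \leq π / 2 ensures that b = \cos s is nonnegative, as its definition demands.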

Although

{| | f | |}_{\infty}

is not directly computed via the coefficients of f itself, it is not difficult to obtain by available constrained optimization methods, for instance those of De Lathauwer et al. [15] and Kofidis and Regalia [16], or by using the MaxValue and MinValue commands provided directly by Mathematica [17].

Combining the results of Theorems 1 and 2, we have the following “nested spheres” estimate:

Corollary 1.

Let

f (x) \in R {[x_{1}, \dots, x_{n}]}_{d}

. Assume

V_{R} (f) \neq {0}

. Let

e \in S^{n - 1}

be such that

f (e) \neq 0

. Then

$d_{2} (e) \leq d (e, V_{R} (f))$ .
If in addition, $f (x)$ is hyperbolic in direction e, then $d (e, V_{R} (f)) \leq d^{*} (e)$ .
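The two bounds of Corollary 1 can be compared against the true distance on a small example. The sketch below makes several assumptions that are not in the original: it uses f (x_{1}, x_{2}) = (x_{1}^{2} - x_{2}^{2})^{2}, which is a product of linear forms and hence hyperbolic in any direction where it does not vanish, it takes the sample point e at angle 0.3 on the unit circle, and it estimates the sup norm by sampling rather than by a constrained optimizer.

```python
from math import comb, cos, sin, sqrt, pi


def f(x1, x2):
    """f(x1, x2) = (x1^2 - x2^2)^2 = x1^4 - 2 x1^2 x2^2 + x2^4."""
    return (x1 * x1 - x2 * x2) ** 2


d, n, N = 4, 2, 2              # degree, number of variables, largest |coefficient|
theta = 0.3                    # angle of the (assumed) initial point e on the circle
e = (cos(theta), sin(theta))
fe = abs(f(*e))

# lower bound d_2(e) from Theorem 1
d2 = fe / (d * sqrt(n) * comb(d + n - 1, n - 1) * N)

# sup norm on the unit circle, estimated by sampling (an approximation)
sup_norm = max(abs(f(cos(t), sin(t)))
               for t in (2 * pi * k / 10000 for k in range(10000)))

# upper bound d^*(e) from Theorem 2; min() guards against rounding above 1
ratio = min(1.0, (fe / sup_norm) ** (2 / d))
d_star = sqrt(2 - 2 * sqrt(1 - ratio))

# the unit zeros of f sit at angles pi/4 + k*pi/2; chord length to the nearest one
nearest = min(2 * abs(sin((theta - (pi / 4 + k * pi / 2)) / 2)) for k in range(4))

print(d2, nearest, d_star)
```

In this instance the true distance lies strictly between the two bounds, which is the “nested spheres” picture: no unit zero lies inside the sphere of radius d_{2} (e) around e, and at least one lies inside the sphere of radius d^{*} (e).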

4. The Zero Eigenvectors of a Nonzero Weakly Symmetric Tensor

In this section, we turn our attention to the problem of locating the nearest zero Z-eigenvector of a nonzero weakly symmetric tensor

A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

from a given point. We shall henceforth assume that

0 \neq A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

is weakly symmetric and has zero as a Z-eigenvalue.

Given an initial point

e \in R^{n} ∖ {0}

and assuming

A

has zero eigenvectors, we would like to provide both lower and upper bounds on the Euclidean distance from e to the nearest zero Z-eigenvector of

A

.

In order to make an easier transition from real homogeneous polynomials to real weakly symmetric tensors, we begin by analyzing the critical points of a homogeneous polynomial

f \in R {[x_{1}, \dots, x_{n}]}_{d}

. We denote by

Z (f) : = {x \in R^{n} : \nabla f (x) = 0}

the set of critical points of f. It is worth noting that both

V_{R} (f)

and

Z (f)

are closed and path-connected with

\{0\} \subseteq Z (f) \subseteq V_{R} (f)

. To see that

V_{R} (f)

is path-connected, clearly

0 \in V_{R} (f)

and observe that for any

0 \neq x \in V_{R} (f)

, we have

f (t x) = 0

for all

t \in R

, and thus, the whole line

[x] : = span {x} \subset V_{R} (f)

. Similarly, since

0 \in Z (f) = ⋂_{i = 1}^{n} V_{R} (f_{i}), where f_{i} (x) = \frac{\partial f}{\partial x_{i}} (x),

we have that

Z (f)

is also closed and path-connected. We see that

Z (f) \subseteq V_{R} (f)

follows from the fact

f (x) = \frac{1}{d} 〈 \nabla f (x), x 〉

.

In multivariate calculus, given a differentiable function

f : R^{n} \to R

, a point

p \in R^{n}

is said to be a critical point of f if

\nabla f (p) = 0

. Furthermore, p is said to be a non-degenerate critical point of f if the Hessian matrix of f at p is nonsingular. By the Morse lemma, all non-degenerate critical points are isolated; that is, if p is a non-degenerate critical point of f, then there exists a neighborhood of p that contains no other critical points of f. Furthermore, a non-degenerate critical point is a local maximum, a local minimum, or a saddle point of the function f.

Given a nonzero weakly symmetric tensor

A \in R^{[m, n]}

, let

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

be its associated homogeneous polynomial. Since

0

is always a critical point of

f_{A}

, if

0

is non-degenerate, then

0

must be an isolated point in

Z (f_{A})

. Since

Z (f_{A})

is path-connected, we have

Z (f_{A}) = {0}

. This implies that

\nabla f_{A} (x) = 0

has only the trivial solution; hence, 0 must not be a Z (or H)-eigenvalue of

A

. This observation directs our attention to the case where

0

is a degenerate critical point of

f_{A}

.

Example 6.

Consider the classical “monkey saddle” defined by

f (x, y) = x^{3} - 3 x y^{2}

. Its only critical point is

0 = (0, 0)

, which is a degenerate critical point. However, since

0

is the only critical point,

\nabla f (x, y) = 0

has only the trivial solution.

Example 6 shows a degenerate, yet isolated, critical point that still fails to be a candidate for zero eigenvectors. Suppose

0

is a non-isolated critical point of

f_{A}

(and is therefore necessarily degenerate). Then there exists

x_{0} \in Z (f_{A}) ∖ {0}

, i.e.,

\nabla f_{A} (x_{0}) = 0

, so

x_{0}

is a Z (or H)-eigenvector associated with the eigenvalue 0 (after normalization to unit length in the Z case). Hence, 0 must be a Z (or H)-eigenvalue of

A

.

Putting these observations together, we reach the following conclusion:

Proposition 1.

Let

A \in R^{[m, n]}

be a weakly symmetric tensor with associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

. The following are equivalent:

0 is a Z (or H)-eigenvalue of $A$ .
$0$ is a non-isolated critical point of $f_{A}$ .
$dim (Z (f_{A})) > 0$ .

We note that, when

0

is a non-isolated critical point of

f_{A}

, it is still possible to have

Z (f_{A}) ⊊ V_{R} (f_{A})

as supported by the following example.

Example 7.

Consider

f (x_{1}, x_{2}) : = x_{1}^{2} x_{2}^{2} (x_{1}^{2} - x_{2}^{2}) \in R {[x_{1}, x_{2}]}_{4} .

It is easy to see that

V_{R} (f)

consists of four lines:

x_{1} = 0

,

x_{2} = 0

, and

x_{1} = \pm x_{2}

. However,

Z (f)

consists of only the coordinate axes

x_{1} = 0

and

x_{2} = 0

.
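The description of Z (f) in Example 7 can be confirmed directly from the partial derivatives. The following worked computation is a sketch added for illustration.

```latex
\begin{aligned}
f(x_1, x_2) &= x_1^4 x_2^2 - x_1^2 x_2^4, \\
\frac{\partial f}{\partial x_1} &= 4 x_1^3 x_2^2 - 2 x_1 x_2^4 = 2 x_1 x_2^2 \, (2 x_1^2 - x_2^2), \\
\frac{\partial f}{\partial x_2} &= 2 x_1^4 x_2 - 4 x_1^2 x_2^3 = 2 x_1^2 x_2 \, (x_1^2 - 2 x_2^2).
\end{aligned}
```

Both partial derivatives vanish whenever x_{1} = 0 or x_{2} = 0. If x_{1} x_{2} \neq 0, a common zero would require 2 x_{1}^{2} = x_{2}^{2} and x_{1}^{2} = 2 x_{2}^{2} simultaneously, which forces x_{1} = x_{2} = 0. In particular, at the points (t, \pm t) with t \neq 0, the first partial derivative equals 2 t^{5} \neq 0, so the lines x_{1} = \pm x_{2} lie in V_{R} (f) but not in Z (f).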

Under the framework of tensors, we have the following alternative lower bounds on the distance from a given point

e \in R^{n}

with

f_{A} (e) \neq 0

to the nearest zero Z-eigenvector of

A

.

Lemma 3.

Let

0 \neq A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

be weakly symmetric with the associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

. Then, there exists a constant

M (A) > 0

such that

| f_{A} (x) | \leq M (A), \forall x \in D^{n} .

Namely,

\| f_{A} \|_{\infty} \leq M (A)

.

Proof.

By definition,

f_{A} (x) = \sum_{i_{1}, i_{2}, \dots, i_{m} = 1}^{n} a_{i_{1} i_{2} \dots i_{m}} x_{i_{1}} x_{i_{2}} \dots x_{i_{m}} .

We set

M (A) : = max_{1 \leq i_{1}, i_{2}, \dots, i_{m} \leq n} | a_{i_{1} i_{2} \dots i_{m}} |,

the largest entry in absolute value of

A

, then for all

x \in D^{n}

, we have

| f_{A} (x) | \leq M (A) .

□

Lemma 4.

Let

0 \neq A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

be weakly symmetric with associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

. Then, there exists a constant

C (A) > 0

such that for all x,

y \in D^{n}

,

| f_{A} (x) - f_{A} (y) | \leq C (A) \cdot \| x - y \|_{2} .

Proof.

The proof is similar to Lemma 2. We have the following alternative form of

f_{A} (x) = \sum_{| α | = m, 1 \leq j_{1} < \dots < j_{r} \leq n} (\binom{m}{α}) a_{j_{1}^{α_{1}} \dots j_{r}^{α_{r}}} x_{j_{1}}^{α_{1}} \dots x_{j_{r}}^{α_{r}},

where

(\binom{m}{α}) = \frac{m!}{α_{1}! \dots α_{r}!}

. Let

x, y \in D^{n}

. Since

D^{n}

is convex, by the mean value theorem, there exists a point c along the line segment

t x + (1 - t) y

for

0 \leq t \leq 1

, joining x and y such that

f_{A} (x) - f_{A} (y) = 〈 \nabla f_{A} (c), x - y 〉 .

By the Cauchy–Schwarz Inequality, we have:

| f_{A} (x) - f_{A} (y) | \leq \| \nabla f_{A} (c) \|_{2} \cdot \| x - y \|_{2} .

Since

\frac{\partial}{\partial x_{i}} (x_{j_{1}}^{α_{1}} \dots x_{j_{s}}^{α_{s}} \dots x_{j_{r}}^{α_{r}}) = \begin{cases} α_{s} x_{j_{1}}^{α_{1}} \dots x_{j_{s}}^{α_{s} - 1} \dots x_{j_{r}}^{α_{r}}, & j_{s} = i, \\ 0, & j_{s} \neq i, \end{cases}

we have:

\frac{\partial}{\partial x_{i}} f_{A} (x) = \sum_{| α | = m, 1 \leq j_{1} < \dots < j_{r} \leq n} (\binom{m}{α}) a_{j_{1}^{α_{1}} \dots j_{r}^{α_{r}}} \frac{\partial}{\partial x_{i}} (x_{j_{1}}^{α_{1}} \dots x_{j_{s}}^{α_{s}} \dots x_{j_{r}}^{α_{r}}) .

It follows that

\begin{aligned} | \frac{\partial}{\partial x_{i}} f_{A} (x) | & \leq \sum_{| α | = m, 1 \leq j_{1} < \dots < j_{r} \leq n} \binom{m}{α} | a_{j_{1}^{α_{1}} \dots j_{r}^{α_{r}}} | (α_{1} + \dots + α_{r}) \\ & \leq \sum_{| α | = m, 1 \leq j_{1} < \dots < j_{r} \leq n} \binom{m}{α} \cdot m \cdot M (A) \\ & = m \cdot n^{m} \cdot M (A) . \end{aligned}

We set

C (A) : = m \cdot n^{m + 1 / 2} \cdot M (A)

, and it yields:

\| \nabla f_{A} (x) \|_{2}^{2} = \sum_{i = 1}^{n} | \frac{\partial}{\partial x_{i}} f_{A} (x) |^{2} \leq n^{2 m + 1} (m \cdot M (A))^{2} .

Thus,

\| \nabla f_{A} (x) \|_{2} \leq C (A),

which completes the proof. □

Then we immediately have the following consequence.

Corollary 2.

Let

0 \neq A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

be weakly symmetric with the associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

. Let

e \in S^{n - 1} ∖ V_{R} (f_{A})

. Assume

V_{R} (f_{A}) \neq {0}

. Let

y \in S^{n - 1} \cap V_{R} (f_{A})

, then

{| | e - y | |}_{2} \geq \frac{| f_{A} (e) |}{C (A)} .
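The bound of Corollary 2 needs only the largest entry of the tensor and the value f_{A} (e). The sketch below is illustrative and rests on assumptions not made in the original: the tensor is stored as a dictionary from 0-based index tuples to entries, the symmetric representative of the polynomial (x_{1}^{2} - x_{2}^{2})^{2} is used as the test tensor, and the helper names `symmetric_tensor` and `corollary2_bound` are hypothetical.

```python
from fractions import Fraction
from itertools import permutations
from math import prod, sqrt


def symmetric_tensor(monomials):
    """Spread each coefficient evenly over all orderings of its index tuple.

    Assumes each monomial is listed once, under a single index tuple.
    """
    A = {}
    for idx, coeff in monomials.items():
        orderings = set(permutations(idx))
        for p in orderings:
            A[p] = Fraction(coeff, len(orderings))
    return A


def corollary2_bound(A, n, e):
    """Return |f_A(e)| / C(A) with C(A) = m * n^(m + 1/2) * M(A)."""
    m = len(next(iter(A)))
    M = max(abs(a) for a in A.values())          # largest entry in absolute value
    C = m * n ** (m + 0.5) * M
    fe = sum(a * prod(e[j] for j in idx) for idx, a in A.items())
    return abs(fe) / C


# symmetric tensor of x1^4 - 2 x1^2 x2^2 + x2^4 (indices 0 and 1 stand for x1 and x2)
A = symmetric_tensor({(0, 0, 0, 0): 1, (0, 0, 1, 1): -2, (1, 1, 1, 1): 1})
bound = corollary2_bound(A, 2, (1, 0))           # equals 1 / (64 * sqrt(2))
print(bound)
```

For this tensor the six entries with two indices of each kind equal -1/3, so M (A) = 1 and C (A) = 4 \cdot 2^{4.5}, giving a bound of roughly 0.011.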

In conjunction with Theorem 1, we also have:

Corollary 3.

Let

0 \neq A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

be weakly symmetric with the associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

. Let

e \in S^{n - 1} ∖ V_{R} (f_{A})

. Assume that

V_{R} (f_{A}) \neq {0}

, then

d (e, V_{R} (f_{A})) \geq d_{2} (e)

.

Since

V_{R} (f_{A}) \cap S^{n - 1}

is closed and bounded, it is compact; there must be a point

y_{0} \in S^{n - 1} \cap V_{R} (f_{A})

such that

\| e - y_{0} \|_{2} = \min_{y \in S^{n - 1} \cap V_{R} (f_{A})} \| e - y \|_{2}

, which is the nearest zero of

f_{A}

on the unit sphere to e. Projectively speaking, the distance from the line

[e]

to

V_{R} (f_{A}) \cap {RP}^{n - 1}

is at least

max \{d_{2} (e), \frac{| f_{A} (e) |}{C (A)}\}

.

We end this section by improving upon this lower bound. First, we introduce another constant

U (f_{A})

as follows.

Let

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

be given as above. Since each partial derivative

\frac{\partial}{\partial x_{i}} f_{A} (x)

for

1 \leq i \leq n

is a degree

(m - 1)

homogeneous polynomial of

x_{1}, \dots, x_{n}

, using Lemma 1, we can define in the same fashion constants

N (\frac{\partial f_{A}}{\partial x_{i}}) > 0

for

1 \leq i \leq n

. Now, we set

U (f_{A}) : = max_{1 \leq i \leq n} \{N (\frac{\partial f_{A}}{\partial x_{i}})\} .

Theorem 3.

Let

0 \neq A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

be weakly symmetric with the associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

. Assume that

0

is a non-isolated critical point of

f_{A}

. Let

e \in S^{n - 1} ∖ V_{R} (f_{A})

, then the Euclidean distance from the nearest zero Z-eigenvector of

A

on the unit sphere to e is at least

{\hat{d}}_{2} (e)

, where

{\hat{d}}_{2} (e) : = \frac{m | f_{A} (e) |}{(m - 1) \cdot n \cdot (\binom{m + n - 2}{n - 1}) \cdot U (f_{A})} .

Proof.

For any

e, y \in S^{n - 1}

, we have:

\| \nabla f_{A} (e) - \nabla f_{A} (y) \|_{2}^{2} = \sum_{i = 1}^{n} | \frac{\partial f_{A}}{\partial x_{i}} (e) - \frac{\partial f_{A}}{\partial x_{i}} (y) |^{2} .

It follows from Lemma 2, for

1 \leq i \leq n

,

| \frac{\partial f_{A}}{\partial x_{i}} (e) - \frac{\partial f_{A}}{\partial x_{i}} (y) | \leq (m - 1) \cdot \sqrt{n} (\binom{m + n - 2}{n - 1}) \cdot N (\frac{\partial f_{A}}{\partial x_{i}}) {| | e - y | |}_{2} .

Since

U (f_{A}) : = max_{1 \leq i \leq n} N (\frac{\partial f_{A}}{\partial x_{i}})

, we have:

| \frac{\partial f_{A}}{\partial x_{i}} (e) - \frac{\partial f_{A}}{\partial x_{i}} (y) |^{2} \leq {(m - 1)}^{2} \cdot n \cdot {(\binom{m + n - 2}{n - 1})}^{2} \cdot {(U (f_{A}))}^{2} {| | e - y | |}_{2}^{2} .

Hence,

\begin{aligned} \| \nabla f_{A} (e) - \nabla f_{A} (y) \|_{2}^{2} & \leq \sum_{i = 1}^{n} (m - 1)^{2} \cdot n \cdot {\binom{m + n - 2}{n - 1}}^{2} \cdot (U (f_{A}))^{2} \| e - y \|_{2}^{2} \\ & \leq (m - 1)^{2} \cdot n^{2} \cdot {\binom{m + n - 2}{n - 1}}^{2} \cdot (U (f_{A}))^{2} \| e - y \|_{2}^{2} . \end{aligned}

i.e.,

{| | \nabla f_{A} (e) - \nabla f_{A} (y) | |}_{2} \leq (m - 1) \cdot n \cdot (\binom{m + n - 2}{n - 1}) \cdot U (f_{A}) \cdot {| | e - y | |}_{2} .

Suppose

y \in Z (f_{A}) \cap S^{n - 1}

. Then,

\nabla f_{A} (y) = 0

. This implies

\| \nabla f_{A} (e) \|_{2} \leq (m - 1) \cdot n \cdot \binom{m + n - 2}{n - 1} \cdot U (f_{A}) \cdot \| e - y \|_{2} .

On the other hand, since

〈 \nabla f_{A} (e), e 〉 = m f_{A} (e)

, we obtain by the Cauchy–Schwarz inequality that

\begin{aligned} m | f_{A} (e) | & = | \langle \nabla f_{A} (e), e \rangle | \leq \| \nabla f_{A} (e) \|_{2} \cdot \| e \|_{2} \\ & = \| \nabla f_{A} (e) \|_{2} \leq (m - 1) \cdot n \cdot \binom{m + n - 2}{n - 1} \cdot U (f_{A}) \cdot \| e - y \|_{2} . \end{aligned}

Consequently,

{| | e - y | |}_{2} \geq \frac{m | f_{A} (e) |}{(m - 1) \cdot n \cdot (\binom{m + n - 2}{n - 1}) \cdot U (f_{A})} = {\hat{d}}_{2} (e)

as required. Lastly, since

Z (f_{A}) \cap S^{n - 1}

is compact, the Euclidean distance from the nearest zero Z-eigenvector of

A

on the unit sphere to e is attained at some

y_{0} \in Z (f_{A}) \cap S^{n - 1}

with

\| e - y_{0} \|_{2} \geq {\hat{d}}_{2} (e)

. □
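The constant U (f_{A}) and the bound of Theorem 3 are computable from the coefficients of f_{A} by formal differentiation. The following sketch assumes a dictionary representation of f_{A} (exponent tuple to integer coefficient) and an integer value of f_{A} (e); the helper names `partial` and `d_hat_2` are hypothetical. The test polynomial is the one used later in Example 8.

```python
from fractions import Fraction
from math import comb


def partial(coeffs, i):
    """Partial derivative with respect to x_i of {exponent tuple: coefficient}."""
    out = {}
    for nu, a in coeffs.items():
        if nu[i] > 0:
            new = nu[:i] + (nu[i] - 1,) + nu[i + 1:]
            out[new] = out.get(new, 0) + a * nu[i]
    return out


def d_hat_2(coeffs, fe):
    """m |f(e)| / ((m - 1) * n * binom(m+n-2, n-1) * U(f)), as an exact fraction."""
    nu = next(iter(coeffs))
    n, m = len(nu), sum(nu)
    # U(f): largest |coefficient| among all first-order partial derivatives
    U = max(abs(a) for i in range(n) for a in partial(coeffs, i).values())
    return Fraction(m * abs(fe), (m - 1) * n * comb(m + n - 2, n - 1) * U)


# f_A(x1, x2) = (x1^2 - x2^2)^2 with f_A(e_1) = 1
f = {(4, 0): 1, (2, 2): -2, (0, 4): 1}
print(d_hat_2(f, 1))    # prints 1/24
```

Here both partial derivatives have largest coefficient 4, so U (f_{A}) = 4 and the bound evaluates to 4 / 96 = 1 / 24.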

Remark 1.

The main difference between Corollary 3 and Theorem 3 is that Corollary 3 gives a lower bound

d_{2} (e)

for the distance from e to the nearest unit-length real zero of

f_{A}

, whereas Theorem 3 gives a lower bound

{\hat{d}}_{2} (e)

for the distance from e to the nearest unit-length zero Z-eigenvector of

A

. As the examples in Section 5 suggest, the lower bound

{\hat{d}}_{2} (e)

in Theorem 3 tends to be sharper than

d_{2} (e)

given in Corollary 3.

We now rephrase Theorem 2 as follows:

Theorem 4.

Let

0 \neq A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

be weakly symmetric with the associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

. Assume that

V_{R} (f_{A}) \neq {0}

. Let

e \in S^{n - 1} ∖ V_{R} (f_{A})

. If

f_{A} (x)

is hyperbolic in direction e, then the Euclidean distance from the nearest zero of

f_{A}

on the unit sphere to e is at most

d^{*} (e)

, where

d^{*} (e) : = \sqrt{2 - 2 \sqrt{1 - (\frac{| f_{A} (e) |}{\| f_{A} \|_{\infty}})^{2 / m}}} .

Similarly to Corollary 1, we now have:

Corollary 4.

Let

0 \neq A = (a_{i_{1} \dots i_{m}}) \in R^{[m, n]}

be weakly symmetric with the associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

. Assume that

0

is a non-isolated critical point of

f_{A}

. Let

e \in S^{n - 1} ∖ V_{R} (f_{A})

. Then,

${\hat{d}}_{2} (e) \leq d (e, Z (f_{A}))$ .
If, in addition, $f_{A} (x)$ is hyperbolic in direction e, then $d (e, V_{R} (f_{A})) \leq d^{*} (e)$ .

Remark 2.

In fact, a similar strategy is frequently adopted in single variable calculus. In order to find the inflection points of a smooth real-valued function f, we first find the real zeros of

f^{″} (x)

and then apply the first or second derivative test to check whether they are truly inflection points.

5. Some Examples

In this section, we will examine the upper and lower bounds obtained in previous sections via a collection of examples of distinct nature.

Example 8.

Let

A \in R^{[4, 2]}

be the weakly symmetric tensor whose associated homogeneous polynomial is

f_{A} (x_{1}, x_{2}) = {(x_{1}^{2} - x_{2}^{2})}^{2} = x_{1}^{4} - 2 x_{1}^{2} x_{2}^{2} + x_{2}^{4} .

Clearly,

f_{A} (e_{1}) = 1

. Furthermore, since

f_{A}

is a product of hyperbolic polynomials in direction

e_{1}

, it is itself a hyperbolic polynomial in direction

e_{1}

. Additionally, since it is a perfect square, it is automatically a PSD tensor. By direct calculation, we see that:

\begin{matrix} f_{A} (e_{1}) = 1, \| f_{A} \|_{\infty} = 1, d_{2} (e_{1}) = \frac{1}{40 \sqrt{2}} \approx 0.0177, \\ {\hat{d}}_{2} (e_{1}) = \frac{1}{24} \approx 0.0417, \text{ and } d^{*} (e_{1}) = \sqrt{2} . \end{matrix}

The zero Z-eigenvectors are

z_{1} = (\frac{1}{\sqrt{2}}, \frac{1}{\sqrt{2}}) and z_{2} = - z_{1} .

We see that

| | e_{1} - z_{1} {| |}_{2} \approx 0.7654 .

It is also clear, by symmetry, that

f_{A} (e_{2}) = 1

and

f_{A}

is hyperbolic in the direction

e_{2}

; hence, the exact same conclusion holds for the initial point

e_{2}

.

Example 9.

Let

A \in R^{[4, 2]}

be the weakly symmetric tensor whose associated homogeneous polynomial is

\begin{matrix} f_{A} (x_{1}, x_{2}) & = & x_{1}^{4} + 4 x_{1}^{3} x_{2} + 3 x_{1}^{2} x_{2}^{2} - 4 x_{1} x_{2}^{3} - 4 x_{2}^{4} \\ = & {(x_{1} + 2 x_{2})}^{2} (x_{1}^{2} - x_{2}^{2}) . \end{matrix}

Clearly,

f_{A} (e_{1}) = 1

. Furthermore, since

f_{A}

is a product of hyperbolic polynomials in direction

e_{1}

, it is itself a hyperbolic polynomial in direction

e_{1}

, but it is not PSD.

On the other hand, it is easy to check

f_{A}

is also hyperbolic in direction

e_{2}

:

p (t) = f_{A} (t e_{2} + (x_{1}, x_{2})) = x_{1}^{4} + 4 x_{1}^{3} (t + x_{2}) + 3 x_{1}^{2} {(t + x_{2})}^{2} - 4 x_{1} {(t + x_{2})}^{3} - 4 {(t + x_{2})}^{4},

which has all real roots

t = \pm x_{1} - x_{2}

and

t = - \frac{1}{2} (x_{1} + 2 x_{2})

. We compute to see that

\begin{matrix} | | f_{A} {| |}_{\infty} = - min_{x \in S^{1}} f_{A} (x) \approx 4.3223, d_{2} (e_{1}) = \frac{1}{80 \sqrt{2}} \approx 0.0088, \\ {\hat{d}}_{2} (e_{1}) = \frac{1}{96} \approx 0.0104, d^{*} (e_{1}) \approx 0.7478; \\ d_{2} (e_{2}) = \frac{1}{20 \sqrt{2}} \approx 0.0354, {\hat{d}}_{2} (e_{2}) = \frac{1}{24} \approx 0.0417, and d^{*} (e_{2}) \approx 1.2689 . \end{matrix}

Using Wolfram’s software Mathematica 10.2 [17], we find that the zero Z-eigenvectors are

z_{1} \approx (0.8944, - 0.4472) and z_{2} = - z_{1} .

We now compare to

| | e_{1} - z_{1} {| |}_{2} \approx 0.4595 and | | e_{2} - z_{2} {| |}_{2} \approx 1.0514,

see the figure below

This example will be referenced later in this section.
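The sup norm and the two upper bounds quoted in this example can be cross-checked without a computer algebra system. The sketch below is our own check, not the computation used for the reported values: it samples the unit circle on a uniform grid (the grid size of 200,000 points and all function names are arbitrary choices of ours) and then evaluates d^{*} at e_{1} and e_{2}.

```python
from math import cos, sin, pi, sqrt


def f(x1, x2):
    """f_A from Example 9 in factored form."""
    return (x1 + 2 * x2) ** 2 * (x1 ** 2 - x2 ** 2)


def sup_norm_on_circle(g, samples=200_000):
    """Grid estimate of max |g| over the unit circle S^1."""
    return max(
        abs(g(cos(2 * pi * k / samples), sin(2 * pi * k / samples)))
        for k in range(samples)
    )


def d_star(abs_f_e, sup_norm, m):
    """Upper bound d*(e) as defined in Theorem 4."""
    return sqrt(2 - 2 * sqrt(1 - (abs_f_e / sup_norm) ** (2 / m)))


sup_norm = sup_norm_on_circle(f)
d_star_e1 = d_star(abs(f(1, 0)), sup_norm, 4)
d_star_e2 = d_star(abs(f(0, 1)), sup_norm, 4)
print(round(sup_norm, 4), round(d_star_e1, 4), round(d_star_e2, 4))
```

A grid estimate can only underestimate the true sup norm, so it is suitable as a sanity check rather than as a certified value.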

Example 10.

Let

A \in R^{[3, 3]}

be the weakly symmetric tensor whose associated homogeneous polynomial is

\begin{matrix} f_{A} (x_{1}, x_{2}, x_{3}) & = & x_{1}^{3} + x_{1}^{2} x_{2} - x_{1} x_{2}^{2} - x_{2}^{3} + x_{1}^{2} x_{3} - x_{2}^{2} x_{3} - x_{1} x_{3}^{2} - x_{2} x_{3}^{2} - x_{3}^{3} \\ = & (x_{1}^{2} - x_{2}^{2} - x_{3}^{2}) (x_{1} + x_{2} + x_{3}) . \end{matrix}

In order to check whether

f_{A}

is hyperbolic in direction

e_{1}

, we compute:

p (t) = f_{A} (t e_{1} + (x_{1}, x_{2}, x_{3})) = [{(t + x_{1})}^{2} - x_{2}^{2} - x_{3}^{2}] (t + x_{1} + x_{2} + x_{3}) .

So

p (t)

has all real roots

t = - (x_{1} + x_{2} + x_{3})

and

t = - x_{1} \pm \sqrt{x_{2}^{2} + x_{3}^{2}}

. It is clear

f_{A} (e_{1}) = 1

. We use Mathematica [17] to find

| | f_{A} {| |}_{\infty} = - min_{x \in S^{2}} f_{A} (x) \approx 1.4804

. We also have:

d^{*} (e_{1}) \approx 1.0201, d_{2} (e_{1}) = \frac{1}{30 \sqrt{3}} \approx 0.0192, and {\hat{d}}_{2} (e_{1}) = \frac{1}{36} \approx 0.0278 .

A direct elimination shows the zero Z-eigenvectors are

z_{1} = (\frac{1}{\sqrt{2}}, \frac{1}{\sqrt{2}}, 0) and z_{2} = - z_{1},

whose distance

| | e_{1} - z_{1} {| |}_{2} \approx 0.7654

.

In contrast to the previous examples, the current tensor has order 3, so it cannot be PSD. It is also true that

f_{A}

is not hyperbolic in any other direction.

Example 11.

Let

A = (\begin{matrix} 1 & \frac{1}{2} & 0 & 0 \\ \frac{1}{2} & 1 & 0 & 0 \\ 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 2 \end{matrix}) and B = (\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}) .

Then,

A, B \in {Sym}_{4} (R)

and

\forall x, y \in R

,

x A + y B \in {Sym}_{4} (R)

satisfying

\begin{matrix} det (x A + y B) & = & det (\begin{matrix} x & \frac{1}{2} x & 0 & 0 \\ \frac{1}{2} x & x & 0 & 0 \\ 0 & 0 & 2 x + y & 0 \\ 0 & 0 & 0 & 2 x + y \end{matrix}) \\ = & \frac{3}{4} x^{2} {(2 x + y)}^{2} \\ = & 3 x^{4} + 3 x^{3} y + \frac{3}{4} x^{2} y^{2} \in R {[x, y]}_{4} . \end{matrix}

Let

A \in R^{[4, 2]}

be the weakly symmetric tensor whose associated homogeneous polynomial is

f_{A} (x, y) = det (x A + y B)

. It is straightforward to see

f_{A} (e_{1}) = 3 \text{ and } p (t) : = f_{A} (t e_{1} + (x, y)) = \frac{3}{4} {(t + x)}^{2} {(2 (t + x) + y)}^{2},

which is real rooted, hence

f_{A}

is hyperbolic in the direction

e_{1}

. In addition, by Mathematica [17], we find that

| | f_{A} {| |}_{\infty} \approx 3.3646

. We also have:

d^{*} (e_{1}) \approx 1.2361, d_{2} (e_{1}) = \frac{1}{20 \sqrt{2}} \approx 0.0354, {\hat{d}}_{2} (e_{1}) = \frac{1}{24} \approx 0.0417,

and the zero Z-eigenvectors are

z_{1} = (0, 1), z_{2} = - z_{1}, z_{3} \approx (0.4472, - 0.8944), and z_{4} = - z_{3} .

It is evident that the nearest zero Z-eigenvector to

e_{1}

is

z_{3}

with

| | e_{1} - z_{3} {| |}_{2} \approx 1.0514

.

Example 12.

Let

A \in R^{[4, 3]}

be the weakly symmetric tensor whose associated homogeneous polynomial is

\begin{matrix} f_{A} (x_{1}, x_{2}, x_{3}) & = & x_{1}^{2} x_{2} x_{3} - 2 x_{1} x_{2}^{2} x_{3} + 3 x_{1} x_{2} x_{3}^{2} \\ = & x_{1} x_{2} x_{3} (x_{1} - 2 x_{2} + 3 x_{3}) . \end{matrix}

This is not a PSD tensor. However, if we choose

e = (\frac{1}{\sqrt{3}}, \frac{1}{\sqrt{3}}, \frac{1}{\sqrt{3}})

, then it is easy to see

f_{A} (e) = \frac{2}{9}

. We use Mathematica [17] to find that

| | f_{A} {| |}_{\infty} = - min_{x \in S^{2}} f_{A} (x) \approx 0.6745

. Since

f_{A}

is a product of polynomials that are hyperbolic in direction e, it is also hyperbolic in direction e. This immediately gives

d^{*} (e) \approx 0.8334, d_{2} (e) = \frac{1}{810 \sqrt{3}} \approx 7.1278 \cdot 10^{- 4}, {\hat{d}}_{2} (e) = \frac{2}{1215} \approx 8.2305 \cdot 10^{- 4} .

Using Mathematica [17], we find the zero Z-eigenvectors to be:

\begin{matrix} z_{1} = (1, 0, 0), z_{2} = - z_{1}, z_{3} = (0, 1, 0), z_{4} = - z_{3}, z_{5} = (0, 0, 1), z_{6} = - z_{5}, \\ z_{7} = (\frac{2}{\sqrt{5}}, \frac{1}{\sqrt{5}}, 0), z_{8} = - z_{7}, z_{9} = (\frac{3}{\sqrt{10}}, \frac{1}{\sqrt{10}}, 0), z_{10} = - z_{9}, \\ z_{11} = (0, \frac{3}{\sqrt{13}}, \frac{2}{\sqrt{13}}), and z_{12} = - z_{11}, \end{matrix}

It turns out

| | e - z_{11} {| |}_{2} \approx 0.6314, | | e - z_{7} {| |}_{2} \approx 0.6714, and | | e - z_{9} {| |}_{2} \approx 0.7344 .

From this, we can see the upper bound

d^{*} (e) \approx 0.8334

in fact encloses all the points

z_{i}

for

7 \leq i \leq 12

projectively, while

z_{11}

is the nearest to e, as shown in the figure below.
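The three distances quoted above, and the claim that all of them fall below d^{*} (e) \approx 0.8334, can be reproduced with a few lines of Python. This is only a check of the reported arithmetic; the dictionary keys and variable names are ours, and the coordinates are copied from the list of zero Z-eigenvectors given in this example.

```python
from math import dist, sqrt

# Initial point e = (1, 1, 1) / sqrt(3).
e = (1 / sqrt(3), 1 / sqrt(3), 1 / sqrt(3))

# Coordinates of z_7, z_9 and z_11 as listed in the example.
candidates = {
    "z7": (2 / sqrt(5), 1 / sqrt(5), 0.0),
    "z9": (3 / sqrt(10), 1 / sqrt(10), 0.0),
    "z11": (0.0, 3 / sqrt(13), 2 / sqrt(13)),
}

distances = {name: dist(e, z) for name, z in candidates.items()}
nearest = min(distances, key=distances.get)

d_star_e = 0.8334  # upper bound reported in the example
for name, d in sorted(distances.items(), key=lambda item: item[1]):
    print(name, round(d, 4), d <= d_star_e)
print("nearest:", nearest)
```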

The following example shows that, even though

f_{A}

may not be hyperbolic in any direction e, the upper bound provided by Theorem 4 may still remain valid.

Example 13.

Let

A \in R^{[4, 2]}

be the weakly symmetric tensor whose associated homogeneous polynomial is

f_{A} (x, y) = x^{4} - 3 x^{3} y + x^{2} y^{2} + 4 y^{4} .

It is clear

f_{A} (e_{1}) = 1 and {\hat{d}}_{2} (e_{1}) = \frac{1}{96} \approx 0.0104 .

We now show that

f_{A}

is not hyperbolic in any direction

e = (a, b) \neq (0, 0)

. We compute:

p (t) = f_{A} (t e + (x, y)) = {(a t + x)}^{4} - 3 {(a t + x)}^{3} (b t + y) + {(a t + x)}^{2} {(b t + y)}^{2} + 4 {(b t + y)}^{4} .

Solving for t, Mathematica yields:

t = \frac{2 y - x}{a - 2 b} \text{ and } t = \frac{- 2 a x - b x - a y - 2 b y \pm \sqrt{3 (- b^{2} x^{2} + 2 a b x y - a^{2} y^{2})}}{2 (a^{2} + a b + b^{2})} .

However,

- b^{2} x^{2} + 2 a b x y - a^{2} y^{2} = - {(b x - a y)}^{2}

. Thus,

p (t)

is real rooted if and only if

a = b = 0

, which is impossible.
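The failure of hyperbolicity can be seen concretely for one sample choice. The Python sketch below is our own illustration: it takes the direction e_{1} = (1, 0) and the base point (x, y) = (0, 1), both picked arbitrarily, evaluates the pair of roots given by the second formula above with a complex square root, and confirms that they are genuine roots of p (t) with nonzero imaginary part.

```python
import cmath


def f(x, y):
    """f_A from Example 13; also accepts complex arguments."""
    return x ** 4 - 3 * x ** 3 * y + x ** 2 * y ** 2 + 4 * y ** 4


def quadratic_root_pair(a, b, x, y):
    """Roots t from the second root formula of Example 13."""
    disc = 3 * (-(b * x - a * y) ** 2)  # equals 3(-b^2x^2 + 2abxy - a^2y^2)
    base = -2 * a * x - b * x - a * y - 2 * b * y
    denom = 2 * (a * a + a * b + b * b)
    root = cmath.sqrt(disc)
    return (base + root) / denom, (base - root) / denom


a, b = 1.0, 0.0  # direction e_1 (sample choice)
x, y = 0.0, 1.0  # base point (sample choice)

roots = quadratic_root_pair(a, b, x, y)
residuals = [abs(f(a * t + x, b * t + y)) for t in roots]
for t, r in zip(roots, residuals):
    print(t, r)
```

For this choice the two roots are the primitive cube roots of unity, so p (t) is not real rooted in direction e_{1}.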

On the other hand, since

| | f_{A} {| |}_{\infty} = 4

,

d^{*} (e_{1}) = \sqrt{2 - \sqrt{2}} \approx 0.7654

. Furthermore, the zero Z-eigenvectors are

z_{1} = (\frac{2}{\sqrt{5}}, \frac{1}{\sqrt{5}}) and z_{2} = - z_{1} .

Thus,

| | e_{1} - z_{1} {| |}_{2} \approx 0.4595 < d^{*} (e_{1})

.

We experimented with several other examples where

f_{A}

is not hyperbolic in any obvious direction e; however, the upper bound still held. For this reason, we end this section by proposing the following procedure which may lead to promising outcomes.

[Procedure] Given a nonzero m-order n-dimensional weakly symmetric tensor

A = (a_{i_{1} \dots i_{m}})

with the associated homogeneous polynomial

f_{A} (x) \in R {[x_{1}, \dots, x_{n}]}_{m}

, define the index set

Δ (A) : = \{1 \leq j \leq n : a_{j \dots j} \neq 0\} .

Case 1: Suppose $Δ (A) \neq \emptyset$ .

Step 1.1. Let $j_{0} : = min (Δ (A))$ be the least index. It is clear that $f_{A} (e_{j_{0}}) \neq 0$ , where $e_{j_{0}} = (0, \dots, 1, \dots, 0)$ with the only 1 in the $j_{0}$ -th coordinate. We then compute ${\hat{d}}_{2} (e_{j_{0}})$ using Theorem 3.
Step 1.2. Check whether $f_{A} (x)$ is hyperbolic in direction $e_{j_{0}}$ . If it is, we then compute $d^{*} (e_{j_{0}})$ using Theorem 4. Consequently, the nearest possible zero Z-eigenvector is nested in between the two spheres centered at $e_{j_{0}}$ of radii ${\hat{d}}_{2} (e_{j_{0}})$ and $d^{*} (e_{j_{0}})$ , respectively.
Step 1.3. Repeat Step 1.2 with any other direction $e_{j}$ for all subsequent indices $j \in Δ (A)$ . If $f_{A} (x)$ is also hyperbolic in direction $e_{j}$ , we then compute $d^{*} (e_{j})$ for each such j using Theorem 4. Consequently, the nearest possible zero Z-eigenvector is nested in between the two spheres centered at $e_{j}$ of radii $d_{2} (e_{j})$ and $d^{*} (e_{j})$ , respectively. Putting these spheres of different sizes together, we have projectively located many if not all of the zero Z-eigenvectors. We refer to Example 9 for a detailed demonstration.
Step 1.4. If $f_{A} (x)$ is not hyperbolic in direction $e_{j}$ for any $j \in Δ (A)$ , it is inconclusive. However, we can still compute $d^{*} (e_{j})$ , but only use it as a possible upper bound with caution, as demonstrated in Examples 13 and 14.

Case 2: Suppose $Δ (A) = \emptyset$ .

Step 2.1. If there is an obvious choice $e \neq 0$ such that $f_{A} (e) \neq 0$ , then we can normalize e if necessary and use this as our initial point and follow the outlined steps 1.1 and 1.2 as given above. Otherwise, we move to the following step.
Step 2.2. Applying the shifted symmetric higher-order power method (SS-HOPM), as provided by Kolda and Mayo [9], we choose a parameter $α > 0$ large enough such that

${\hat{f}}_{A} (x) : = f_{A} (x) + α {\| x \|}_{2}^{m}$

becomes either convex or concave. According to [9], it is usually required to have

$α > (m - 1) \cdot max_{x \in S^{n - 1}} ρ (A x^{m - 2}),$

where $m (m - 1) A x^{m - 2}$ denotes the Hessian matrix of $f_{A} (x)$ and $ρ (A x^{m - 2})$ denotes the spectral radius of $A x^{m - 2}$ . A common conservative choice of $α$ is

$α = (m - 1) \sum_{1 \leq i_{1}, \dots, i_{m} \leq n} | a_{i_{1} \dots i_{m}} | .$
Step 2.3. Starting with the initial point $e_{1} = (1, 0, \dots, 0) \in R^{n}$ , the SS-HOPM will converge to some $x_{0} \in S^{n - 1}$ such that $\nabla {\hat{f}}_{A} (x_{0}) = 0$ . As a consequence, $0 = f_{A} (x_{0}) + α$ ; hence, $x_{0} \in S^{n - 1} ∖ V_{R} (f_{A})$ .
Step 2.4. We now use $x_{0}$ in place of $e_{j_{0}}$ as in Step 1.1 and proceed in a similar fashion to find ${\hat{d}}_{2} (x_{0})$ as well as $d^{*} (x_{0})$ .
Step 2.5. We choose $x_{1} \in {x_{0}}^{⊥} \cap S^{n - 1}$ analogous to $e_{j}$ in Step 1.3 and proceed in a similar fashion to find ${\hat{d}}_{2} (x_{1})$ as well as $d^{*} (x_{1})$ . We continue this process until there is no more orthogonal vector left, at which point, we can locate many if not all of the zero Z-eigenvectors projectively.
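The bookkeeping at the start of the procedure (the index set Δ (A), the choice between Case 1 and Case 2, and the conservative shift α of Step 2.2) can be sketched in Python as follows. This is a minimal sketch under our own assumptions: the tensor is stored as a dictionary from 1-based index tuples to entries, all function names are hypothetical, and the hyperbolicity test and the SS-HOPM iteration themselves are not implemented. The two sample tensors are made up for illustration.

```python
from itertools import permutations
from math import prod


def diagonal_index_set(entries, m, n):
    """Delta(A): 1-based indices j with a_{j...j} != 0."""
    return [j for j in range(1, n + 1) if entries.get((j,) * m, 0) != 0]


def conservative_shift(entries, m):
    """alpha = (m - 1) * (sum of |a_{i1...im}| over all entries)."""
    return (m - 1) * sum(abs(v) for v in entries.values())


def evaluate_form(entries, x):
    """f_A(x) = sum over index tuples of a_{i1...im} * x_{i1} * ... * x_{im}."""
    return sum(v * prod(x[i - 1] for i in idx) for idx, v in entries.items())


def starting_data(entries, m, n):
    """Return the least index j0 (Case 1) or the shift alpha (Case 2)."""
    delta = diagonal_index_set(entries, m, n)
    if delta:
        j0 = delta[0]
        e = [1.0 if j == j0 else 0.0 for j in range(1, n + 1)]
        return {"case": 1, "j0": j0, "f_at_e": evaluate_form(entries, e)}
    return {"case": 2, "alpha": conservative_shift(entries, m)}


# Illustrative tensor with f_A = x1^4 - 4 x2^4 (nonzero diagonal entries).
diagonal = {(1, 1, 1, 1): 1.0, (2, 2, 2, 2): -4.0}
# Illustrative symmetric tensor with f_A = x1^2 x2^2 (zero diagonal entries).
mixed = {idx: 1 / 6 for idx in set(permutations((1, 1, 2, 2)))}

print(starting_data(diagonal, 4, 2))
print(starting_data(mixed, 4, 2))
```

Storing only the nonzero entries keeps the sketch short; a dense array representation would work equally well.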

We demonstrate the above procedure as follows.

Example 14.

Let

A \in R^{[4, 2]}

be the weakly symmetric tensor whose associated homogeneous polynomial is

f_{A} (x, y) = 4 x^{4} y^{2} - x^{2} y^{4} = x^{2} y^{2} (4 x^{2} - y^{2}) .

Clearly,

Δ (A) = \emptyset

and

f_{A} (e_{1}) = f_{A} (e_{2}) = 0

. However, by choosing

\hat{e} = (1, 1)

,

f_{A} (\hat{e}) = 3

. It is easy to check

p (t) = f_{A} (t \hat{e} + (x, y)) = 4 {(t + x)}^{4} {(t + y)}^{2} - {(t + x)}^{2} {(t + y)}^{4}

has all real roots:

t = - x

,

t = - y

,

t = - 2 x + y

, and

t = - \frac{1}{3} (2 x + y)

. Hence,

f_{A}

is hyperbolic in direction

\hat{e}

. We normalize

\hat{e}

to be the unit vector

e = (\frac{1}{\sqrt{2}}, \frac{1}{\sqrt{2}})

, then

f_{A} (e) = \frac{3}{8}

. Next, we calculate to see that

\begin{matrix} U (f_{A}) = 16, {\hat{d}}_{2} (e) = \frac{1}{256} \approx 0.0039, | | f_{A} {| |}_{\infty} \approx 0.5251, and \\ d^{*} (e) = \sqrt{2 - 2 \sqrt{1 - {(\frac{3 / 8}{0.5251})}^{1 / 2}}} \approx 1.1013 . \end{matrix}

It is not difficult to see that the zero Z-eigenvectors are

z_{1} = (1, 0), z_{2} = - z_{1}, z_{3} = (0, 1), \text{ and } z_{4} = - z_{3} .

The nearest zero Z-eigenvector to e is

e_{1}

, which satisfies

| | e - z_{1} {| |}_{2} \approx 0.7654

. Next, if we choose

e^{⊥} = (- \frac{1}{\sqrt{2}}, \frac{1}{\sqrt{2}})

, then

f_{A} (e^{⊥}) = \frac{3}{8}, {\hat{d}}_{2} (e^{⊥}) \approx 0.0039, and d^{*} (e^{⊥}) \approx 1.1013 .

The nearest zero Z-eigenvector to

e^{⊥}

is

e_{2}

, which satisfies

\| e^{⊥} - z_{3} \|_{2} \approx 0.7654

. Hence, the nested spheres of inner radius

0.0039

and outer radius

1.1013

centered at e and

e^{⊥}

projectively encompass all zero Z-eigenvectors, as can be seen in the figure below.
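The nested-spheres statement of this example can be checked numerically. In the Python sketch below, "projectively" is read as "up to the sign of the eigenvector"; that reading, the helper name `projective_distance`, and the variable names are our assumptions. The radii and the four zero Z-eigenvectors are the values reported above.

```python
from math import dist, sqrt

centers = {
    "e": (1 / sqrt(2), 1 / sqrt(2)),
    "e_perp": (-1 / sqrt(2), 1 / sqrt(2)),
}
# Zero Z-eigenvectors z_1, ..., z_4 reported in Example 14.
eigenvectors = [(1.0, 0.0), (-1.0, 0.0), (0.0, 1.0), (0.0, -1.0)]
inner, outer = 0.0039, 1.1013  # reported radii


def projective_distance(c, z):
    """Distance from c to the closer of z and -z (assumed reading)."""
    return min(dist(c, z), dist(c, tuple(-zi for zi in z)))


inside = {}
for name, c in centers.items():
    for z in eigenvectors:
        d = projective_distance(c, z)
        inside[(name, z)] = inner <= d <= outer
        print(name, z, round(d, 4), inside[(name, z)])
```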

Author Contributions

Conceptualization, K.P. and T.Z.; methodology, K.P. and T.Z.; writing—original draft preparation, K.P. and T.Z.; writing—review and editing, K.P. and T.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Lim, L.H. Singular values and eigenvalues of tensors, A variational approach. In Proceedings of the 1st IEEE International Workshop on Computational Advances of Multi-Tensor Adaptive Processing, Le Gosier, France, 13–15 December 2005; pp. 129–132.
2. Qi, L. Eigenvalues of a real supersymmetric tensor. J. Symbolic Comput. 2005, 40, 1302–1324.
3. Qi, L. Eigenvalues and invariants of tensors. J. Math. Anal. Appl. 2007, 325, 1363–1377.
4. Qi, L.; Sun, W.; Wang, Y. Numerical multilinear algebra and its applications. Front. Math. 2007, 2, 501–526.
5. Chang, K.; Pearson, K.; Zhang, T. On eigenvalue problems of real symmetric tensors. J. Math. Anal. Appl. 2009, 350, 416–422.
6. Chang, K.; Pearson, K.; Zhang, T. Some variational principles of the Z-eigenvalues for nonnegative tensors. Linear Algebra Appl. 2013, 438, 4166–4182.
7. Chang, K.C.; Qi, L.; Zhang, T. A survey on the spectral theory of nonnegative tensors. Numer. Linear Algebra Appl. 2013, 20, 891–912.
8. Hu, S.; Qi, L. Convergence of a second order Markov chain. Appl. Math. Comput. 2014, 241, 183–192.
9. Kolda, T.; Mayo, J. Shifted power method for computing tensor eigenpairs. SIAM J. Matrix Anal. Appl. 2011, 34, 1095–1124.
10. Liu, Y.; Zhou, G.; Ibrahim, N.F. An always convergent algorithm for the largest eigenvalue of an irreducible nonnegative tensor. J. Comput. Appl. Math. 2010, 235, 286–292.
11. Ng, M.; Qi, L.; Zhou, G. Finding the largest eigenvalue of a nonnegative tensor. SIAM J. Matrix Anal. Appl. 2010, 31, 1090–1099.
12. Bauschke, H.H.; Güler, O.; Lewis, A.S.; Sendov, H.S. Hyperbolic polynomials and convex analysis. Can. J. Math. 2001, 53, 470–488.
13. Pemantle, R. Hyperbolicity and Stable Polynomials in Combinatorics and Probability. Available online: https://www2.math.upenn.edu/~pemantle/papers/hyperbolic.pdf (accessed on 1 July 2023).
14. Shub, M. On the distance to the zero set of a homogeneous polynomial. J. Complexity 1989, 5, 303–305.
15. De Lathauwer, L.; De Moor, B.; Vandewalle, J. On the best rank-1 and rank-(R₁, R₂, ⋯,R_N) approximation of higher-order tensor. SIAM J. Matrix Anal. Appl. 2000, 21, 1324–1342.
16. Kofidis, E.; Regalia, P.A. On the best rank-1 approximation of higher-order supersymmetric tensors. SIAM J. Matrix Anal. Appl. 2002, 23, 863–884.
17. Wolfram Research, Inc. Mathematica Online; Wolfram Research, Inc.: Champaign, IL, USA, 2023; Available online: https://www.wolfram.com/ (accessed on 1 July 2023).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
