1. Introduction
The study of polynomial roots has been a fundamental area of mathematics for centuries, with applications ranging from pure algebra to various fields of science and engineering [
1,
2,
3,
4]. In recent years, the use of perturbation theory in estimating polynomial roots has gained significant attention, providing valuable insights into the behavior of roots under small parameter changes [
5]. Perturbation theory is also crucial in studying random polynomials, where coefficients are drawn from probability distributions. The analysis of how small random perturbations affect the roots of polynomials provides insights into the stability and sensitivity of polynomial systems under uncertainty [
6]. This approach has found applications in various fields, including the study of eigenvalues of random matrices and the behavior of complex systems [
7].
Galántai and Hegedüs [
8] present perturbation bounds on polynomial zeros that improve the results of Ostrowski, Elsner et al., and Beauzamy while also enhancing the backward stability result of Edelman and Murakami. Pakdemirli and Yurtsever [
9] offer theorems to estimate the order of magnitude of roots in polynomials, aiding in a priori estimation before solving the equation. Choque-Rivero and Garza [
10] study moment perturbation of matrix orthogonal polynomials, establishing an explicit relation between perturbed and original polynomials. In a significant study, Maiz [
11] applies perturbation theory to derive polynomial solutions for real quantum potential systems, providing a useful tool for verifying and improving numerical and approximate methods in quantum theory. Recently, Dmytryshyn [
12] developed an algorithm to recover a perturbation of a matrix polynomial from a perturbation of its first companion linearization. The accuracy and stability of numerical algorithms used in perturbation analysis are critical considerations, as highlighted by Higham [
13]. These aspects become particularly important when dealing with high-degree polynomials or those with closely spaced roots, where small perturbations can lead to significant changes in root locations. The advancement of computational power has enabled researchers to employ sophisticated statistical models and simulations to study the behavior of polynomial roots under various perturbation scenarios. These computational approaches complement theoretical results and provide valuable insights into the complex dynamics of perturbed polynomial systems.
Ostrowski [
14] provided a fundamental result on the continuity of polynomial roots with respect to their coefficients and provided bounds on the distance between the origins of two polynomials based on the differences in their coefficients. His theorem gives bounds on the distance between the roots of two polynomials based on the differences in their coefficients.
Theorem 1 (
Ostrowski [
14])
. Let and . For any root of , there exists a root of such thatwhere . Furthermore, the roots of p and q can be enumerated as and , respectively, in such a way that Research in this direction has been further developed and refined by subsequent researchers [
15,
16,
17,
18], contributing to a more comprehensive understanding of polynomial root perturbation. Bhatia et al. [
17] extended the study of root perturbations to matrix eigenvalues. They established an upper bound for the variation of eigenvalues between two matrices.
Definition 1. Let . Assume that and . Let be the set of all permutations of . The eigenvalue variation of A and B is defined by The quantity is also called the (optimal) matching distance between the eigenvalues of A and
Bhatia et al. improved Ostrowski’s matrix perturbation theorem in several aspects.
Theorem 2 (
Bhatia et al. [
17])
. Let . Then Beauzamy further refined the analysis of polynomial root perturbations by considering the multiplicity of roots. For their result, we first need the following definition.
Definition 2. For the polynomial , the Bombieri norm is defined by The theorem provides a more precise bound that takes into account the derivative of the polynomial at the root.
Theorem 3 (
Beauzamy [
18])
. Let be an integer and and be two polynomials of degree n, with . If is any zero of with multiplicity k, there exists a zero of , with Further, if then The work of Pakdemirli and Yurtsever [
9] introduced a novel approach to estimating roots of polynomials using perturbation theory. In [
9], Pakdemirli and Yurtsever, among other things, established the following theorems.
Theorem 4. For a polynomial equation of the form
If all coefficients are of the same order of magnitude, then the roots are of .
Theorem 5. For a polynomial equation of the form
If with all other coefficients being of , then the possible roots may be of either of the following:
Their method, based on the order of magnitude of coefficients, provided a framework for predicting the approximate magnitude of roots before solving the equation explicitly. This approach has proven particularly useful in cases where traditional numerical methods may struggle, such as when dealing with ill-conditioned polynomials or when a quick estimate is needed before applying more computationally intensive techniques [
19]. These examples illustrate the continued interest and wide interface of perturbation theory, especially in the context of polynomials. The applications of this theory extend to various fields, including control systems [
1], signal processing [
20], and electrical engineering [
3]. Numerical analysis techniques, crucial for implementing perturbation methods, are extensively covered in works such as [
2,
19,
21].
The interdisciplinary nature of polynomial perturbation theory is further evidenced by its applications in nonlinear dynamics [
22], computer-aided geometric design [
23], and quantum chemistry [
24].
In this paper, we aim to further expand the scope of perturbation-based root estimation techniques. We present several new theorems that address previously unexplored polynomial structures, including the following:
Polynomials with multiple large coefficients.
Polynomials with coefficients of different orders.
Polynomials with alternating coefficient orders.
Polynomials with large linear and constant terms.
Polynomials with exponentially decreasing coefficients.
Bounds of zeros of a randomly perturbed polynomial.
These theorems provide a comprehensive framework for estimating root magnitudes across a wide range of polynomial types, significantly extending the applicability of perturbation theory in root-finding problems. Moreover, we provide rigorous proofs for each theorem, along with illustrative examples that demonstrate their practical application.
Our work not only contributes to the theoretical understanding of polynomial root behavior but also has potential applications in various fields where quick root estimation is crucial. These include control systems design [
1], signal processing [
20], and numerical analysis [
13], among others.
1.1. Key Concepts
In perturbation theory, the order of a coefficient refers to its magnitude relative to a small parameter . Several examples follow:
means the coefficient is of order 1.
means the coefficient is much larger than 1, approximately .
means the coefficient is much smaller than 1, approximately .
1.2. Limitations
Despite the advancements presented in this paper, several limitations should be acknowledged. The perturbation-based approach provides estimates of root magnitudes rather than exact root values, which may not be sufficient for all applications. The accuracy of the estimates may decrease for polynomials with very closely spaced roots or in cases where multiple perturbation effects interact in complex ways. It is important to note that the theorems presented are primarily applicable to scalar polynomials and may not directly extend to matrix polynomials or multivariate cases without further development; while the method offers quick estimates, it does not replace the need for rigorous numerical methods in applications requiring high precision. Additionally, the approach may have limited effectiveness for polynomials with highly oscillatory coefficient patterns or those arising from certain transcendental equations. These limitations present opportunities for future research and refinement of the perturbation-based root estimation techniques presented in this paper. Addressing these challenges could further enhance the applicability and reliability of these methods across a broader range of mathematical and engineering problems.
The rest of this paper is organized as follows:
Section 2 presents the main theorems and their proofs.
Section 3 provides numerical simulations and discussions.
Section 4 explores the potential applications and implications of our results. Finally,
Section 5 concludes this paper and suggests directions for future research.
1.3. Motivation
The motivation for this research stems from several key factors. Primarily, the ubiquity of polynomial equations in various scientific and engineering fields, including control systems [
1], signal processing [
20], and electrical engineering [
3], underscores the importance of efficient root estimation techniques. There is a pressing need for quick and reliable estimation of polynomial roots, especially in cases where traditional numerical methods may struggle, such as with ill-conditioned polynomials [
19].
2. Main Theorems
Theorem 6 (
multiple large coefficients)
. Consider a polynomial equation of the formAssume there are multiple coefficients of order for some , where these coefficients correspond to indices p and q satisfying . The remaining coefficients, including the constant term and the leading term , are of order . Then, the roots are of order or , where p is the smallest index and q is the largest index among the large coefficients.
Proof. Let p be the smallest index and q be the largest index such that . We will consider two cases for the possible order of magnitude of the roots.
Case 1: Assume , where .
In this case, the term
will be the dominant term among those with large coefficients, as it has the smallest power of
x. We need to balance this term with the constant term
:
Substituting the orders of magnitude:
For relation (2) to hold, the exponent must satisfy
Therefore, one possible order for the roots is .
Case 2: Assume , where .
In this case, the term
will be the dominant term among those with
coefficients. We need to balance this term with
, which is the dominant term among those with large coefficients:
Substituting the orders of magnitude:
For Equation (3) to hold, the exponents must satisfy
Therefore, another possible order for the roots is .
Thus, we have shown that the roots are of order or , where p is the smallest index and q is the largest index among the large coefficients. □
Theorem 7 (
coefficients of different orders)
. Consider a polynomial equation where the coefficients have different orders of magnitude:If for , where some may be zero or positive, then the possible orders of the roots are given byfor any pair of distinct indices . Proof. We will prove this theorem by considering the balance between any two terms in the polynomial and showing that this leads to the stated order of magnitude for the roots.
Let
i and
j be any two distinct indices from the set
. Consider balancing the
i-th and
j-th terms of the polynomial:
Substituting the orders of magnitude for the coefficients:
Assume that
for some real number
r. Substituting this into the above equation:
For the relation (4) balance to hold, the exponents must be equal:
Therefore, one possible order for the roots is .
Since this argument holds for any pair of distinct indices i and j, we have multiple possible orders for the roots, each corresponding to a different pair of terms being balanced.
Thus, we have proven that the possible orders of the roots are given by for any pair of distinct indices . □
Theorem 8 (
alternating coefficient orders)
. Consider a polynomial equation of the formwhere the coefficients alternate between two orders of magnitude:with and . Then the roots of this polynomial are of order . Proof. Without loss of generality, let n be even. We consider two cases:
Case 1: Assume , where .
Balancing the constant term with the first-order term:
Substituting the orders of magnitude:
Case 2: Assume , where .
Balancing the
n-th term with the
-th term:
Substituting the orders of magnitude:
Expression (7) simplifies to
For Equation (8) to hold, the exponents must be equal:
giving us
In both cases, we find . Therefore, the roots are of order . □
Theorem 9 (
polynomial with large linear and constant terms)
. Consider a polynomial equation of the formwhere and with and . All other coefficients are . Then, the roots of this polynomial are of order , , or . Proof. We consider three cases based on the assumed order of the roots.
Case 1: Assume , where .
Balancing the constant term with the linear term:
Substituting the orders of magnitude:
For r to be positive, we require . Therefore, the roots are of order .
Case 2: Assume , where .
Balancing the
n-th term with the linear term:
Substituting the orders of magnitude:
The expression (10) simplifies to
For Equation (11) to hold, the exponents must satisfy
Thus, the roots are of order .
Case 3: Assume .
In this case, all terms are , and thus, the roots can be of order .
Therefore, the roots are of order (from Case 1), (from Case 2), or (from Case 3). □
Theorem 10 (
exponentially decreasing coefficients)
. Consider a polynomial equation of the formwhere the coefficients decrease exponentially:with , , and . Then, the roots of this polynomial are of order for each . Proof. To determine the order of the roots, we consider the balance between consecutive terms in the polynomial.
Consider balancing the
i-th and
-th terms of the polynomial:
Substituting the orders of magnitude:
Assume
for some real number
r. Substituting this into the above equation:
Subtract
from both sides:
Therefore, the order of the roots corresponding to the balance between the i-th and -th terms is .
Since this argument holds for each pair of consecutive indices i and where , we conclude that the roots of the polynomial are of order for each . That proves the theorem. □
Applications in Finding Bounds of Zeros
As an extension of our results to the study of bounds of zeros of random polynomials, taking inspiration from the work of Sheikh and Mir [
25] about the zero bounds of random polynomials, we present the following theorem that concerns the roots of a randomly perturbed monic polynomial with uniformly perturbed coefficients. In the study of polynomials with perturbed coefficients, understanding the impact of these perturbations on the roots is crucial for both theoretical and practical applications. The following theorem addresses this by considering two classes of perturbed polynomials by employing the Cauchy bound for roots. Specifically, we consider the case when all but the leading coefficient receives a uniformly distributed perturbation in the interval
each modeled as independent uniform random variables. By deriving an upper bound on the expected maximum modulus of the polynomial’s zeros, the theorem provides insights into how these specific perturbations influence the distribution of the roots.
Theorem 11 (
expected maximum modulus of zeros with uniform perturbations)
. Let be a polynomial where each is a random variable representing a small perturbation, modeled as where means being uniformly distributed over the interval . The expected maximum modulus of the zeros of is bounded above bywhere are the zeros of where is the Cauchy bound of the zeros of polynomial Before proceeding to the proof, we need a lemma which we state and prove as follows:
Lemma 1 (
expected maximum of uniform variables)
. For , the expected maximum of n such variables is Proof of Lemma 1. Let , where each is independently and identically distributed as .
The cumulative distribution function (CDF) of a single uniform variable
is given by
The CDF of the maximum
M of
n such independent variables is
The probability density function (PDF) of
M is the derivative of the CDF:
The expected value of
M is
Change variables: let
, then
and
. The limits of integration change from
to
to
to
.
Thus, the expected maximum of
n uniformly distributed variables is
□
Proof of Theorem 11. Let . We claim that the maximum modulus of the zeros of is bounded by , where R is the bound on the maximum modulus of the zeros of the unperturbed polynomial .
Consider the perturbed polynomial:
where each
is a random variable representing a small perturbation, modeled as
.
For the unperturbed polynomial:
Cauchy’s bound provides an upper limit on the magnitude of the roots:
For the perturbed polynomial
, the coefficients are
. The bound on the magnitude of the roots becomes
The maximum can be decomposed as
Define the maximum perturbation as
M:
Substituting the decomposed maximum into the bound for the perturbed polynomial, we have
Thus, the bound for the perturbed polynomial is
This derivation shows that the bound accounts for both the original bound R and the maximum effect of the perturbations M, ensuring that the magnitude of the roots is appropriately bounded.
Thus, the expected maximum modulus of the zeros is
Since
R is a constant, we can separate the expectation:
Substituting the bound for
using Lemma 1, we obtain:
That establishes the theorem. □
4. Conclusions
In this paper, we have significantly extended the application of perturbation theory to the estimation of polynomial roots. Building upon the foundational work of Pakdemirli and Yurtsever [
9], we developed and proved several novel theorems that address a wide range of polynomial structures while developing a theorem illustrating how the work can be extended to randomly perturbed polynomials. The results we study include polynomials with multiple large coefficients, coefficients of different orders, alternating coefficient orders, large linear and constant terms, and exponentially decreasing coefficients. These theorems provide a comprehensive framework for estimating the order of magnitude of polynomial roots based on the structure and magnitude of their coefficients. Our results offer several key contributions to the field. First, they extend the applicability of perturbation-based root estimation to a broader class of polynomials, including those with complex coefficient patterns. Second, they provide quick and reliable estimates of root magnitudes without the need for explicit root-finding algorithms, which can be particularly valuable in time-sensitive applications or as a precursor to more detailed numerical analysis. Third, they offer insights into the relationship between coefficient patterns and root behavior, deepening our theoretical understanding of polynomial properties.
The numerical examples presented for the randomly perturbed polynomials with uniformly and identically distributed perturbation errors in this paper demonstrate the utility of our theorems across various scenarios that might arise in practical situations. These results suggest that our approach could be particularly valuable in fields such as control systems, signal processing, and numerical analysis, where quick estimation of polynomial roots is often necessary.
However, it is important to note the limitations of our approach. While these theorems provide excellent estimates for the order of magnitude of roots, they do not give exact root values. Additionally, in some cases with very close root magnitudes, the method may not distinguish between individual roots.
Future research directions could include extending these techniques to multivariate polynomials, investigating the application of these methods to transcendental equations, developing algorithms that combine these estimation techniques with traditional root-finding methods for improved efficiency, and exploring the implications of these results in specific application areas, such as filter design or stability analysis of dynamical systems.
In conclusion, this work represents a significant advancement in the field of perturbation-based polynomial root estimation. By providing a more comprehensive set of tools for analyzing polynomial roots, we hope to facilitate advancements in both theoretical mathematics and practical applications across various scientific and engineering disciplines.
5. Potential Applications in Various Scientific Fields
The novel theorems and techniques presented in this paper for estimating polynomial roots using perturbation theory have potential applications across a wide range of scientific and engineering disciplines. These applications stem from the ubiquity of polynomial equations in modeling complex systems and phenomena. In the field of random polynomials, where coefficients are drawn from probability distributions, perturbation analysis helps in understanding the stability and sensitivity of roots to small random variations [
6]. This approach is particularly useful in fields such as statistical physics, where it aids in studying phase transitions and critical phenomena. Furthermore, perturbation techniques applied to random polynomials contribute to the analysis of eigenvalue distributions in random matrix theory, which has far-reaching implications in quantum chaos, wireless communications, and financial mathematics In the field of control systems engineering, our results can significantly impact the analysis and design of feedback control systems. The characteristic equation of a linear time-invariant system is often represented as a polynomial, and its roots determine the system’s stability and transient response [
1]. Our quick estimation methods can provide engineers with rapid insights into system behavior during the design phase, potentially streamlining the iterative process of controller tuning.
Signal processing is another area that could benefit from these advancements. Many digital filters are designed based on polynomial equations, where the roots of these polynomials correspond to the filter’s poles and zeros [
20]. Our theorems could offer a more intuitive understanding of how coefficient changes affect filter characteristics, possibly leading to more efficient filter design algorithms.
In the realm of numerical analysis, our work has implications for improving root-finding algorithms. By providing good initial estimates of root magnitudes, our methods could enhance the convergence speed and reliability of iterative root-finding techniques such as the Newton–Raphson method or Muller’s method [
21]. This could be particularly valuable when dealing with high-degree polynomials or ill-conditioned systems.
The field of computational physics often involves solving polynomial equations that arise from discretized differential equations. Our perturbation-based approach could offer physicists quick insights into the behavior of these discretized systems, potentially guiding the choice of numerical methods or helping to identify potential instabilities in simulations [
2].
In electrical engineering, particularly in circuit analysis and design, polynomial equations frequently appear in transfer functions and network analysis. Our methods could provide engineers with rapid estimates of circuit behavior, assisting in the early stages of design or in troubleshooting complex systems [
3].
The study of dynamical systems, which spans multiple disciplines including mathematics, physics, and biology, often involves analyzing polynomial equations. Our theorems could offer new tools for understanding bifurcations and stability changes in these systems, potentially leading to new insights in areas such as population dynamics, chemical kinetics, and climate modeling [
22].
In the field of computer graphics and geometric modeling, polynomial equations are used extensively in curve and surface representation. Our quick estimation techniques could potentially optimize algorithms for intersection detection or shape manipulation, leading to more efficient rendering and modeling software [
23].
Quantum chemistry calculations often involve solving polynomial equations that arise from approximations to the Schrödinger equation. Our methods could potentially assist in the rapid estimation of molecular orbital energies or in the analysis of potential energy surfaces [
24].
While these potential applications are diverse, they all share a common thread: the need for quick, reliable estimates of polynomial roots. Our work provides a theoretical foundation that could be adapted and refined for these specific contexts, potentially leading to advancements across multiple scientific disciplines. As researchers and practitioners in these fields begin to apply and extend our results, we anticipate a rich interplay between theory and application, driving further innovations in both perturbation theory and its practical implementations.