1. Introduction
Vibration problems with uncertain data are common in engineering. From the point of view of applied mathematics, development of the solution methods for this class of problems remains one of the final frontiers. Various approaches for stochastic eigenproblems have been proposed in relation to so-called stochastic finite element methods (SFEM). As always with probabilistic problems, the baseline solution technique is the Monte Carlo method [
1]. The justification for the new approaches is that they have provably faster convergence rates than the Monte Carlo. Broadly speaking, the methods can be divided into invasive (e.g., Galerkin methods [
2,
3]) or non-invasive ones (for instance, stochastic collocation [
4]). Our discussion here applies to the problem of resolving the smallest eigenmode when the design configuration has uncertainties.
Some applications of uncertainty quantification to investigate the static and dynamical responses of beam, plates, and shells with random parameters are found in the literature. We mention two papers that also have considered Young’s modulus as a random quantity. Bahmyari et al. [
5] provided a bending analysis of thick plates with elastically restrained edges and resting on Pasternark elastic foundation, where the plate’s Young’s modulus and stiffness of the boundary conditions and elastic foundation were assumed as random variables. The free vibrations of orthotropic plates with uncertainties in the Young’s modulus were investigated by Ernst et al. [
6] who obtained good correlation between experimental and numerical results.
In computational vibration or dynamic analysis of deterministic problems, the methods used in static loading problems can be applied directly. For the stochastic problem, this useful connection between dynamic and static problems is broken. The uncertainties, such as fluctuations in the material parameters, are random variables represented using series with random coefficients that are the stochastic parameters. One immediate consequence is that normalization of the eigenmodes has to be valid
over the whole parameter space. This global normalization constraint in fact makes the problem nonlinear as opposed to its deterministic counterpart. Moreover, current mathematical analysis (see, e.g., [
4,
7]) is based on second-order model problems where it is guaranteed that the smallest mode is well-isolated and hence the question of subspace convergence is still not resolved and is the subject of current research.
Our interest is in static analysis with low-rank approximation which is enabled by accurately resolving the correct eigenspaces. Identification of subdomains with high variance in stress components, say, could be useful in sensor placement design for online failure detection (For an exhaustive recent review, see [
8]).
What distinguishes the vibration problems in mechanics from the classical model problem in analysis is that the smallest mode can in fact be of higher multiplicity. This follows directly from symmetries. For planar elasticity, our chosen setting, free vibration on a symmetric domain, leads to double eigenvalues or clusters of dimension two. This makes it the ideal model problem for stochastic eigenvalue problems.
Let us consider free vibration on a square plate (see
Figure 1). The smallest eigenvalue is a double one, and two modes have simply interchanged the respective profiles on the vector fields
u and
v. However, if the domain is perturbed, for instance, extended in either one of the coordinate directions, the cluster breaks apart and either one of the modes becomes the true smallest one. If this phenomenon is described in terms of uncertain domains, we see that the eigenpairs
and
, say, depend on the random variable
, and crucially, if we trace the smallest eigenvalue
, the corresponding eigenmode may change, that is, the eigenmode is not consistent over the whole parameter space and we say that the eigenvalues
cross. Hence, to emphasize this nontrivial dependence of the eigenmodes on
, we call
the
effective smallest eigenvalue over the parameter space. We note that considering the corresponding eigenmode is problematic in the sense that such an eigenmode would change discontinuously in the proximity of the crossing.
This paper has two objectives: First, we illustrate that the question of the effective smallest eigenvalue can arise within planar elasticity, and second, despite the inherent ambiguity of the eigenvalues within a subspace, a basis of the subspace can be used in low-rank approximation of standard source problems. In the computational experiment, we demonstrate the efficacy of the subspace construction in terms of computational complexity in a problem with non-trivial geometry. Our model problem, an idealized 2D crankshaft section with random Young’s modulus, was initially proposed as an initial step in a metal fatigue problem related to ship engines.
The rest of the paper is structured as follows: First we cover preliminaries in
Section 2, including the Navier’s equations of elasticity, their stochastic formulation, and the collocation method used here. Subspace construction is given in
Section 3. Eigenvalue crossing is discussed in
Section 4. Computational study of a 2D crankshaft model highlights the low-rank solution method in
Section 5. Finally, conclusions are drawn in the final section.
2. Preliminaries
In this section, we give a brief overview of the (stochastic) model of planar elasticity and the collocation method used in the solution of stochastic problems.
2.1. Navier’s Equations of Elasticity
We use the Navier’s equations of elasticity to model the system of interest. Let
D be a domain representing a deformable medium subject to a body force
and a surface traction
. The 2D model problem is then to find the displacement field
, and the symmetric stress tensor
, such that
where
is a partitioned boundary of
D. The Lamé constants are
with
E and
being Young’s modulus and Poisson’s ratio, respectively. Further,
is the identity tensor,
denotes the outward unit normal to
, and the strain tensor is
The vector-valued tensor divergence is
This formulation assumes a constitutive relation corresponding to linear isotropic elasticity with stresses and strains related by Hooke’s generalized law
2.1.1. Weak Formulation
Let us introduce a function space
, and state: Find
such that
where the bilinear form
is the integrated tensor contraction
and
denotes the linear functional
As usual, we define the kinematic relation
and specify the constitutive matrix
for the purpose of rewriting the bilinear form as
2.1.2. Stochastic Formulation
We introduce a stochastic extension of Navier’s equations of elasticity, which we will consider for the remainder of this paper. Letting
denote a probability space, where
is a space of outcomes,
a sigma-algebra of events, and
a probability measure on
, respectively, we model the uncertainties in the domain and material parameters of the physical problem (
1) by assuming that there exists an explicit dependence of these quantities on the stochastic variable
via a priori known mappings
,
,
, and
.
The stochastic problem can then be stated as: find the displacement field
and stress tensor
such that
for
P-almost every
. Here the stochastic dependence
is propagated also into the Lamé constants
and
via the Equation (
2).
Analogously to the deterministic case, the stochastic system (
13) can also be expressed in weak form. Let us define stochastic extensions for the forms (
7) and (
9) by setting
and
The weak form of the stochastic problem then reads: find
such that
for
P-almost every
.
The spatial and stochastic dimensions of the system are decoupled: each individual realization of
fixes the measurement configuration—the geometry
and the inputs
,
and
– and corresponds to one instance of the deterministic problem (
1), which can be solved numerically using FEM. The stochastic problem can be solved through stochastic collocation, which is tantamount to solving an ensemble of deterministic problems corresponding to a list of collocation nodes
in
.
The solution of the stochastic problem can be used for determining the response statistics of any quantity of interest defined for the solution pair
with respect to any assumed uncertainties in the initial measurement setting. We consider the expectations and variances of the norms of the stresses of the stochastic elasticity problem given by
where
stands in for the spatial
and
norms, respectively.
2.1.3. Stochastic Eigenproblem
The stochastic eigenproblem corresponding to the Navier’s equations given in (
13) is obtained by replacing the body force
with a multiple of the displacement field
itself. The problem then reads: find the eigenvalue
, the eigenvector
, and the stress tensor
such that
for
P-almost every
. To make the eigenmodes physically meaningful, we also impose that
is normalized in
for
P-almost every
.
The weak form of the stochastic eigenproblem is: find the eigenvalue
and eigenvector
such that
for
P-almost every
. For each fixed
, the eigenproblem (
19) admits a countable number of eigenvalues that can be enumerated in increasing magnitude (counting multiplicities)
and corresponding eigenvectors
that form a basis of
. However, due to the eigenvalue crossings mentioned in the introduction, this enumeration of the eigenmodes becomes problematic when the eigenmodes are viewed as functions of
. Instead, in
Section 3, we consider resolving a basis for the subspace corresponding to a finite set of the eigenvalues and hence effectively avoid the question about the enumeration.
2.2. Stochastic Collocation
Standard stochastic finite element methods, in particular the stochastic collocation method considered here, assume that the dependence on the abstract variable
can be parametrized using a countable set of mutually independent random variables. Such a parametrization can sometimes be known a priori or it can be estimated statistically. For the material parameters such as the Young’s modulus
, a representation of the form
is typically assumed. Here
are spatial stochastic fluctuations decaying in
-norm, as
and
is a family of random variables assumed to be i.i.d. The series (
22) can be viewed as the Karhunen–Loevé expansion of the random underlying random field, in which case the functions
are the eigenfunctions of some covariance function
. In some simple geometric settings, these eigenfunctions are known explicitly, but they can also be resolved numerically as eigenvectors of the sampled covariance matrix; see, e.g., (Chapter 8) in [
1,
9] and [
10]. For numerical computations, the series is truncated after
M terms.
In the following paragraphs, we consider an anisotropic Smolyak-type collocation operator defined with respect to a finite multi-index set (see, e.g., [
4,
11,
12,
13]). In principle, a simpler formulation would also suffice for the purpose of our numerical experiments. However, we wish to keep the general form of the operator here, since it allows for efficient computation even in high-dimensional parameter spaces.
For the sake of accuracy, it is standard practice to choose the collocation points to be the abscissae of orthogonal polynomials; see [
14]. Here we formulate the collocation method using Legendre polynomials which are the optimal choice when the input random variables are uniform. For example, in the case of Gaussian random variables, one should use Hermite polynomials instead, but otherwise the collocation method remains the same.
Let
be the univariate Legendre polynomial of degree
p. Denote by
the zeros of
and by
the associated Gauss–Legendre quadrature weights. We define the one-dimensional Lagrange interpolation operators
via
where
are the related Lagrange basis polynomials of degree
p. Observe that
Now let
be a finite set of multi-indices. For
, we write
if
for all
. We assume that
is monotone in the following sense:
The sparse collocation operator is defined as
where
This may be rewritten in a computationally more convenient form
where
. The collocation points are of the form
for some
such that
. Similarly, we define the tensorized quadrature weights
. Statistics, such as the expected value and variance, for the collocated solution may now be computed by applying the quadrature rule
on the terms in Equation (
27).
The accuracy of the collocated approximation is ultimately determined by the smoothness of the solution as well as the choice of the multi-index set
. We refer to [
4,
12] for a detailed analysis. In this paper, we use tensor product grids defined by
as well as multi-index sets of the form
where
is a (typically decreasing) sequence of numbers in the interval
. Multi-index sets of the latter form have previously been employed in, for instance, [
15,
16].
3. Subspace Reduction
Given a finite set of indices
, let us denote by
the subspace associated to the eigenvalues
of the problem (
19). To avoid problems with possible eigenvalue crossings, we define a
canonical basis for the subspace
by setting
i.e., each canonical basis vector
is just the projection of some constant reference vector
onto the subspace
. The reference vectors
can for instance be chosen as the eigenvectors
evaluated at some fixed value of
. Note that as long as the Gram matrix
is non-singular, the vectors
in fact form a basis of
. Moreover, each
is now independent of the enumeration of the eigenmodes
. In the numerical experiments of
Section 5, we choose
to be the eigenvectors evaluated at
for all
m in the series (
22). Finally, observe that we can apply the Gram–Schmidt process to orthogonalize the basis
at each
.
Denote by
and
the stochastic stiffness and mass matrices obtained from a finite element discretization of the Equation (
20). Suppose that
is a matrix representation of the subspace
, more precisely,
has as its columns the discrete approximations of the normalized canonical basis vectors
. Then a low rank approximation of
is obtained from
where
is now a matrix of size
.
Instead of the generalized eigenproblem implied by Equation (
20), one may also consider the standard eigenproblem of the matrix
with eigenvectors normalized in
. If we again collect the discrete approximations of the resulting normalized canonical basis vectors into a matrix
, then an alternative low-rank approximation for
is obtained simply from
4. Eigenvalue Crossing
As we have already seen in the introduction, perturbations in the domain break the lowest mode cluster. Now we demonstrate that perturbations in the material parameters can have the same effect. The two eigenpairs of the cluster are said to cross, that is, the mode associated with the smallest eigenvalue can change within the parameter space.
In the context of our study, we model the Young’s modulus as random and represent it formally as a truncated Karhunen–Loève expansion in the spirit of (
22):
Without loss of generality, we can assume that the distribution involved is uniform and after possible rescaling for all , hence making it possible to employ the collocation scheme outlined above. We set (constant) and , where are coefficients depending on the geometry, and is a decaying sequence with a known rate of decay .
In the interest of reproducibility, as in one of the examples of [
16], the functions
have been chosen independent of any covariance function. For examples of analytic variants in simple geometric setting, or discretized approximations over general domains, see (Chapter 8) in [
1,
17,
18].
We choose
,
,
, and
in (
31), resulting in stochastic dimension
. Crossing can then be demonstrated by taking all
,
; that is, every
has the same constant value. This is shown in
Figure 2. The modes cross at
, since at that point the smallest mode is the cluster of dimension
. Notice that the actual parameter space has dimension
in this case. In order to trace the true smallest eigenpair for all realizations, the computational complexity would be roughly equal to integration problem of the same dimension; in other words, it would be very expensive indeed.
In
Figure 3, it is verified that the (essentially) same two modes appear as before. The practical implication of this phenomenon is that if we sample many manufactured specimens, the variation in observed quantities of interest can actually fall within the uncertainties of the inputs rather than deficiencies in the manufacturing process.
5. Low-Rank Solution: Crankshaft Model
Even though the question of the effective eigenvalues is complicated, the subspace associated to a cluster of eigenvalues is indeed typically well defined and smooth over the whole parameter space. More precisely, it follows from classical perturbation theory that if the eigenvalues in the cluster are separated from the rest of the spectrum for all parameter values and the underlying solution operator depends analytically on the parameters, then the associated subspace also depends analytically on the parameters; see chapter VII (especially Theorem 1.7) in [
19]. Therefore, such a subspace can be used in low-rank approximation. Our numerical experiment concerns a 2D configuration which is an idealized crankshaft model with a double fillet (
Figure 4). This problem was originally proposed as an example of industrial uncertain domains by Prof. J. Könnö (formerly Wärtsilä, now University of Oulu) and studied in a master’s thesis [
20]. In manufacturing, the errors arise from geometric imperfections and variation in material properties. In the spirit of our discussion, we study only the effects of the material parameters.
Since the material parameters (input) are uncertain or random, the quantities of interest are naturally statistical. Under fixed loading— a unit traction applied to a segment of the domain (see
Figure 4)—we are interested in the expected values of the potential and stress components of the solution field components, and more importantly, the variance of these quantities and their distribution over the elastic body, here a 2D domain.
The computational domain S of the crankshaft model is formally formed by subtracting from an L-shaped domain , two circles and , with radii , , , respectively, whose centres , , are uniquely defined by a condition that the tangents of the two circles coincide at a point with an -coordinate of , where the “drop” d is a given parameter. Thus, every parameterized realization has the form .
In the numerical experiments, the three geometry parameters could be modeled as random variables with uniform distributions:
,
, and
. Here we fix the values to their respective expected ones:
We set the Young’s modulus as the random variable as above with decay rate
and tolerance
in (
31), leading to
and
.
The finite element solution was carried out using second-order elements with a fine mesh resulting in each linear system with 45,772 degrees of freedom. This overrefinement was an attempt to ensure that the timing comparisons below were not affected by implementation details. The stochastic subspace has been constructed using the first 100 eigenmodes. With this choice, the average of the relative error of the low-rank approximation in the squared energy norm is less than 1%. The collocation step was carried out using a tensor product grid with 729 nodes (
30).
On an iMac Pro (2017) with 18 cores and 128GB RAM, using Mathematica 12.3.1 [
21], the subspace construction and a single solution set using (
34) combined took
s as opposed to more than 540 s for a full solution. It should be noted that the subspace construction implementation is a proof of concept, whereas the full solution used highly optimized solvers. It is clear that the potential benefits only increase with higher dimensional problems where the number of collocation points is significantly higher.
In the absense of real measurement data, any verification of the convergence of the proposed method has to be done numerically. In probabilistic setting, the Monte Carlo method always serves as the gold standard, since its expected rate of convergence in any aggregate quantity of interest is known. In
Figure 5, the convergence in energy is observed to be as expected: the reference result is taken to be an overkill solution with a combination of a strongly refined mesh and high-order collocation rule.
The potentials and two stress components are shown in
Figure 6 and
Figure 7. What sets these figures apart from the normal exposition is the inclusion of the variance plots. In particular, the information gained from the location of maximal variance is highly useful and in this case coincides with what one would expect in such a configuration. In particular, the variance in stresses is concentrated in the area where the moment due to loading is acting. This gives us high confidence in our results and the efficacy of the approach proposed here.
Of course, the proposed method only provides an approximation. In the current setting, the qualitative results obtained are accurate in the sense that visual comparison of the contour plots obtained with the two methods are indistinguishable. However, the errors in energy must translate to errors in norms of derived quantities.
6. Conclusions
Stochastic eigenvalue problems are nonlinear and multiparametric. They require their own solution methods and remain one of the challenge problems in computational mechanics. It turns out that planar elasticity on symmetric domains is the ideal setting for simplest possible reference problems, where the key is to have a cluster of eigenmodes at the low end of the spectrum.
In this paper, we have shown that the eigenvalue crossing can occur within clusters not only by perturbations of the domain, but also of material parameters. What is new is that in this setting, the crossing can be controlled; that is, the effect of the perturbations can actually be predicted.
Even though eigenvalue crossing means that identifying the smallest eigenpair is a difficult problem in multiparametric problems, the basis of the subspace is a well-defined concept and can be used for instance in low-rank approximation of solutions of problems with static loading. In our example, already with a relatively small number of collocation points, the reduction in solution times is significant.