1. Introduction
Many systems in nature and laboratories are far from equilibrium, constantly changing in time and space and exhibiting very complex behaviour. Examples include turbulence in astrophysical and laboratory plasmas, the stock market, and biological ecosystems. Despite having apparently different manifestations of complexity, these systems have much in common and are often governed by similar nonlinear dynamics. In particular, an ‘ordered’ collective behaviour (e.g., in the form of coherent structures) emerges on the macroscale out of complexity as a novel consequence of self-organisation. For example, in the laboratory, in geophysical and astrophysical systems, coherent structures such as large-scale shear flows (such as zonal flows and streamers in laboratory plasmas, in the atmosphere and oceans, and in giant planets) and differential rotations in the Sun and other stars emerge from small-scale turbulence. There is overwhelming evidence from laboratory experiments, observations, and computational studies that these coherent structures play an absolutely critical role in determining the level of transport in the flow.
In particular, one crucial effect of shear flows is the suppression of transport in the direction orthogonal to the flow (the shear direction) by shear-induced enhanced dissipation [
1,
2,
3,
4,
5,
6,
7,
8,
9,
10,
11]. This occurs as a shear flow distorts fluid eddies, accelerates the formation of small scales, and dissipates them when molecular diffusion becomes effective on small scales. This turbulence regulation leads to the formation of a transport barrier where transport is significantly reduced locally, providing one of the crucial mechanisms for controlling the mixing and transport in a variety of systems. Important examples include (i) the low-to-high (L-H) transition (or internal transport barrier formation), during which a system undergoes a remarkable, spontaneous transition to a more ordered state, despite the increase in free energy (e.g., [
3,
4,
5]); (ii) equatorial winds and polar vortices [
12] (azimuthal flows in the east–west direction) which have long been known to reduce transport, acting as a transport barrier in the latitudinal direction [
13]; (iii) transport barrier due to shear layers [
14] in oceans which is called shear sheltering; and (iv) the solar tachocline—the boundary layer between the stable radiative interior and unstable convective layer which has a strong radial differential rotation—which can also act as a transport barrier, leading to weak anisotropic turbulence and mixing [
5,
7]. Our theoretical predictions of turbulent quenching in different systems have been confirmed by various numerical simulations (e.g., refs. [
15,
16]).
The foregoing statements underscore the importance of self-regulation between small-scale fluctuations and large-scale shear flows. We proposed a one-dimensional (1D) continuous model of self-organised shear flow [
17] by extending a prototypical sand-pile model which evolves in discrete time. Specifically, we considered the formation of a shear flow driven by a short-correlated (white-noise) random forcing, where the shear gradient increases until it becomes unstable according to the stability criterion. For instance, in a strongly stratified medium, the stability is determined by the Richardson criterion: fluctuations on small scales (or internal gravity waves) amplify a shear gradient and thus, act as a forcing until the gradient exceeds the critical value given by the Richardson criterion,
. Here,
N is the buoyancy frequency due to the restoring force (buoyancy) in a stably stratified medium, and
A is the shear gradient with the critical value
. When unstable, the shear flow then relaxes its gradient and generates small-scale fluctuations, and this relaxation was modelled by nonlinear (cubic) diffusion; the shear gradient then grows again when small-scale turbulence becomes sufficiently strong to drive a shear flow. The same cycle repeats itself, exhibiting continuous growth and damping. This highlights that a self-organised state is never stationary in time, but involves persistent fluctuations.
The extension of refs. [
17,
18] solved a stochastic differential equation with a fourth-order stochastic Runga–Kutta method for Gaussian coloured noise in 1D and showed the transition from an unimodal stationary Probability Density Function (PDF) to a bimodal stationary PDF when the correlation time of a random forcing exceeds a critical value. The mean shear gradient is zero for a unimodal PDF, while its non-zero value represents the critical shear gradient around which a shear gradient continuously grows and damps through the interaction with fluctuations. The transition from a unimodal to bimodal PDF represents the formation of a non-zero mean shear gradient, or the formation of jets. Interestingly, In ref. [
18], we found similar results in a 0D model and 2D hydrodynamic turbulence. In particular, the 2D results showed that a shear flow evolves through the competition between its growth and damping due to a localized instability, maintaining a stationary PDF, and that the bimodal PDF results from a self-organising shear flow with a linear profile.
The purpose of this paper is to investigate the evolution of a time-dependent PDF to understand how a given initial (global) shear gradient modelled by a narrow PDF relaxes into a bimodal or unimodal stationary PDF. We are particularly interested in understanding the information geometry associated with this process. Our information geometry theory is based on the Fisher metric [
19] extended to time-dependent problems. (Note that we use information about statistically different states, refraining from the debate on the exact definition of information [
19,
20]). We recall that for a Gaussian PDF whose evolution is described by the movement of a peak and the change in its width, the uncertainty measuring the mean value of
x is set by the standard deviation. Two PDFs with the same standard deviation would differ by one statistical state when their mean values differ by the standard deviation (e.g., see ref. [
21]). To formalise this idea to quantify the information change associated with the time evolution of PDFs [
22,
23,
24,
25,
26,
27,
28,
29,
30,
31,
32], we define an infinitesimal distance at any time by comparing two PDFs at adjacent times and sum these distances. The total distance gives us the number of statistically different states that a system passes through in time and is called the information length (
). While the detailed derivation of
and its applications are given in refs. [
22,
23,
24,
25,
26,
27,
28,
29,
30,
31,
32], it is useful to highlight that
is a measure of the total elapsed time in units of a dynamical timescale for information change. To show this, we define the dynamical time (
) [
22,
23,
24,
25,
26,
27,
28,
29,
30] as follows:
Here,
is the characteristic timescale over which the information changes. Having units of time,
quantifies the correlation time of a PDF. Alternatively,
quantifies the (average) rate of change of information in time.
is then defined by measuring the total elapsed time (
t) in units of
as
measures the cumulative change in
, and depends on the intermediate states that a system evolves through between times 0 and
t. Thus, it is a Lagrangian quantity (unlike entropy or relative entropy) which depends on the time history of
, uniquely defined as a function of time
t for a given initial PDF.
represents the total number of statistically distinguishable states that a system evolves through, providing a very convenient methodology for measuring the distance between
and
continuously in time for a given
. References [
22,
23,
24,
25,
26,
27,
28,
29,
30,
31,
32] showed that
is a new diagnostic for understanding a dynamical system and for mapping out an attractor structure. In particular,
captures the effect of different deterministic forces through the scaling of
against the peak position of a narrow initial PDF. For a stable equilibrium, the minimum value of
occurs at the equilibrium point. In comparison, in the case of a chaotic attractor,
exhibits a sensitive dependence on initial conditions like a Lyapunov exponent.
In this paper, we investigate the evolution of a shear gradient (x) starting from a relatively narrow PDF () with an initial mean value of which represents the mean value of an initial shear gradient. For a unimodal stationary PDF, the mean shear gradient decreases to zero in the long time limit, while for a bimodal stationary PDF with a peak of , the case of models the relaxation of an initial super-critical gradient () to the critical value (), and the case of models the build-up of the gradient from a subcritical initial value to the critical value (). We are interested in the information changes in these processes and in identifying the differences between the relaxation and build-up of the shear gradient in view of these information changes and in mapping out an attractor structure by using .
The remainder of this paper is organised as follows. We introduce our model and provide analytical solutions of time-dependent PDFs in limiting cases in
Section 2. In order to systematically undertake a numerical study, in
Section 3, we first provide a detailed discussion on stationary PDFs for different parameter values to determine the parameter space for unimodal versus bimodal PDFs.
Section 4 provides numerical solutions for time-dependent PDFs and
. The discussion and conclusions are found in
Section 5.
2. Model
In this section, we introduce our model and provide analytical solutions for time-dependent PDFs in limiting cases. As noted in
Section 1, given the universality of self-organisation in 0D, 1D, and 2D models and the challenge of the computation of time-dependent PDFs, we utilised a 0D model to facilitate the calculation of PDFs. Our 0D model is based on the cubic process for a stochastic variable (
x) (e.g., representing a shear gradient). Specifically, we considered
x driven by a finite correlated forcing (
f), governed by the following Langevin equations
Here,
;
are constants;
is a stochastic noise with a short correlation time with the correlation function
The highest cubic nonlinearity in our 0D model mimics a nonlinear cubic diffusion in the 1D model in refs. [
17,
18]. Equation (
3) is the Ornstein–Uhlenbeck process [
33] with the solution
For
, the correlation time of
is approximately
, as follows:
where we assumed
and used Equation (
5). Thus,
x in Equation (
3) is driven by the Gaussian noise with the correlation time
. While the set of Equations (
3) and (4) give a PDF in two dimensions
, it is useful to obtain an approximate PDF in the
x dimension only. To this end, we combine Equations (
3) and (4) to obtain the equation for
x as
and consider the overdamped limit where
is negligible compared with the damping term. This is the so-called unified-colored noise approximation [
34], and turns Equation (
8) into
We observe that for sufficiently small
, to
Equation (
9) is, again, an Ornstein–Uhlenbeck process [
33] for
:
Thus, the mean value of
, where
, decays exponentially in time while the variance,
, evolves according to
where
and
are the inverse temperatures of
and its initial value, respectively. Therefore, the time-dependent PDF of
Q is a Gaussian process and is given by
where
is the inverse temperature that satisfies Equation (
11).
Since
in Equation (
1) and
in Equation (
2) are invariant under the change of variables, the Gaussian PDF of
Q in Equation (
12) provides us with a convenient way of calculating them by utilising the property of the Gaussian PDF. Specifically, for the Gaussian PDF of
Q,
is given by
where the first and second terms on the right-hand side are due to the temporal changes in the width and peak position of the PDF. For sufficiently small
D (large
) and/or large
,
in Equation (
13) is dominated by the second term. Furthermore, with a small
D, Equation (
11) becomes
. Thus, by substituting
,
into Equation (
13), we obtain
where
, and
is the mean position of
x at
. To relate Equation (
14) to what is observed in the PDF of
x, we need to find the initial inverse temperature,
, for
that corresponds to
(which is the inverse temperature of the PDF of
Q at
). To this end, we use
to leading order for
and obtain
For
, Equation (
15) evaluated at
gives us
Equations (
14) and (
16) give us
Thus,
increases linearly with time with a slope that is proportional to
and
(for small time, small
D, small
, and large
). The numerical simulations in
Section 4 examine this behaviour in more detail.
Then, by using the conservation of the probability, the time-dependent PDF of
x is obtained as
It is interesting to note that
in Equation (
18) can be either unimodal or bimodal depending on the values of the parameters. This is discussed in detail in
Section 3.
Having gained some insight into the leading order behaviour of
for small
, we investigate a more general case of Equation (
9). To this end, it is convenient to recast Equation (
9) as
where
. The corresponding Fokker–Planck equation for
is
In Equation (
20), we used the Stratonovich calculus [
33,
35,
36,
37], which recovers the limit of a short correlated forcing from the finite correlated forcing [
37]. Although a time-dependent solution to Equation (
20) is not easily obtained analytically, a stationary solution can be found and is discussed in detail in
Section 3.
3. Stationary PDFs
In order to undertake a systematic numerical study in
Section 4, we here provide a detailed discussion of stationary PDFs for different parameter values, and determine the parameter space for unimodal versus bimodal PDFs. A stationary PDF found from Equation (
18) is
To
, Equation (
21) reproduces Equation (
18). To determine the location of the local maxima and minima of
in Equation (
21), we calculate
For
, Equation (
22) can be rewritten as
Equation (
23) gives the solution
and
, indicating the possibility of the bimodal PDF. We then find the non-zero solution by solving
To this end, it is convenient to make the following three successive changes in variables:
with
,
,
defined as
In order to solve the equation for
Z in Equation (
25), we use the Cardano formula and find the following three roots:
Here,
and
Equation (
27) gives the non-zero solutions of Equation (
24):
where
To find real solutions, we check the discriminant (
) of the last equation of Equation (
25),
as the sign of
determines the number of the real root as follows:
If , then one root is real, and two are complex conjugates,
If , then all roots are real, and at least two are equal,
If , then all roots are real and unequal.
From a detailed analysis of different cases provided in
Appendix A, we conclude that the existence of a bimodal PDF requires
in Equation (
31), and that the peak position of a bimodal PDF is given by
where
Finally, a convenient method of identifying parameter values for unimodal versus bimodal PDFs is to check the sign of
at
:
Since a unimodal PDF takes a local maximum at
when
and a local minimum at
when
, we can see from Equation (
33) that a unimodal PDF with
is more likely for larger
and smaller
D. Alternatively, a finite correlation time of
f (small
) and a large diffusion (
D) facilitate the formation of a bimodal PDF.
To illustrate these results,
Figure 1 and
Figure 2 show how the peak position
and peak amplitude
, respectively, vary with
for a range of
D values.
Figure 3 shows the boundary between the unimodal and bimodal PDFs in the
parameter space. These results are for
, but other values yield the same general boundary shapes, and in particular, the same agreement occurs between the two different evaluation methods,
and (
33). The condition
is therefore a necessary and sufficient condition to have a bimodal PDF.
Figure 4 shows what the PDFs look like and how the transition between unimodal and bimodal PDFs comes about.
4. Numerical Results
We provided analytical solutions for a time-dependent PDF in certain limiting cases, such as small
(e.g., Equation (
12)), large
and small time (e.g., Equation (
17)) in
Section 2, and in the limit of large time, where the PDF settles into a stationary solution, in
Section 3. To obtain exact time-dependent solutions to the Fokker–Planck equation (
22) for any parameter values, we now use numerical methods in this section and utilise results from
Section 3 to perform our numerical simulation systematically. As shown in
Appendix B, we can set
without any real loss of generality by rescaling the other quantities appropriately. The effective parameter space is therefore reduced to
, together with whatever parameters define the initial condition, which we take to be
. That is,
remains fixed, corresponding to a relatively narrow PDF, and the initial peak position (
) is the one additional parameter. The initial condition (
) represents the PDF for an initial shear gradient. When the final stationary PDF is unimodal, the mean shear will decrease to zero in the long time limit; when the stationary PDF is bimodal with a peak of
,
models the relaxation of an initial super-critical gradient (
) to the critical value (
) while
models the build-up of the gradient from an initial subcritical value to the critical value (
). We are interested in the information change in this relaxation problem and in identifying the difference between the relaxation and build-up of the shear gradient in view of the information change. The numerical implementation of Equation (
22) is based on second-order accurate finite-differencing in both
x and
t, with up to
grid points in
x, and timesteps as small as
. The domain in
x is truncated to the interval
rather than the original unbounded interval for which the analytic theory applies. As seen in
Figure 4, for example, for the parameter values of interest here, the PDFs are well-confined to the interval
, making a numerical solution of (
22) with boundary conditions of
at
an excellent equivalent to an infinite interval.
4.1. Time Evolution of PDFs
Figure 5 shows examples of how different values of
ultimately all relax to the same final PDF. Panels (a–d) correspond to
, respectively.
, according to
Figure 3, is slightly in the bimodal regime, consistent with the final PDF seen here.
Figure 6 focuses specifically on how the positions of the peaks evolve in time. Important observations that we can make from
Figure 5 and
Figure 6 are as follows:
- (a)
An initial PDF with a peak at remains unimodal before becoming a bimodal PDF;
- (b)
An initial PDF with a peak at
(0.32 for this case) does not maintain the same peak position at
, but moves outward first to
and then inwards to
. This initial outward movement explains why the minimum
does not occur for
in
Section 4.2;
- (c)
An initial PDF with a peak at
(where
is the
value which minimises
, as defined in
Section 4.2) constitutes the border line between different PDF evolutions (an initial PDF with a peak at
goes outwards and then inwards, while an initial PDF with a peak at
monotonically moves inwards to
);
- (d)
An initial PDF with a peak at monotonically moves inwards.
4.2. Information Length: Attractor Structure
Since
represents the cumulative change in information, it is zero at
and increases with time. As a PDF settles into a stationary PDF in the limits of a large time, the temporal change in PDFs becomes smaller and then becomes zero,
settling to a constant value of
. A typical evolution of
is shown in
Figure 7 for
,
, and 4, and a range of
values. The logarithmic scale on the right makes it especially clear that for small times,
grows linearly in time, before eventually equilibrating to its final value,
. In order to make more precise comparisons with the analytic prediction (
17),
Figure 8 shows the results of extracting a numerically computed slope, call it
, and compares with the analytic expectation
in Equation (
17). That
is expected to scale linearly with
and
and be independent of
D, is reasonably well reproduced by the numerical data with less than a
difference between the theoretical prediction and simulation results (note the small range of the
y-axis).
is a unique representation of the total number of statistically different states that a PDF evolves through to reach a final unimodal or bimodal PDF. The smaller
is, the smaller the number of states that the initial PDF passes through to reach the final equilibrium. Therefore,
provides us with a path-dependent Lagrangian measure of the distance between a given initial and final PDF. Thus, by choosing a narrow initial PDF at different peak positions (
), we can map out the attractor structure (the proximity of
to an equilibrium) by measuring
as a function of
. We were particularly interested in how differently
would behave for the final unimodal and bimodal PDFs, which have different stable equilibrium points:
and
, respectively. To this end,
Figure 9 shows
as a function (
) for a range of
D values. For final bimodal PDFs, the location of the final peak position (
) is shown by a little vertical line.
We note first in
Figure 9 that the overall shapes of the curves are drastically different depending on whether the final PDF is unimodal or bimodal. For a unimodal final PDF, the minimum value of
occurs for
. This is because
is a stable equilibrium for a unimodal PDF and thus, an initial PDF with the peak (
) closer to
undergoes less change during the evolution of time and is more similar to the final PDF. Therefore, the absolute minimum of
occurs at
, as can be seen in the orange and yellow curves in
Figure 9.
In comparison,
is an unstable equilibrium point for a final bimodal PDF, while
, given by Equation (
32), is a stable equilibrium point. Therefore,
has a local maximum around
(unstable point). Naively, the minimum value of
would be expected to occur for an initial PDF with
, that is, when the peak position of an initial PDF (
) coincides with that of the final PDF (
). However, the blue and green curves in
Figure 9 reveal the very interesting fact that
is actually minimised for
. As noted from
Figure 5 and
Figure 6, this is because the initial peaks that are sufficiently far away move inwards monotonically, but the initial peaks near
actually have a more complicated evolution (moving outwards and then inwards).
These observations confirm that
is a good Lagrangian measure that captures the attractor structure and dynamics. It is, thus, of particular interest to compare
with the Kullback–Leibler divergence [
19] (that is commonly used in comparing PDFs), defined as
where
is the initial PDF and
is the final one. Obviously, unlike
,
depends only on the initial and final PDFs, and thus, does not provide any information on dynamics (e.g., what different states an initial PDF passes through in the time evolution, or how the locations and the shapes of the PDFs evolve in time between initial and final PDFs). Since we have an analytic expression for the stationary PDFs, we computed
by numerical integration with the initial PDF used above.
Figure 10 shows these results, where the little vertical lines represent the positions of
.
We can see that the absolute minimum relative entropy always occurs when or for unimodal and bimodal PDFs, respectively, unlike . In retrospect, this is not particularly surprising, since the relative entropy only measures the difference between the two PDFs, and an initial PDF located at the final peak position is most similar to the final PDF. Specifically, for a bimodal PDF, the initial PDF at the peak position of the final PDF has the strongest resemblance to the final PDF, with the minimum occurring for .
For completeness, we also show
in
Figure 11. Unlike
Figure 10, the absolute minimum value occurs at
, even when the final PDF is bimodal, failing to capture the attractor structure associated with a bimodal PDF. Furthermore, the values of
are much larger than those of
, and thus, a symmetric version (
) would be dominated by
. This drastic difference between
and
calls for care in using symmetric versions.
5. Discussion and Conclusions
We investigated the time evolution of PDFs in a toy model of self-organised shear flows using a unified coloured approximation, and utilised the information length to understand information changes and attractor structures. In our model, the formation of shear flows was induced by a finite memory time of a stochastic forcing and was manifested by the emergence of a bimodal PDF, with the two peaks representing non-zero mean values of a shear flow (gradient). We presented a thorough study of PDFs for different correlation time and amplitude values for the stochastic forcing. By solving the Fokker–Planck equation numerically, we investigated the time evolution of PDFs starting with a narrow PDF at different peak positions (
) at time
. The cumulative change in information (
) beautifully maps out the underlying attractor structures. Specifically, for a unimodal PDF, the minimum value of
occurs for
, since
is a stable equilibrium for a unimodal PDF and thus, an initial PDF with a peak (
) closer to
undergoes less change during the time evolution and is more similar to the final PDF; for a bimodal PDF,
is minimised for
, where
is the peak position of a bimodal PDF. Recalling that
represents the mean shear gradient at
while
is a critical shear gradient,
implies that an initial narrow PDF with a super-critical shear gradient is, in fact, more similar to a final stationary state, while an initial narrow PDF with a mean critical shear gradient undergoes a complicated evolution through the interaction with fluctuations. This is likely to be due to the rapid relaxation of instability at the super-critical state, similar to what was observed in the forward process in the phase transition in [
27] (e.g., compare
Figure 6b and
Figure 7b). That is, a process triggered by instability involves a smaller change in information and thus, a larger change in entropy (as might be expected as a consequence of instability). This reflects a unique property of
which depends on a trajectory/history of a PDF evolution. In comparison, the relative entropy, which only measures the difference between the initial and final PDFs, does not provide any information on the dynamics between the initial and final times. In summary, we demonstrated the importance of studying the dynamics and the merit of the information length in understanding the dynamics and the evolution of PDFs in a toy model of self-organised shear flow. Further work will include the extension of this work to the analysis of our model without unified colored-noise approximation and to other turbulence models, in particular, to quantify the information change associated with intermittency and self-organisation.