1. Introduction
In many scientific problems, causal interactions of different processes or process components are clear from a mechanistic understanding of the system dynamics and reflected by model formulation. In other situations, the interaction structure of a large and/or complex system, e.g., the assembly dynamics of a chemical system [
1,
2], of an ecological [
3,
4] or social community [
5,
6] or of neuronal populations [
7,
8], may not be clear in advance. Then, the empirical reconstruction of causal interactions from multivariate time series can be attempted using a suitable concept of causality. Since the seminal works of Wiener [
9] and Granger [
10], this reconstruction is based on the rationale that a causal interaction can be inferred from an increased prediction error when excluding the causing process component from the prediction of the caused component. Operationalizing this idea leads to Granger causality (GC), a measure commonly used to compare causal interactions of different process components. The quantification of a predictive information flow is also behind the definition of transfer entropy [
11], which quantifies the predictive information transfer from the causing to the caused component. As shown by Barnett et al. [
12], Granger causality and transfer entropy are equivalent for Gaussian variables.
From the outlined definition, it is plausible to expect that a stronger coupling will enhance the predictive information flow. On the other hand, it can be expected that an increasing coupling strength will eventually synchronize the caused with the causing variable. The approach to the synchronization manifold can be monitored by the synchronization index (phase locking value) or, for a linear dynamics, by the cross-correlation coefficient. Once the dynamics is tied to the synchronization manifold, a reconstruction is no longer possible because in a synchronous mode of motion leader and follower cannot be told apart—early reports of this insight [
13,
14] were followed by repeated observations of the phenomenon in the context of various directionality indices and synchronization measures (e.g., [
15,
16,
17,
18,
19]), which underpins a generic phenomenon. Combining the synchronization triggered decline of directionality measures with the initial increase in the small coupling regime immediately suggests unimodal curve describing the relation between directionality and coupling strength, where any directionality index vanishes for zero (uncoupled) and infinite (tight coupling limit) coupling strength.
We note that, to the best of our knowledge, all reports of this phenomenon are solely based on numerical or empirical data and subsequent estimation of directionality indices. Moreover, directionality indices are mostly constructed as differences (asymmetries) between directed coupling indices, thus quantifying a two-way net directionality. Quite often estimators of the one-way directionality indices are plagued by a considerable bias and, in particular, non-zero values for uncoupled variables. Computing the net directionality index this bias may (or may not) be removed by forming differences. Therefore, an analytic treatment of a one-way directionality index vs. coupling strength relation for a tractable but generic dynamics seems desirable.
In the following, we address such a system dynamics that follows from a socio-dynamic model originally formulated in the framework of a master equation [
20]. In the thermodynamic limit, the description is shifted to the level of a Fokker–Planck equation [
21] with related drift and diffusion coefficients. The resulting expressions show that the coupling strength enters the diffusion matrix which means that random fluctuations scale up together with increasing coupling strength. Our analytic derivation leads to the interesting result that even in the tight coupling limit a residual directionality remains. Since the resultant dynamics can effectively be mapped to a vector-autoregressive process (linear stochastic process with Gaussian variables), the residual Granger causality can be interpreted as a residual predictive information flow remaining for tight coupling. Although we use a specific model at the outset of our presentation, we argue that the finding of a residual GC is a generic effect.
2. Granger Causality in a Nutshell
To set the stage, we briefly recall how GC is defined in the framework of a (two-dimensional) vector autoregressive process VAR[
∞] (of order infinity). In this case, two time series are fitted by the following full model
with white noise covariance matrix
In describing stable (non-divergent) time series by a VAR, the dynamical coefficients
and
will obey a stationarity condition [
22].
The prediction error of
, using both its own past
and the past of
, is quantified by
, the variance of
. Predicting
only from its own past means to use a reduced univariate autoregressive processes description for the second time series
In passing, we note that the parameters of this reduced model are not fitted independently but derived from Equation (
1) for example through a spectral factorization [
23]. Moreover, based on a state-space formulation of GC, a closed form involving partial covariances can be obtained for finite order vector-autoregressive models [
24].
Now, the prediction error of without inclusion of is quantified by , the variance of . Since in this step no information is added (but rather removed), will never be smaller than , guaranteeing that Granger causality GC (from ) defined as the quantity is non-negative.
VAR processes belong to the model class of time discrete linear stochastic processes. In a recent work, we extended the concept of GC to diffusion processes which constitute a special class of time continuous stochastic processes. Diffusion processes can be formulated either via a Langevin type stochastic differential equation system or via the related Fokker–Planck equation (FPE) [
25]. In the framework of a local linearization of the dynamics and a stroboscopic map (i.e., the temporal integration of the dynamic flow over a time increment
), this allows establishing a connection between drift and diffusion coefficients of the FPE and a local GC. Averaging the local GC with respect to the invariant measure yields a global GC for the original diffusion process. For details, we refer the reader to our recent publication [
26].
3. Weidlich’s Socio-Dynamic Model
Here, we apply this connection to a stochastic socio-dynamics model proposed by Weidlich [
20] to describe opinion formation in a system with two groups,
leaders (Group 1 comprising
members) and
followers (Group 2 with
members). Individuals of each group
can adopt one of two opinions
. Opinion diversity across the community is coded by variables
specifying the number of individuals of group
i sharing opinion
j. Since it is assumed that everybody has an opinion,
. Eventually, the system is considered in the “thermodynamic limit”
with
which suggests a change to quasi-continuous variables
Members of group
i change their opinion from
j to
with a transition rate
that may depend on the present opinion configuration
(with
). In line with Weidlich, we employ the following expressions
with a straightforward interpretation: without coupling (
) leaders and followers have a diametrical preference for one opinion over the alternative one; while leaders flip opinion without reference to the population of followers (
), with increasing coupling followers enhance the rate of switching their opinion to the opinion populated by leaders (
). This means that followers are uni-directionally forced by leaders with the coupling strength scaled by the parameter
.
These rates enter a master equation describing the evolution of a statistical ensemble of interacting social groups. After another change of variables
, reflecting directed imbalance and opinion diversity within group
i, the related Fokker–Planck equation can be written down [
20]. It is completely specified by its drift vector
and its diffusion matrix
Notice that the diffusion matrix is positive definite since . The inverse scaling with and , respectively, means that diffusion vanishes in the “thermodynamic limit” (with ). Moreover, since the parameter p (controlling the intrinsic opinion flip rate) scales both drift vector and diffusion matrix in the same way, it may well be absorbed in a reparameterization of time. Finally, we point out that in the step from the master equation to the FPE the coupling parameter does not only enter the drift vector but also the diffusion matrix.
4. Simulation Results
Being equipped with drift vector and diffusion matrix, we can compute the GC of the socio-dynamic diffusion process (as outlined in [
26]) for various values of the coupling parameter
. This means we compute GCs locally for subsampled vector-Ornstein–Uhlenbeck processes with local additive noise; global GC is then obtained as the statistical average of local GCs with respect to a stationary distribution estimated from the simulation (1,000,000 data points for each
value). Additionally, we compute the GC of a global vector-Ornstein–Uhlenbeck process with constant average-noise. The results are plotted with circle and crosses, respectively, in
Figure 1.
As expected, for
, the GC is zero because in this uncoupled situation inclusion of
cannot reduce the prediction error of
. With increasing coupling, parameter
GC initially increases up to a maximum around
before it decreases again. Surprisingly, we find that a linearized dynamics with average-noise (notice that the only non-linearity
resides in
) with state space averaged additive noise gives rise to the same values (plotted in
Figure 1 with crosses). Therefore, the non-monotonic characteristic of GC requires neither a non-linear dynamics nor multiplicative noise.
5. Analytic Description
To explain the non-monotonic behavior of the GC, we connect the diffusion process specified by Equations (
6) and (
7) with a VAR(1), i.e., vector-autoregressive process of first order. The general connection is elaborated in [
27] and proceeds via the VAR(1) written as
where
and
is a two-component vector of zero-mean Gaussian white noise residuals with a covariance matrix
given by the following matrix integral
where
is the Jacobian matrix of the diffusion process, i.e., of its drift part.
The above connection is derived for the standard vector-Ornstein-Uhlenbeck process with additive noise, which corresponds to a diffusion matrix
that is constant in state space. By contrast, in Weidlich’s model we see that the diffusion coefficients
and
(cf. Equation(
7)) vary with
and
. Since the gradients scale with the inverse of
and
, respectively, for rather large
, the noise is weakly multiplicative [
26]. Therefore, we may consider
as a local covariance matrix. A rigorous derivation would now strive to compute local GC maps and perform a subsequent average over state space (with respect to the stationary probability distribution, essentially a bivariate Gaussian). This route seems feasible but resultant expressions appear far from being handy. Therefore, we approximate state dependent diffusion coefficients by substituting mean values for state variables, i.e.,
The mean values can be read from Equation (
6)
with the consequence that we approximate
In line with these simplifications, we now consider the case of a VAR(1) with the following parameterization
where the parameters
are to be identified with the entries in Equation (
9), i.e.,
where we have used the abbreviation
.
The covariance matrix of residuals is obtained by inserting
and
of Equation (
13) into Equation (
10). The integration can be performed analytically and leads to the following expressions
With
being small (since
and
), this matrix has the eigenvalues
which shows that even in the tight coupling limit the covariance ellipsoid of residuals possesses two finite semi-axes.
Equation (
14) defines a VAR(1) that was recently considered by Barnett and Seth as a minimal model that allows deriving an analytic expression for GC that can be found in [
28] and was restricted to the special case that
was the identity matrix, i.e.,
and
were uncorrelated (
) and both with unit variance (
). Relaxing these restrictions and following the same steps outlined by Barnett and Seth we can derive an explicit formula for GC that is structurally identical to Barnett’s formula even for the most general, bivariate VAR(1) process in Equation (
8) with bidirectional coupling (
)
Through Equations (20) and (
21) in conjunction with Equation (
15), we have everything at hand to evaluate the dependence of GC on the coupling parameter
. Note that the GC does not depend on the parameter
a, and for a fixed covariance matrix is a monotonically increasing function of
c growing to infinity; indeed, in a simple autoregressive model, the parameter
c would be naturally considered (a monotonous function of) the coupling strength
. However, in our example, the dependence of the
on coupling parameter
is more complicated, as it also affects the covariance matrix. Moreover, the dependence is such that there is a delicate balance for infinite coupling strength
between the synchronizing influence of the driver and the diffusion term leading to the finite residual GC. The result is plotted in
Figure 1 (with solid line) and shows that the analytic formula perfectly matches the numerical values (plotted with circles and crosses) extracted from the simulation. The matching of our analytical results, based on Equation (
11), and global GC is surprising but fortunate. The coincidence of global GC and average-noise GC, on the other hand, is not surprising since, for weakly multiplicative noise, the latter is in line with our analytic approach.
The analytic formula allows considering the asymptotic limit
, which is given by inserting
into Barnett’s formula (Equation (20)). Interestingly,
and
again enter this expression only through the ratio
. For the chosen parameters, the resulting asymptotic value
GC is approximately 0.027, which is larger than zero. As the scaling with
indicates, the asymptotic residual GC is a consequence of the residual variance in the direction of the second semi-axes of the covariance ellipsoid in Equation (
18). Residual fluctuations remaining in the tight coupling limit enable an improvement of prediction.
These residual fluctuations reflect the fact that even for infinite coupling strength synchronization is not complete. To show this, we compute the cross-correlation function by solving the Yule–Walker equations [
22]. As can be seen in
Figure 2, the maxima of the curves rise with increasing coupling strength while the related lag approaches zero (in discrete steps) reflecting instantaneous response of the caused variable. However, our analytic treatment reveals that even in the tight coupling limit
the maximal cross-correlation is 0.995 and not one; the reason is, again, the balance between the synchronizing force (exponential relaxation) and the increasing noise strength
. We note that this value is not plagued by estimation errors.
The analytic approach also puts us in the position to sort out how the different system properties contribute to the formation of a maximum in the GC. In particular, we can investigate the effect of coupling dependent diffusivity through replacing
(cf. Equation (
13)) by the constant value
. Through these investigations, we arrive at the following conclusions:
6. Discussion
The increase of GC with is a natural consequence of enhanced information transfer with tighter coupling to the driver. The asymptotic decrease and the formation of a maximum, on the other hand, follow from the fact that the caused variable asymptotically adopts the residual variance of the driver which is two orders larger because , meaning that “every leader has one hundred followers”. This allows predicting that no such maximum can be found if or for , i.e., if the leader group is of comparable or even larger size than the follower group—a statement that we can confirm through our analytic analysis. It is interesting to note that the asymptotic residual GC, i.e., an improvement of prediction in the tight-coupling limit, remains even in the thermodynamic limit , i.e., when the diffusion constants approach zero, provided . This results from the scaling behavior of the variance .
The question whether the found residual GC is a property of the specific socio-dynamic model or a generic property of a larger class of systems can be answered in the following way: our results follow from a minimalistic linear system (vector-Ornstein–Uhlenbeck process or VAR(1)), which means it is not a fancy effect of non-linear dynamics. In this setup, the rGC results from the linear asymptotic scaling behavior of
(Equation (
13)), which balances the
in the denominator of the first term in Equation (
16) resulting from integrating the exponentials (in Equation (
10)) and reflecting the faster relaxation with increasing coupling strength. The question as to which other systems might give rise to a residual GC thus leads back to the linear scaling of the diffusion coefficient
. As the derivation in Weidlich’s original paper [
20] shows, this is a consequence of the ansatz of coupling groups via a coupling parameter that enters the flipping rates (Equation (
5)) additively and linearly. Therefore, we conclude that a residual GC will not be restricted to the specific details of Weidlich’s model but be a quite generic aspect of a stochastic dynamics described by a similar master equation.
The observed non-monotonic behavior of GC with coupling strength does not rely on non-linear dynamics or multiplicative noise. A peak of the information flow was pointed out and a finite value of the information flow, at large couplings, was also reported in [
29]. However, it is interesting to consider how our insights can be transferred to a general diffusion process. Our conclusions drawn from the minimalistic system (see Equation (
14) with Equation (
15)), i.e., a linear dynamics with additive noise, can be carried over to the class of nonlinear diffusion processes with weakly multiplicative noise in the framework of a local linearization generating a GC map in state space. Subsequent averaging with respect to the invariant measure on the attractor leads to the global GC of the system, a concept that we put forward in a recent publication [
26]. In this way, the global GC vs. coupling relation will be computable and the resultant curve will depend on the details of the considered system. The question whether a non-monotonic GC vs. coupling relation can result for a system with diffusion constants that do
not vary with coupling strength remains open and will be dealt with in a forthcoming publication.
Finally, one may ask whether the non-monotonic GC curve plays any role in natural systems. We speculate that coupling enhanced predictability might be useful for the performance of cognitive functions and that certain brain regions are tuned to optimal synaptic coupling. In this context, a scale up of effects through coupling large neuronal ensembles can also be imagined.