
Modeling Non-Equilibrium Dynamics of a Discrete Probability Distribution: General Rate Equation for Maximal Entropy Generation in a Maximum-Entropy Landscape with Time-Dependent Constraints

by
Gian Paolo Beretta
Università di Brescia, via Branze 38, Brescia, I-25123, Italy
Entropy 2008, 10(3), 160-182; https://doi.org/10.3390/entropy-e10030010
Submission received: 17 December 2007 / Revised: 30 July 2008 / Accepted: 30 July 2008 / Published: 14 August 2008

Abstract: A rate equation for a discrete probability distribution is discussed as a route to describe smooth relaxation towards the maximum entropy distribution compatible at all times with one or more linear constraints. The resulting dynamics follows the path of steepest entropy ascent compatible with the constraints. The rate equation is consistent with the Onsager theorem of reciprocity and the fluctuation-dissipation theorem. The mathematical formalism was originally developed to obtain a quantum theoretical unification of mechanics and thermodynamics. It is presented here in a general, non-quantal formulation as part of an effort to develop tools for the phenomenological treatment of non-equilibrium problems with applications in engineering, biology, sociology, and economics. The rate equation is also extended to include the case of assigned time-dependences of the constraints and the entropy, such as for modeling non-equilibrium energy and entropy exchanges.

1. Foreword

The term entropy is used in many different contexts wherever the relevant state description is in terms of a probability distribution over some set of possible events which characterize the system description. Depending on the context, such events may be microstates, or eigenstates, or configurations, or trajectories, or transitions, or mutations, and so on.
Mathematically, entropy is a functional of the state probabilities which quantifies their spread according to some set of defining axioms discussed by several authors during this “Facets of Entropy” Conference. In this sense, the use of a common name for all the possible different state functionals that share such broad defining features may have some unifying advantage from the conceptual point of view (in the same sense as the term “house” unifies in one concept an infinite variety of very different realizations), and may be useful to understand analogies and suggest inter-breeding developments between very different fields of research that share similar probabilistic descriptions.
On the other hand, from a physics point of view, entropy is a very precise, single property of matter that can be measured in every laboratory by means of well-defined, if not standard, measurement procedures (the most general and rigorous axiomatic definition currently available, and valid also for non-equilibrium states, is found in Ref. [1], see also [2]). The first instance of the use of the term entropy for this property is in an 1865 paper by the German professor of mathematical physics, Rudolf Clausius [3]. Since then, the term remained restricted to the domain of physics and thermodynamics until in 1929 von Neumann [4] pioneered the reasoning that was subsequently followed by Shannon [5] and spread thereafter in various directions, many of which unrelated to physics considerations.
In fields outside physics and thermodynamics, the characterization of probability distributions by means of some entropy functional has generally been beneficial and provided new insights. But the inflation of different meanings all assigned to the single word entropy has in turn originated confusion and misunderstandings about the physical (thermodynamic) entropy. The analogies between probabilistic mathematical theories in different fields are not sufficient to make inferences about interpretations of the meaning of the physical entropy. A physical theory [6] is not just a set of “formal links” between concepts (i.e., a mathematical theory) but also of “empirical links” between such concepts and the empirical world, the plane of perceptions, the empirical data, the measurement procedures. Hence, we should beware of the possible abuses and superficial or false conclusions that may arise from making inferences from incomplete analogies between theories.
Having said so, below I hope I will not add to the confusion by proposing some general mathematics which does originate from a physical theory, but (with the due caution about meanings and interpretation) might find use in several other fields, where the maximum entropy formalism has already found application and therefore provides an existing relevant background, or “landscape”. We are interested in “non-equilibrium probability distributions” and their time evolution towards the maximal entropy distribution. We develop a general family of dynamical equations with which we can describe, for example, a constrained, maximal-entropy-generation probability redistribution dynamics, with either “natural constraints” such as the constants of the motion of the system, or externally assigned, (rate-controlled) time-dependent constraints and/or entropy changes.

2. Introduction

The determination of a probability distribution of maximum entropy subject to a set of linear constraints has applications in many areas of engineering, physics, chemistry, and information theory [7,8,9,10,11]. The maximum entropy distribution typically represents an equilibrium or a constrained-equilibrium state of the system under study.
This paper addresses a generalization of the maximum entropy problem to the nonequilibrium domain, by discussing a general rate equation for the description of smooth constrained relaxation of an arbitrary non-equilibrium probability distribution towards the maximum entropy distribution. The main form of the nonlinear rate equation we propose for the probability distribution has the feature that it keeps the constraints constant at their initial values (or varies them in time according to an externally provided law) and increases the entropy until the probabilities converge to the maximum entropy distribution. The rate equation is also consistent with an Onsager reciprocity theorem and a fluctuation-dissipation theorem, both extended to the entire non-equilibrium domain.
Geometrically, every trajectory generated by the rate equation in state space has the property that it follows the path of steepest entropy ascent compatible with the constraints, which may have specified constant values (thus representing constants of the “motion”) or may have a specified time-dependence (as for example in rate-controlled constrained equilibrium modeling of chemical kinetics [10, 11] or other dynamical system modeling).
The formalism presented here has features of great generality, and is presented in several forms that are adaptable to different applications. It was originally “designed” by the author in 1981 [12] and subsequently developed [13,14,15,16,17,18] to obtain an equation of motion for a quantum theoretical unification of mechanics and thermodynamics [19, 20], whereby the most general known statement of the second law of thermodynamics, due to Hatsopoulos and Keenan [21], emerges as a fundamental dynamical theorem about the stability of the equilibrium states of a microscopic dynamics which generalizes the Schrödinger equation. Recently, the original equation has been partially rediscovered in the same context [22]. The idea of steepest-entropy-ascent time evolution of a probability distribution has also been recently rediscovered in [23], but with important differences that we discuss in [18].
In the present paper, the formalism is abstracted from its original physics purpose. It is presented as a mathematical tool that may find phenomenological modeling applications in other fields. This abstracted version, originally developed by the author in 1986 [24, 25], has also been recently rediscovered in a maximum entropy formalism context [26].
Because the so-called maximum entropy formalism [7,8,9] is well known, and the constrained-equilibrium method for the treatment of non-equilibrium in chemical engineering is easily understood [10, 11], we illustrate our rate equation in this framework, assuming for simplicity linear constraints. The generalization to nonlinear constraints has also been developed and is straightforward, but will be discussed elsewhere.

3. Maximum Entropy Landscape

The maximum entropy problem which sets our context (landscape, to use the terminology of nonlinear optimization) is that of seeking a probability distribution, namely, a probability vector p = {p_1, ..., p_i, ...}, whose entropy

$$S(p) = -\sum_i p_i \ln p_i \tag{1}$$

is maximal subject to given magnitudes ⟨A_k⟩ of one or more constraints

$$\sum_i p_i A_{ki} = \langle A_k \rangle \qquad k = 0, 1, \ldots, n \tag{2}$$

where A_ki is the magnitude of the k-th constraint in base state i. We will assume that the first constraint is the normalization condition, so that A_0i = 1 for each i and ⟨A_0⟩ = 1.
The maximizing distribution p* can be written as

$$p_i^* = \frac{\exp\!\left(-\sum_{k=1}^{n} \lambda_k A_{ki}\right)}{Q}, \quad \text{with} \quad Q = \sum_i \exp\!\left(-\sum_{k=1}^{n} \lambda_k A_{ki}\right) \tag{3}$$

where the Lagrange multipliers λ_1, ..., λ_n are determined by the values ⟨A_1⟩, ..., ⟨A_n⟩ of the constraints.
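As an illustration, the multipliers in Equation 3 can be computed numerically by enforcing the moment-matching conditions of Equation 2. The following minimal Python sketch (an illustration added here, not part of the original paper; the function name is hypothetical) uses a generic root finder:

```python
import numpy as np
from scipy.optimize import root

def maxent_distribution(A, means):
    """Maximum-entropy distribution of Eq. 3: p*_i = exp(-sum_k lam_k A_ki)/Q,
    subject to the moment constraints of Eq. 2, sum_i p_i A_ki = <A_k>.

    A     : (n, N) array, A[k, i] = magnitude of constraint k in base state i
            (normalization is handled by Q, so the k = 0 row is not included)
    means : (n,) array of the assigned values <A_1>, ..., <A_n>
    """
    def moment_residual(lam):
        w = np.exp(-lam @ A)        # unnormalized probabilities
        p = w / w.sum()             # divide by the partition function Q
        return A @ p - means        # constraint mismatch, zero at the solution
    sol = root(moment_residual, np.zeros(len(means)))
    lam = sol.x
    w = np.exp(-lam @ A)
    return w / w.sum(), lam
```

For instance, maxent_distribution(np.array([[0., 1., 2., 3.]]), np.array([1.2])) returns the canonical distribution over four equally spaced energy-like levels with mean 1.2, together with its multiplier.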
Given the above landscape, the extensions to the non-equilibrium domain that we wish to consider are the following.

4. Non-equilibrium Problem A: Constant Values of the Constraints

We seek a time-dependent probability distribution, namely, a vector function (one-parameter family) p(t) = {p_1(t), ..., p_i(t), ...}, whose entropy S(p(t)) is non-decreasing with time t,

$$\frac{dS(p)}{dt} = -\frac{d}{dt}\left[\sum_i p_i(t) \ln p_i(t)\right] \ge 0 \tag{4}$$

and such that the magnitudes ⟨A_k⟩ of the constraints are time-invariant, namely,

$$\sum_i p_i(t)\, A_{ki} = \langle A_k \rangle \qquad k = 0, 1, \ldots, n \tag{5}$$

for all times t. Alternatively, given an initial distribution p⁰ we seek a time-dependent distribution p(t) with p(0) = p⁰ such that at all times t

$$\sum_i p_i(t)\, A_{ki} = \sum_i p_i(0)\, A_{ki} \qquad k = 0, 1, \ldots, n \tag{6}$$

5. Non-equilibrium Problem B: Time-Dependent Values of the Constraints

A further generalization of the preceding non-equilibrium problem is one in which the magnitudes ⟨A_k⟩ of the constraints are assigned a definite time-dependence.
Given an initial distribution p⁰, we seek a time-dependent distribution p(t) with p(0) = p⁰ such that at all times t

$$\frac{d\langle A_k\rangle}{dt} = \frac{d}{dt}\left[\sum_i p_i(t)\, A_{ki}\right] = \alpha_k(t, p(t)) \qquad k = 0, 1, \ldots, n \tag{7}$$

and

$$\frac{dS(p)}{dt} = -\frac{d}{dt}\left[\sum_i p_i(t) \ln p_i(t)\right] \ge 0 \tag{8}$$

where the rates α_k(t, p(t)) are given functions of time and of the instantaneous probability distribution.
For example, Problem B can be applied in the context of the constrained-equilibrium method for chemical kinetics [10, 11]. According to this method, the chemical composition of a complex reacting system is assumed at all times to be that of a constrained-equilibrium state of maximum entropy subject to the usual normalization, energy, and stoichiometry constraints, each representing a class of rate-controlling reactions. The magnitudes of these additional constraints are continuously updated according to a kinetic model for the rates of the controlling reactions. Problem B represents a generalization of the constrained-equilibrium method where, instead of assuming instantaneous entropy maximization immediately after each update of the rate-controlling constraints, we assume a smooth approach to maximum entropy continuously compatible with the shifting magnitudes of the constraints.
First, we discuss a way to construct a differential equation for the probability distribution p, namely, an equation of the form

$$\frac{dp}{dt} = D(p) \tag{9}$$

whose solutions are solutions of Problem A.
Then, we discuss a way to construct a differential equation of the form

$$\frac{dp}{dt} = R_1^s(t, p) + \ldots + R_n^s(t, p) + D(p) \tag{10}$$

whose solutions are solutions of Problem B.
These differential equations and their main properties are presented in terms of the notation introduced in the next two sections.

6. Reformulation in Terms of Square-Root Probabilities

Because the variables p_1, ..., p_i, ... represent probabilities, both non-equilibrium problems A and B just defined are subject to the additional set of inequality constraints

$$p_i \ge 0 \quad \text{or} \quad p_i(t) \ge 0 \tag{11}$$

It is therefore convenient to change variables to the set of square-root probabilities x = {x_1, ..., x_i, ...}, from which the probabilities may be computed according to the relations

$$p_i = x_i^2 \qquad i = 1, 2, \ldots \tag{12}$$
Were it only to cope with the nonnegativity constraints 11, we could choose as new variables any function of the probabilities whose inverse is even. The main reason for choosing x_i = √p_i is geometrical and part of the steepest entropy ascent assumption; therefore we discuss it at the beginning of Section 8. We note here that, for example, in Ref. [23], where a steepest entropy ascent dynamics is derived without switching to the square-root representation, the entropy gradient is not well-defined on the entire domain (it diverges whenever one of the probabilities is zero) and, as a result, the dynamics has unphysical effects and does not conserve positivity when solved backwards in time.
The normalization constraint may be written as

$$G_0(x) = \sum_i x_i^2 = x \cdot x = x \cdot I \cdot x = 1 \tag{13}$$

where I = diag(1) and the dot product has the obvious meaning (for example, f · g = f_1 g_1 + ... + f_i g_i + ...). Hence, in square-root probability space the set of possible probability distributions maps to the unit sphere.
In terms of the new variables, we rewrite Problem B as follows (Problem A is a particular case)

$$\frac{dG_k(x(t))}{dt} = \dot{G}_k(t, x(t)) = \alpha_k(t, p(t)) \tag{14}$$

and

$$\frac{dF(x(t))}{dt} \ge 0 \tag{15}$$

where our state vector is now

$$x = \{x_1, \ldots, x_i, \ldots\} \tag{16}$$

our constraints are

$$G_k(x) = \sum_i x_i^2 A_{ki} = x \cdot A_k \cdot x \qquad k = 0, 1, \ldots, n \tag{17}$$

where A_k = diag(A_ki), and the entropy functional is

$$F(x) = -\sum_i x_i^2 \ln x_i^2 = x \cdot S \cdot x \tag{18}$$
where S = diag(−ln x_i²). Clearly F(x) coincides with the entropy functional S(p) in view of Equations 12.
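As a quick check of this change of variables, the following sketch (illustrative only, not from the paper) verifies that x = √p lies on the unit sphere and that F(x) reproduces S(p):

```python
import numpy as np

p = np.array([0.5, 0.25, 0.25])       # any normalized distribution
x = np.sqrt(p)                        # square-root probabilities, Eq. 12

assert np.isclose(x @ x, 1.0)         # unit sphere, Eq. 13
S = -np.sum(p * np.log(p))            # entropy of p, Eq. 1
F = -np.sum(x**2 * np.log(x**2))      # entropy functional of x, Eq. 18
assert np.isclose(F, S)
```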

7. Gradients of Entropy and Constraints with Respect to the Square-Root Probabilities

Next, we define vectors representing “gradients” of the constraints G_k(x) and of the entropy functional F(x) with respect to the state vector x,

$$g_k = \frac{\partial G_k}{\partial x} = \left\{\frac{\partial G_k}{\partial x_1}, \ldots, \frac{\partial G_k}{\partial x_i}, \ldots\right\} = \{2x_1 A_{k1}, \ldots, 2x_i A_{ki}, \ldots\} = 2A_k \cdot x \qquad k = 0, 1, \ldots, n \tag{19}$$

$$f = \frac{\partial F}{\partial x} = \left\{\frac{\partial F}{\partial x_1}, \ldots, \frac{\partial F}{\partial x_i}, \ldots\right\} = \{-2x_1 - 2x_1 \ln x_1^2, \ldots, -2x_i - 2x_i \ln x_i^2, \ldots\} = 2(S - I) \cdot x \tag{20}$$

so that the rates of change of the functionals G_0, G_1, ..., G_n, and F induced by the time dependence x(t) of the state vector are given by

$$\frac{dG_k}{dt} = g_k \cdot \dot{x} \tag{21}$$

$$\frac{dF}{dt} = f \cdot \dot{x} \tag{22}$$

where ẋ = dx(t)/dt = {dx_1(t)/dt, ..., dx_i(t)/dt, ...}.
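In code, these gradient vectors are immediate to assemble. The sketch below (illustrative; it assumes the constraint magnitudes A_ki are stored row-wise in a matrix A) returns the g_k and f for a given distribution, taking the limit f_i = 0 at zero probabilities:

```python
import numpy as np

def gradients(p, A):
    """Gradient vectors of Eqs. 19-20 in square-root variables.

    p : (N,) probability vector;  A : (n+1, N) with A[k, i] = A_ki.
    Returns g with rows g_k = 2 A_k . x, and f = 2(S - I) . x.
    """
    x = np.sqrt(p)
    g = 2.0 * A * x                    # g[k, i] = 2 x_i A_ki
    # f_i = -2 x_i (1 + ln x_i^2); the limit at p_i = 0 is 0
    f = np.where(p > 0, -2.0 * x * (1.0 + np.log(np.where(p > 0, p, 1.0))), 0.0)
    return g, f
```

By Equations 21 and 22, the numerical rates dG_k/dt and dF/dt along any trajectory are then just the dot products g[k] @ x_dot and f @ x_dot.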

8. Time Evolution Along the Path of Steepest Entropy Ascent

In order to introduce the geometrical notion of steepest entropy ascent, we need to define what we mean by “distance” between two probability distributions, and by “length” of a curve in probability space defined by a trajectory, a time evolution. The proper unique natural metric for this purpose is known in statistics as the Fisher-Rao metric (see e.g. [27,28,29]). For a one-parameter family of discrete distributions p(t), where t is the parameter, the distance between distributions p(t + dt) and p(t) is

$$d\ell = \frac{1}{2}\sqrt{\sum_i p_i \left(\frac{d \ln p_i}{dt}\right)^2}\, dt = \frac{1}{2}\sqrt{\sum_i \frac{1}{p_i} \left(\frac{d p_i}{dt}\right)^2}\, dt = \sqrt{\sum_i \left(\frac{d \sqrt{p_i}}{dt}\right)^2}\, dt = \sqrt{\sum_i \dot{x}_i^2}\, dt = \sqrt{\dot{x} \cdot \dot{x}}\, dt \tag{23}$$

Thus, square-root probabilities x_i = √p_i are the most natural variables in that (see the sketch after this list):
  • the space becomes the unit sphere, x · x = 1 (equivalent to ∑_i p_i = 1);
  • the Fisher-Rao metric simplifies to dℓ = √(ẋ · ẋ) dt, or equivalently dℓ² = dx · dx;
  • the distance between any two distributions is the angle dℓ(x₁, x₂) = cos⁻¹(x₁ · x₂).
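A two-line illustration of the last bullet (a hedged sketch, not from the paper):

```python
import numpy as np

def fisher_rao_distance(p1, p2):
    """Fisher-Rao distance between two discrete distributions:
    the angle between their square-root vectors on the unit sphere."""
    x1, x2 = np.sqrt(p1), np.sqrt(p2)
    return np.arccos(np.clip(x1 @ x2, -1.0, 1.0))  # clip guards rounding
```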
We now focus on Problem A, and write ẋ = ẋ_L + ẋ_⊥L, where L = L(g_0, g_1, ..., g_n) is the linear manifold spanned by the gradient vectors associated with the constraints. In order to satisfy all the constraints, dG_k/dt = 0 (see Eq. 21), it is clear that ẋ must be orthogonal to L, i.e., ẋ = ẋ_⊥L. Noting that also f may be written as f = f_L + f_⊥L, we may therefore rewrite Eq. 22 as

$$\frac{dF}{dt} = f_{\perp L} \cdot \dot{x}_{\perp L} \tag{24}$$

It is now apparent that the maximal increase of F obtains when ẋ_⊥L is in the same direction as f_⊥L.
Hence, the rate equation that we propose is expressed by the differential equation

$$\dot{x} = \frac{1}{\tau(p)}\, f_{\perp L(g_0, g_1, \ldots, g_n)} \tag{25}$$

where τ(p) is some strictly positive functional of the probability distribution p (possibly a constant) with dimensions of time, vectors g_0, g_1, ..., g_n and f are defined by Relations 19 and 20, and the vector

$$f_{\perp L(g_0, g_1, \ldots, g_n)} \tag{26}$$

is the orthogonal complement of the projection of f onto the linear span of vectors g_0, g_1, ..., g_n, for which the Appendix gives explicit expressions using well-known but seldom encountered linear algebra notations.
Using Relations 12 and 91, and some procedure to eliminate from the set g_0, g_1, ..., g_n those vectors (if any) that are linearly dependent on the others, we may readily verify that Equation 25 induces a differential equation for the probabilities p which contains only the squares x_i² of the new variables and, therefore, is of the form of Equation 9.
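To make Equation 25 concrete, here is a minimal numerical sketch (an illustration under stated assumptions, not the paper's implementation; the function name is hypothetical). It computes f_⊥L by a least-squares projection, which also tolerates linearly dependent gradients, and takes one explicit Euler step:

```python
import numpy as np

def sea_step(p, A, tau=1.0, dt=1e-3):
    """One explicit Euler step of Eq. 25, x_dot = (1/tau) f_perpL(g0,...,gn).

    p   : (N,) probability vector
    A   : (n+1, N) constraint matrix with row 0 = ones (normalization)
    """
    x = np.sqrt(p)
    # entropy gradient f (Eq. 20); the limit at p_i = 0 is f_i = 0
    f = np.where(p > 0, -2.0 * x * (1.0 + np.log(np.where(p > 0, p, 1.0))), 0.0)
    G = (2.0 * x * A).T                          # columns are the g_k of Eq. 19
    coeffs, *_ = np.linalg.lstsq(G, f, rcond=None)
    f_perp = f - G @ coeffs                      # component orthogonal to L(g0,...,gn)
    x_new = x + (dt / tau) * f_perp
    p_new = x_new ** 2
    return p_new / p_new.sum()                   # absorb first-order Euler drift
```

Iterating this step from a strictly positive initial distribution, one can check numerically that the constraint magnitudes stay at their initial values to within the integration error (Eq. 27 below), while the entropy −Σ_i p_i ln p_i increases monotonically towards the maximum entropy value of Equation 3 (Eq. 28 below).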
By virtue of Relations 21 and 92, we verify that Equation 25 maintains the magnitude of each constraint invariant, i.e.,

$$\frac{dG_k}{dt} = g_k \cdot \dot{x} = \frac{1}{\tau}\, g_k \cdot f_{\perp L(g_0, g_1, \ldots, g_n)} = 0 \tag{27}$$

whereas, by virtue of Relations 22 and 94, we verify that the value of the entropy functional is non-decreasing under Equation 25, i.e. (later we obtain other important expressions, e.g., Eq. 54),

$$\frac{dS}{dt} = \frac{dF}{dt} = f \cdot \dot{x} = \frac{1}{\tau}\, f \cdot f_{\perp L(g_0, g_1, \ldots, g_n)} = \frac{1}{\tau}\, f_{\perp L(g_0, g_1, \ldots, g_n)} \cdot f_{\perp L(g_0, g_1, \ldots, g_n)} = \tau\, \dot{x} \cdot \dot{x} \ge 0 \tag{28}$$

where the equal sign (no entropy generation) applies if and only if vector f is in L(g_0, g_1, ..., g_n) and, therefore, is a linear combination of vectors g_0, g_1, ..., g_n, namely, there is a set of multipliers λ_0, λ_1, ..., λ_n such that

$$f = \sum_{k=0}^{n} \lambda_k\, g_k \tag{29}$$

Using Relations 19 and 20, Condition 29 becomes

$$-x_i - x_i \ln x_i^2 = \sum_{k=0}^{n} \lambda_k\, x_i A_{ki} \qquad i = 1, 2, \ldots \tag{30}$$

which, multiplied by x_i and using p_i = x_i², becomes

$$-p_i \ln p_i = p_i \left(1 + \sum_{k=0}^{n} \lambda_k A_{ki}\right) \qquad i = 1, 2, \ldots \tag{31}$$

To be satisfied for each i, this requires either p_i = 0 or ln p_i = −(1 + Σ_{k=0}^n λ_k A_ki), which we may express in compact form by

$$p_i = \delta_i\, \frac{\exp\!\left(-\sum_{k=1}^{n} \lambda_k A_{ki}\right)}{\mathcal{Q}}, \quad \text{with} \quad \mathcal{Q} = \sum_i \delta_i \exp\!\left(-\sum_{k=1}^{n} \lambda_k A_{ki}\right) \tag{32}$$

where δ_i is a boolean variable that may only be either 0 or 1, and we evaluated λ_0 by the normalization condition.
It is noteworthy that Equation 25 cannot alter the value of a p_i (or x_i) which is initially zero, namely, an initially zero probability remains zero at all times. This feature guarantees that each probability p_i, if positive at some instant, must be positive at all times, for the dynamics prohibits its ever crossing zero. It also guarantees that the solutions are well defined not only in forward time but backwards in time as well: from the present state, we can solve for future states, but we can also reconstruct the past. Notice that this mathematical reversibility (existence of the inverse of the dynamical map) has nothing to do with the “physical” irreversibility whereby the entropy increases in forward time and decreases in backward time.
From Relations 28 and 31 it follows that the effect of Equation 25 is to smoothly rearrange – without violating any constraint – the nonzero probabilities in the distribution towards higher entropy distributions, until a partially canonical distribution defined by Equation 32 is approached, in which the initially zero probabilities are still equal to zero whereas the initially nonzero probabilities are distributed canonically.
Clearly, such a partially canonical equilibrium distribution maximizes the entropy functional when restricted to the subset of distributions that have all the initially zero probabilities equal to zero. Such distributions are equilibrium, i.e., time-invariant, but are unstable whenever there is some probability p_i equal to zero. We can see this by noting that a minor perturbation of the distribution, changing a zero probability to an arbitrarily small nonzero value, would proceed away towards a quite different equilibrium of higher entropy. If initially all the probabilities in the distribution have nonzero values, then Equation 25 takes the distribution directly towards the unique stable equilibrium distribution compatible with the initial values of the constraints, which is the maximum entropy distribution given by Equation 3.
Geometrically, we can visualize the effect of Equation 25 as follows. Consider the surface defined by G_k(x) = ⟨A_k⟩ for k = 0, 1, ..., n, where the ⟨A_k⟩ are the magnitudes of the constraints fixed by the initial distribution. On this surface we can identify contour curves of constant entropy, generated by intersecting it with the constant-entropy surfaces F(x) = S, where S varies from 0 to the maximum value compatible with the given magnitudes of the constraints. Every trajectory x(t) generated by Equation 25 lies on the constant-G_k surface and is at each point orthogonal to the constant-entropy contour passing through that point. In this sense, the trajectory follows a path of steepest entropy ascent compatible with the constraints.
We finally notice that, using Relations 28, Equation 25 may also be rewritten as

$$\dot{x} = \dot{S}\, \frac{f_{\perp L(g_0, g_1, \ldots, g_n)}}{f_{\perp L(g_0, g_1, \ldots, g_n)} \cdot f_{\perp L(g_0, g_1, \ldots, g_n)}} \tag{33}$$

This form may be useful in variants of Problem A whereby one needs to specify explicitly the rate of entropy change Ṡ along the steepest entropy ascent (or descent, if Ṡ is for some reason specified negative and hence the inequality in Eq. 4 reversed), together with the condition that the probability distribution must maintain invariant all the constants of the motion G_k. Notice, however, that this form is undefined at any stable maximum entropy equilibrium state, except if Ṡ is a function of, and goes to zero faster than, f_⊥L(g_0, g_1, ..., g_n) · f_⊥L(g_0, g_1, ..., g_n).
Moreover, going back to the Fisher-Rao metric, Relations 28 allow us to write the following equivalent expressions

$$d\ell = \sqrt{\dot{x} \cdot \dot{x}}\, dt = \sqrt{\frac{1}{\tau}\frac{dS}{dt}}\, dt = \frac{1}{\tau}\frac{dS}{d\ell}\, dt \tag{34}$$

from which the relaxation time τ may be interpreted as the characteristic time of the rate dℓ/dt at which the state vector follows the steepest entropy ascent trajectory, modulated by the rate dS/dℓ at which the entropy functional increases along the trajectory,

$$\frac{1}{\tau} = \frac{d\ell/dt}{dS/d\ell} \tag{35}$$

or, alternatively, we see that when time is measured in units of the ‘local’ (or instantaneous) value of τ, the ‘speed’ along the steepest entropy ascent trajectory coincides with the ‘local’ (or instantaneous) rate of entropy increase along the trajectory,

$$\frac{d\ell}{d(t/\tau)} = \frac{dS}{d\ell} \tag{36}$$
which justifies the interpretation of τ as an intrinsic dynamical time.
In the next section we discuss two further properties of Equation 25, related to Onsager reciprocity and the fluctuation-dissipation theorem. As already mentioned, the mathematical structure of Equation 25 was originally developed by the author within the context of a unified theory of mechanics and thermodynamics that we call quantum thermodynamics [12, 13, 14, 16, 17, 18]. A useful application in that framework could be to compute transport properties in the far non-equilibrium domain.

9. Onsager Reciprocity and Fluctuation-Dissipation Relations

An indirect way to specify a probability distribution p is to specify the mean values of a sufficient number of independent linear functionals of the distribution, such as

$$\sum_i p_i A_{ki} = \langle A_k \rangle \qquad k = 0, 1, \ldots, n, \ldots \tag{37}$$

where the first n + 1 functionals coincide with the constraints, but the set is now extended to as many functionals as needed to completely specify the distribution p. If the functionals are all linearly independent, then we need as many as there are probabilities in the distribution (minus one, because of normalization). We call such a set of functionals a complete set of independent properties of the probability distribution. Let us denote by Y_0, Y_1, ..., Y_k, ... one such complete set of property functionals of the variables x_i², namely,

$$Y_k(x) = \sum_i x_i^2 A_{ki} \qquad k = 0, 1, \ldots \tag{38}$$

such that if the values of all these functionals are given, then the values of all the x_i² are determined.
For simplicity, let us further assume that functional Y_0 is the normalization constraint, i.e., A_0i = 1 for each i. Next we define the gradient vector of each functional Y_k as

$$y_k = \{\partial Y_k/\partial x_1, \ldots, \partial Y_k/\partial x_i, \ldots\} = \{2x_1 A_{k1}, \ldots, 2x_i A_{ki}, \ldots\} \qquad k = 0, 1, \ldots \tag{39}$$

Note that in terms of the gradient vectors, y_0 = 2x, and the “mean” functionals Y_k may be written as

$$Y_k = \tfrac{1}{4}\, y_0 \cdot y_k \qquad\quad Y_0 = \tfrac{1}{4}\, y_0 \cdot y_0 = 1 \tag{40}$$

and we may define the “deviation” vectors

$$\Delta y_k = \tfrac{1}{2}\,(y_k - y_0\, Y_k) \tag{41}$$

so that clearly ∆y_k · y_0 = 0. By noting that Σ_k ṗ_k A_ik A_jk = ½ ẏ_i · y_j = ½ y_i · ẏ_j, it is also easy to check the following interesting relations

$$\Delta\dot{y}_i \cdot \Delta y_j = \Delta y_i \cdot \Delta\dot{y}_j \tag{42}$$
which may be interpreted as a kind of “kinematic reversibility” inherent in our gradient formalism, and holds independently of the steepest ascent dynamical assumption.
In terms of the functionals Y_k we may also form the following useful nonlinear functionals

$$\Delta Y_k Y_m(x) = \Delta y_k \cdot \Delta y_m = \sum_i x_i^2 A_{ki} A_{mi} - \sum_i x_i^2 A_{ki} \sum_j x_j^2 A_{mj} = \tfrac{1}{4}\, y_k \cdot y_m - \left(\tfrac{1}{4}\, y_0 \cdot y_k\right)\left(\tfrac{1}{4}\, y_0 \cdot y_m\right) \tag{43}$$

which represent the “covariance” or “codispersion” of properties Y_k and Y_m. In particular, the functional ∆Y_kY_k represents the variance or dispersion (also, fluctuation) of property Y_k. In the notation of the Appendix,

$$\Delta Y_k Y_m(x) = \Delta y_k \cdot \Delta y_m = \frac{1}{16}\begin{vmatrix} y_k \cdot y_m & y_k \cdot y_0 \\ y_0 \cdot y_m & y_0 \cdot y_0 \end{vmatrix} \tag{44}$$

and

$$\Delta Y_k Y_k(x) = \Delta y_k \cdot \Delta y_k = \frac{1}{16}\det M(y_k, y_0) \tag{45}$$
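Numerically, Equation 43 is just the familiar covariance under p (a hedged sketch; the helper name is hypothetical):

```python
import numpy as np

def codispersion(p, Ak, Am):
    """Covariance functional Delta Y_k Y_m of Eq. 43:
    <A_k A_m> - <A_k><A_m> under the distribution p."""
    return p @ (Ak * Am) - (p @ Ak) * (p @ Am)
```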
We now consider the entropy functional

$$F(x) = -\sum_i x_i^2 \ln x_i^2 \tag{46}$$

and its gradient vector

$$f = \{\partial F/\partial x_1, \ldots, \partial F/\partial x_i, \ldots\} = \{-2x_1 - 2x_1 \ln x_1^2, \ldots, -2x_i - 2x_i \ln x_i^2, \ldots\} \tag{47}$$

and further assume that, when evaluated at a given distribution x, the property functionals Y_k in the complete set have gradient vectors y_k that are all linearly independent and span the entire set of vectors that have zero entries corresponding to the zero x_i's, so that there is a unique set of scalars λ_0, λ_1, ... such that the vector f can be written as

$$f = \sum_k \lambda_k\, y_k \tag{48}$$

where the scalars λ_0, λ_1, ... are determined by the set of equations

$$\sum_k \lambda_k A_{ki} = -1 - \ln x_i^2, \quad \text{with the index } i \text{ restricted to the set for which } x_i \ne 0 \tag{49}$$

The entropy functional F may also be written as

$$F(x) = -\sum_i x_i^2 \ln x_i^2 = 1 + \tfrac{1}{4}\, y_0 \cdot f = 1 + \tfrac{1}{4}\sum_k \lambda_k\, y_0 \cdot y_k = 1 + \sum_k \lambda_k\, Y_k \tag{50}$$
where we used Equations 40 and 48.
We see from Relation 50 that the scalar λ_k can be interpreted as an affinity or generalized “force” representing the marginal impact of property Y_k on the value of the entropy about a given distribution x, i.e., λ_k = ∂F/∂Y_k.
Also here we may define the “deviation” vector

$$\Delta f = \tfrac{1}{2}\,\big(f - y_0\, (F - 1)\big) \tag{51}$$

so that clearly ∆f · y_0 = 0.
In the notation of the Appendix, we may write the codispersion of the entropy F and a property Y_m as

$$\Delta F Y_m(x) = \Delta f \cdot \Delta y_m = \frac{1}{16}\begin{vmatrix} f \cdot y_m & f \cdot y_0 \\ y_0 \cdot y_m & y_0 \cdot y_0 \end{vmatrix} \tag{52}$$

and the entropy fluctuations as

$$\Delta F F(x) = \Delta f \cdot \Delta f = \frac{1}{16}\det M(f, y_0) \tag{53}$$
In terms of this notation, when vectors g_0, g_1, ..., g_n are linearly independent and g_0 = 2x is the gradient associated with the normalization constraint, the rate of entropy generation (Eq. 28) may be written as a ratio of Gram determinants as follows (see Relations 97 and 98 in the Appendix)

$$\frac{dS}{dt} = \frac{1}{\tau}\,\frac{\det M(\Delta f, \Delta g_1, \ldots, \Delta g_n)}{\det M(\Delta g_1, \ldots, \Delta g_n)} \le \frac{1}{\tau}\,\Delta f \cdot \Delta f \tag{54}$$

This last inequality may be interpreted as a “time-entropy” uncertainty relation [30] in the following sense

$$\tau_S\, \sqrt{\Delta f \cdot \Delta f} \ge \tau \tag{55}$$

where the characteristic time of entropy change is defined by the relation

$$\frac{1}{\tau_S} = \frac{dS/dt}{\sqrt{\Delta f \cdot \Delta f}} \tag{56}$$
We are now ready to consider the time dependence of the properties Y_k and of the entropy F as induced by Equation 25 for the probability distribution, i.e., by the rate equation

$$\dot{x} = \tau^{-1}\, f_{\perp L(g_0, g_1, \ldots, g_n)} = \tau^{-1} \sum_m \lambda_m\, y_{m\perp L(g_0, g_1, \ldots, g_n)} \tag{57}$$

where we made use of Equation 48 for f.
The rates of change of the properties Y_k are then given by

$$\dot{Y}_k = y_k \cdot \dot{x} = \dot{y}_k \cdot x = \sum_m L_{km}\, \lambda_m \tag{58}$$

and the rate of entropy production by

$$\dot{S} = \dot{F} = \tau\, \dot{x} \cdot \dot{x} = \sum_k \sum_m \lambda_k\, L_{km}\, \lambda_m \tag{59}$$

where we define the functionals

$$L_{km} = \tau^{-1}\, y_k \cdot y_{m\perp L(g_0, g_1, \ldots, g_n)} \tag{60}$$
Relation 58 shows that the rates of change (or generalized “fluxes”) Ẏ_k and the affinities (or generalized “forces”) λ_m are linearly interrelated by the coefficients (or generalized “conductivities”) L_km. The linear dependence of each rate on every affinity is a fundamental physical assumption in the so-called “linear” theory of irreversible processes, a well-known theory [31,32,33,34,35,36,37,38,39] that deeply entangles physics and engineering by regularizing and unifying a wealth of experimental evidence of “direct” and “cross” effects. Direct effects (like Fick’s law of mass diffusion; Fourier’s law of heat conduction; Navier’s law of momentum diffusion; Ohm’s law of charge diffusion; and some aspects of chemical kinetics) arise from the linear relation between the rate of change (or the local flux vector) of one property and its conjugated affinity (or the spatial gradient of its conjugated driving force), such as between the heat flux vector and the temperature gradient in Fourier’s law. Cross effects (like Dufour’s, Soret’s and other thermodiffusion effects; Knudsen’s and other thermo-osmotic effects; electro-osmotic and streaming electro-kinetic effects; Seebeck’s, Peltier’s and other thermoelectric effects) arise from the linear relation between the rate of change (or the local flux vector) of one property and the conjugated affinity (or the spatial gradient of the conjugated driving force) of another property, such as between the heat flux vector and the species concentration gradient in Dufour’s thermodiffusion law.
Using Equation 95 (Appendix) with a = y_k and b = y_m, we obtain

$$L_{km} = \tau^{-1}\, y_{k\perp L(g_0, g_1, \ldots, g_n)} \cdot y_{m\perp L(g_0, g_1, \ldots, g_n)} \tag{61}$$

which shows that the conductivities L_km satisfy the reciprocity relations

$$L_{km} = L_{mk} \tag{62}$$

Using Equation 91 (Appendix), we also find the following explicit expression for the generalized conductivities L_km,

$$L_{km} = \frac{1}{\tau}\,\frac{\begin{vmatrix} y_k \cdot y_m & y_k \cdot h_1 & \cdots & y_k \cdot h_r \\ y_m \cdot h_1 & h_1 \cdot h_1 & \cdots & h_r \cdot h_1 \\ \vdots & \vdots & \ddots & \vdots \\ y_m \cdot h_r & h_1 \cdot h_r & \cdots & h_r \cdot h_r \end{vmatrix}}{\begin{vmatrix} h_1 \cdot h_1 & \cdots & h_r \cdot h_1 \\ \vdots & \ddots & \vdots \\ h_1 \cdot h_r & \cdots & h_r \cdot h_r \end{vmatrix}} \tag{63}$$

where vectors h_1, ..., h_r may be any linearly independent set spanning the manifold L(g_0, g_1, ..., g_n).
Moreover, it follows from Relation 60 that the matrix of generalized conductivities

$$[L] = \begin{bmatrix} L_{00} & L_{01} & \cdots & L_{0m} & \cdots \\ L_{10} & L_{11} & \cdots & L_{1m} & \cdots \\ \vdots & \vdots & \ddots & \vdots & \\ L_{k0} & L_{k1} & \cdots & L_{km} & \cdots \\ \vdots & \vdots & & \vdots & \ddots \end{bmatrix} \tag{64}$$

is a Gram matrix and as such it is nonnegative definite.
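The reciprocity and nonnegativity of [L] can be checked numerically with a short sketch (an illustration under stated assumptions, not from the paper; the function name is hypothetical):

```python
import numpy as np

def conductivity_matrix(p, A_constr, A_props, tau=1.0):
    """Generalized conductivities L_km = (1/tau) y_k_perpL . y_m_perpL (Eq. 61).

    A_constr : (n+1, N) rows give the constraint gradients g_k = 2 A_k . x
    A_props  : (K, N) rows give the property gradients y_k (Eq. 39)
    """
    x = np.sqrt(p)
    G = (2.0 * x * A_constr).T                     # columns span L(g0,...,gn)
    Y = (2.0 * x * A_props).T                      # columns are the y_k
    # orthogonal complements y_k_perpL via least-squares projection onto span(G)
    Y_perp = Y - G @ np.linalg.lstsq(G, Y, rcond=None)[0]
    L = (Y_perp.T @ Y_perp) / tau                  # Gram matrix of Eq. 61
    assert np.allclose(L, L.T)                     # reciprocity, Eq. 62
    assert np.all(np.linalg.eigvalsh(L) >= -1e-9)  # nonnegative definite
    return L
```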
Notice that the above proof of Onsager reciprocity requires the steepest entropy ascent assumption. The “kinematic reversibility” feature expressed by Eqs. 42, which we noted is intrinsic in our gradient formalism, is in the background, but it is not sufficient to entail Eqs. 62, nor does it appear to play any role in our proof. On the other hand, other than the steepest entropy ascent assumption, the derivation of Eqs. 62 required neither any additional dynamical assumptions, such as other forms of “microscopic reversibility”, nor any ad hoc approximations, such as linearizations that would limit their validity to the near-equilibrium regime only. Therefore, our result holds in the entire domain, i.e., even in the arbitrarily far nonequilibrium regime.
The literature on Onsager reciprocity for physical systems originates from the work of Onsager [31], Casimir [32], Callen [33], Kubo [34], and others [35,36,37,38,39]. Onsager’s reciprocal relations are important universal features of irreversible processes. In different ways, the cited pioneers have shown that the reciprocal relations can only be derived by complementing the known (reversible) dynamical principles with some reasonable additional principle (e.g., the so-called “principle of microscopic reversibility”), assumption (e.g., that of sufficiently small perturbations, to ensure a linear behavior of the system), or approximation (e.g., that of sufficiently small deviations from thermodynamic equilibrium; see e.g. [40] for a derivation in the near-chemical-equilibrium chemical kinetics domain). Levine [7] has also shown that the reciprocal relations can be derived and extended to a higher nonequilibrium domain by adopting the assumption (or approximation) that systems proceed along a path continuously maximizing the entropy functional subject to some (possibly unknown) set of linear time-dependent constraints. This approach is closely related to the cited “rate-controlled constrained-equilibrium method”.
Matrix [L] is positive definite only if the vectors y_k⊥L(g_0, g_1, ..., g_n) are all linearly independent, in which case the set of Equations 58 may be solved to yield

$$\lambda_m = \sum_k \left([L]^{-1}\right)_{mk}\, \dot{Y}_k \tag{65}$$

Therefore, in this case ([L] positive definite), the rate of entropy generation (Equation 28) may be rewritten in the following several forms

$$\dot{S} = \dot{F} = \sum_k \lambda_k\, y_k \cdot \dot{x} = \sum_k \lambda_k\, \dot{Y}_k = \tau\, \dot{x} \cdot \dot{x} = \sum_k \sum_m \lambda_k\, L_{km}\, \lambda_m = \sum_k \sum_m \dot{Y}_m \left([L]^{-1}\right)_{mk} \dot{Y}_k \tag{66}$$
In other words, the rate of entropy generation is a quadratic form in the affinities or generalized forces; it is also a quadratic form in the rates of change of the complete set of base observables; and it is the sum of the products of each such rate of change and its conjugated affinity. We emphasize again that each element of the generalized conductivity matrix [L] and of its inverse, the generalized resistance matrix [L]⁻¹, is a nonlinear functional of the probability distribution only, not of the square-root probabilities, which therefore have played only an auxiliary role in the derivation.
Finally, comparing Relation 43, for the codispersion ∆Y_kY_m of properties Y_k and Y_m, and Relation 63, for the generalized conductivities L_km, we see that there is a relation between L_km, ∆Y_kY_m, and all the codispersions of the constraints and of the properties Y_k and Y_m. We may greatly simplify these relations if, for a given distribution x, we further restrict the choice of the complete set of linearly independent property functionals Y_k so that y_0 = h_1, y_1 = h_2, ..., y_{r−1} = h_r, where h_1, ..., h_r are linearly independent vectors spanning the manifold L(g_0, g_1, ..., g_n) generated by the constraints, and, moreover, we select the functionals Y_k for k ≥ r so that the codispersions ∆Y_kY_0, ∆Y_kY_1, ..., ∆Y_kY_{r−1} (Equation 43) are all equal to zero. For this particular choice, by studying Relation 63 for the generalized conductivities, we may readily verify that

$$L_{km} = 0 \qquad \text{whenever } k < r \text{ or } m < r \tag{67}$$

$$L_{km} = 4\,\Delta Y_k Y_m / \tau \qquad \text{for } k \ge r \text{ and } m \ge r \tag{68}$$

and, therefore, we find a direct relation between the covariance and the generalized cross conductivity of the pair of properties Y_k and Y_m. In particular, for k = m ≥ r we find L_kk = 4∆Y_kY_k/τ, which is a relation between the variance (or fluctuation) and the direct conductivity (or dissipation) of property Y_k.

10. Time-Dependent Constraints

In terms of the notation already introduced, let us now consider the differential equation

$$\dot{x} = \dot{G}_k\, \frac{g_{k\perp L(g_0, g_1, \ldots, g_{k-1}, g_{k+1}, \ldots, g_n)}}{g_{k\perp L(g_0, g_1, \ldots, g_{k-1}, g_{k+1}, \ldots, g_n)} \cdot g_{k\perp L(g_0, g_1, \ldots, g_{k-1}, g_{k+1}, \ldots, g_n)}} \tag{69}$$

where Ġ_k = α_k(t, p(t)) are given functions of time and of the instantaneous probability distribution. Clearly, the rates of change cannot be completely arbitrary, as Equation 69 is undefined unless Ġ_k is a function of, and goes to zero faster than, g_k⊥L(g_0, g_1, ..., g_{k−1}, g_{k+1}, ..., g_n) · g_k⊥L(g_0, g_1, ..., g_{k−1}, g_{k+1}, ..., g_n). Once the rate Ġ_k is specified, the probability distribution changes along the steepest G_k ascent (or descent, if Ġ_k is negative) compatible with maintaining invariant the other constants of the motion.
Again, we readily verify that Equation 69 induces an equation for ṗ which contains only the squares x_i² of the new variables and, therefore, is of the form

$$\dot{p} = R_k(t, p) \tag{70}$$
We may readily verify that Equation 69 induces an evolution of the probability distribution p along which the magnitudes of all the constraints except the k-th are time-invariant, whereas the magnitude of the k-th constraint varies with a rate of change equal to Ġ_k(t, p(t)).
Geometrically, we could visualize the effect of Equation 69 as follows. We consider the surface defined by G_m(x) = ⟨A_m⟩ for m = 0, 1, ..., k−1, k+1, ..., n, where the ⟨A_m⟩ are the magnitudes of the constraints (except the k-th) fixed by the initial distribution. On this surface, we can identify contour lines along which the k-th constraint is constant, generated by intersecting the surface just defined with the surface G_k(x) = ⟨A_k⟩, where ⟨A_k⟩ varies over a feasible range of values. Every trajectory p(t) generated by Equation 69 lies on the surface of the fixed constraints and is at each point orthogonal to the constant-G_k contour line passing through that point. In this sense, the trajectory follows a path along the gradient of G_k compatible with all the other constraints; equivalently, Equation 69 determines the minimal change in x that is necessary in order to change the k-th constraint at the specified rate Ġ_k.
With this equation, however, the rate of entropy change does not have a definite sign. So, we finally note that by substituting L(g_0, g_1, ..., g_{k−1}, g_{k+1}, ..., g_n) in Equation 69 with

$$L(f, g_0, g_1, \ldots, g_{k-1}, g_{k+1}, \ldots, g_n) \tag{71}$$

we obtain a rate equation causing the k-th constraint to follow the specified rate Ġ_k while maintaining a zero rate of change for the other constraints, as well as a zero rate of change of the entropy. In other words, this would describe an isoentropic change of the magnitude of the k-th constraint. We therefore denote by

$$\dot{p} = R_k^s(t, p) \tag{72}$$

the differential equation for the probabilities generated by the following differential equation for the square-root probabilities

$$\dot{x} = \dot{G}_k\, \frac{g_{k\perp L(f, g_0, g_1, \ldots, g_{k-1}, g_{k+1}, \ldots, g_n)}}{g_{k\perp L(f, g_0, g_1, \ldots, g_{k-1}, g_{k+1}, \ldots, g_n)} \cdot g_{k\perp L(f, g_0, g_1, \ldots, g_{k-1}, g_{k+1}, \ldots, g_n)}} \tag{73}$$

This form obtains the specified rate Ġ_k while the probability distribution changes along the steepest G_k ascent (or descent, if Ġ_k is negative) compatible with maintaining invariant all the other constants of the motion and the entropy.
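A hedged numerical sketch of Equations 69 and 73 (illustrative only; the helper is hypothetical and reuses the least-squares projection idea from the sketch of Equation 25):

```python
import numpy as np

def constraint_drive_step(p, A, k, Gdot_k, dt=1e-3, isoentropic=False):
    """One Euler step of Eq. 69 (or Eq. 73 if isoentropic=True): drive the
    k-th constraint at rate Gdot_k while freezing the other constraints
    (and, optionally, the entropy).

    A : (n+1, N) constraint matrix with row 0 = ones (normalization).
    """
    x = np.sqrt(p)
    G = (2.0 * x * A).T                              # columns g_0, ..., g_n
    gk = G[:, k]
    cols = [G[:, j] for j in range(A.shape[0]) if j != k]
    if isoentropic:                                  # add f to the frozen span
        f = np.where(p > 0, -2.0 * x * (1.0 + np.log(np.where(p > 0, p, 1.0))), 0.0)
        cols = [f] + cols
    H = np.column_stack(cols)
    gk_perp = gk - H @ np.linalg.lstsq(H, gk, rcond=None)[0]
    # undefined when gk_perp vanishes (see the caveat after Eq. 69)
    x_new = x + dt * Gdot_k * gk_perp / (gk_perp @ gk_perp)
    p_new = x_new ** 2
    return p_new / p_new.sum()
```

Combining n such terms with the steepest-entropy-ascent term D(p) reproduces the structure of Equation 74 below.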
Clearly, if we want two or more constraints to have a specified rate of change, then Equation 10,

$$\frac{dp}{dt} = R_1^s(t, p) + \ldots + R_n^s(t, p) + D(p) \tag{74}$$

yields many orthogonal contributions to ẋ. The terms R_1^s, ..., R_n^s (each with a structure similar to that given by Equation 69) cause the shifting constraints to follow the specified rates of change Ġ_1, ..., Ġ_n. The term D (as given by Equation 25) gives instead a non-negative definite contribution to the rate of entropy change and tends to attract the distribution x towards a path of steepest entropy ascent compatible with the instantaneous values of the constraints.

11. Two Thermodynamic Examples

A less general but clearer illustration of the above formalism obtains when we assume that, in addition to the normalization constraint G_0, there is only one constraint G_1, the energy.

11.1. Modeling the Extraction of the Adiabatic Availability of a System in a Non-Equilibrium State

For example, suppose the task is to extract the adiabatic availability of the system (see Ref. [1] for the precise definition of this fundamental thermodynamic concept; briefly: the maximum amount of energy that can be extracted from the system in the form of mechanical work without leaving any other net effects in its environment). Then the rate equation

$$\dot{x} = \dot{G}_1\, \frac{g_{1\perp L(f, g_0)}}{g_{1\perp L(f, g_0)} \cdot g_{1\perp L(f, g_0)}} \tag{75}$$

tells us, for a given non-equilibrium distribution x of square-root probabilities (in general representing some non-equilibrium state), the rate at which we must change each probability in the distribution in order to reduce the energy ⟨G_1⟩ of the system at a given rate Ġ_1 while maintaining the entropy constant. Of course, once we reach the minimum energy compatible with the initial entropy, the state is a canonically distributed stable equilibrium state (i.e., we have reached a maximum entropy distribution for the current energy) and g_1⊥L(f, g_0) = 0: no more energy can be extracted isoentropically.
Recall that the explicit expression of g1⊥L(f, g0) can be easily obtained using the notation developed in the Appendix.

11.2. Modeling a Non-Work Interaction between a System in a Non-Equilibrium State and a Thermal Reservoir at Temperature TR

Also pertaining to thermodynamics is the modeling of a non-work interaction (again, see Ref. [1] for rigorous definitions), i.e., an interaction where, in addition to an energy exchange between the interacting systems, there is also an entropy exchange. Suppose further that the exchange occurs between our system, in a non-equilibrium state described by an arbitrary probability distribution, and a thermal reservoir at temperature T_R. Then the ratio of the energy to the entropy exchanged is equal to T_R. So, the energy and entropy balance equations in this case are (recall that in our notation here the energy is G_1)

$$\dot{G}_1 = \dot{G}_1^{\leftarrow} \tag{76}$$

and

$$\dot{S} = \frac{\dot{G}_1^{\leftarrow}}{T_R} + \dot{S}_{\rm gen} \tag{77}$$

where by Ġ_1^← and Ġ_1^←/T_R we denote the rates at which, respectively, energy and entropy are exchanged via the heat interaction (assumed positive if into the system, negative if out of the system).
Then the rate equation

$$\dot{x} = \dot{G}_1^{\leftarrow}\, \frac{g_{1\perp L(f, g_0)}}{g_{1\perp L(f, g_0)} \cdot g_{1\perp L(f, g_0)}} + \left(\frac{\dot{G}_1^{\leftarrow}}{T_R} + \dot{S}_{\rm gen}\right) \frac{f_{\perp L(g_0, g_1)}}{f_{\perp L(g_0, g_1)} \cdot f_{\perp L(g_0, g_1)}} \tag{78}$$
tells us, for any given non-equilibrium distribution x of square-root probabilities (in general representing some non-equilibrium state), the rate at which we must change each probability in the distribution in order to obtain the given energy and entropy exchanges, as well as the given amount of entropy generation by irreversibility. We can even set Ṡ_gen = 0 and see, therefore, what the changes would be if the non-work interaction occurs, for example, much faster than the typical time scale of entropy generation.
Finally, because form 78 may be somewhat misleading if one forgets the conditions mentioned above on the dependences of Ġ_1^← and Ṡ near a canonical distribution, we prefer to rewrite it in terms of characteristic times of energy exchange

$$\dot{G}_1^{\leftarrow} = \frac{\operatorname{sign}(\dot{G}_1^{\leftarrow})}{\tau_{\dot{G}_1}}\, \sqrt{g_{1\perp L(f, g_0)} \cdot g_{1\perp L(f, g_0)}} \tag{79}$$

and of entropy generation by irreversibility

$$\dot{S}_{\rm gen} = \frac{1}{\tau_{\dot{S}_{\rm gen}}}\, \sqrt{f_{\perp L(g_0, g_1)} \cdot f_{\perp L(g_0, g_1)}} \tag{80}$$

so that the rate equation takes the form

$$\dot{x} = \frac{\operatorname{sign}(\dot{G}_1^{\leftarrow})}{\tau_{\dot{G}_1}} \left( \frac{g_{1\perp L(f, g_0)}}{\sqrt{g_{1\perp L(f, g_0)} \cdot g_{1\perp L(f, g_0)}}} + \frac{1}{T_R}\, \frac{\sqrt{g_{1\perp L(f, g_0)} \cdot g_{1\perp L(f, g_0)}}}{f_{\perp L(g_0, g_1)} \cdot f_{\perp L(g_0, g_1)}}\, f_{\perp L(g_0, g_1)} \right) + \frac{1}{\tau_{\dot{S}_{\rm gen}}}\, \frac{f_{\perp L(g_0, g_1)}}{\sqrt{f_{\perp L(g_0, g_1)} \cdot f_{\perp L(g_0, g_1)}}} \tag{81}$$

from which we see that the characteristic time of entropy exchange is related to that of energy exchange and to the temperature of the thermal reservoir as follows

$$\tau_{\dot{S}} = \frac{\sqrt{f_{\perp L(g_0, g_1)} \cdot f_{\perp L(g_0, g_1)}}}{\sqrt{g_{1\perp L(f, g_0)} \cdot g_{1\perp L(f, g_0)}}}\; T_R\, \tau_{\dot{G}_1} \tag{82}$$

or, equivalently, we may see that the following state functional of the system has to match the temperature of the reservoir

$$\sqrt{\frac{g_{1\perp L(f, g_0)} \cdot g_{1\perp L(f, g_0)}}{f_{\perp L(g_0, g_1)} \cdot f_{\perp L(g_0, g_1)}}}\; \frac{\tau_{\dot{S}}}{\tau_{\dot{G}_1}} = T_R \tag{83}$$
and, at least in this restricted sense, we could enforce an interpretation of this state functional as a “non-equilibrium temperature” of the system. The question of how and whether the concept of temperature should be extended to the non-equilibrium domain is currently a frontier topic in physics and engineering (see, e.g., [41]).
Notice that, using the explicit forms developed in the Appendix and in Section 9, we may rewrite the above expressions in terms of the ratio of energy to entropy fluctuations, by virtue of the identity

$$\frac{g_{1\perp L(f, g_0)} \cdot g_{1\perp L(f, g_0)}}{f_{\perp L(g_0, g_1)} \cdot f_{\perp L(g_0, g_1)}} = \frac{\Delta G_1 G_1}{\Delta F F} \tag{84}$$

12. Conclusions

The structure of the family of rate equations we discuss in this paper represents a flexible framework in which to cast non-equilibrium time-dependent problems where it is necessary to describe a smooth constrained approach to a maximum entropy distribution, with or without varying magnitudes of the constraints.
It is noteworthy that the time evolution generated by our rate Equation 25 is more general than any equation assuming that the probability distribution always maximizes the entropy functional subject to some “possibly unknown” set of constraints [7, 8, 28].
Generalizations of the present formalism to non-extensive entropy functionals or other convex functionals of the probability distribution, as well as to nonlinear constraints, have been developed and will be presented elsewhere.

Acknowledgements

I am personally deeply indebted to Dr. George N. Hatsopoulos for his support and his constant enthusiasm and inspiration in my continuous attempts to understand the fundamental physical roots of thermodynamics, and to develop methods for non-equilibrium thermodynamic analysis. Likewise, I am grateful to Prof. Elias P. Gyftopoulos for sharing with me his enthusiasm, passion, sometimes frustration, and certainly very long hours of hard joint work behind each dense page of Ref. [1]. Finally, I thank the MIT Mechanical Engineering Department for hosting me during the 2007 Fall term when this paper was prepared, and the Università di Brescia for supporting this visit.

Appendix. Orthogonal Decomposition of a Vector with respect to a Linear Manifold

Given a set of vectors g_0, g_1, ..., g_n, the symbol

$$L(g_0, g_1, \ldots, g_n) \tag{85}$$

will denote their linear span, i.e., the linear manifold containing all the vectors that are (real) linear combinations of g_0, g_1, ..., g_n. Given another vector b, the symbol

$$b_{L(g_0, g_1, \ldots, g_n)} \tag{86}$$

will denote the orthogonal projection of b onto the linear manifold L(g_0, g_1, ..., g_n), namely, the unique vector in L(g_0, g_1, ..., g_n) such that its dot product with any other vector g in L(g_0, g_1, ..., g_n) equals the dot product of b with g, i.e.,

$$g \cdot b_{L(g_0, g_1, \ldots, g_n)} = g \cdot b \tag{87}$$

for every g in L(g_0, g_1, ..., g_n).
In terms of a set of linearly independent vectors h_1, ..., h_r spanning the manifold L(g_0, g_1, ..., g_n), where clearly r ≤ n + 1, we can write two equivalent explicit expressions for the projection b_L(g_0, g_1, ..., g_n) of vector b onto L(g_0, g_1, ..., g_n). The first is

$$b_{L(g_0, g_1, \ldots, g_n)} = \sum_{k=1}^{r} \sum_{m=1}^{r} (b \cdot h_k)\, \left[M(h_1, \ldots, h_r)^{-1}\right]_{km}\, h_m \tag{88}$$

where M(h_1, ..., h_r)⁻¹ is the inverse of the Gram matrix

$$M(h_1, \ldots, h_r) = \begin{bmatrix} h_1 \cdot h_1 & \cdots & h_r \cdot h_1 \\ \vdots & \ddots & \vdots \\ h_1 \cdot h_r & \cdots & h_r \cdot h_r \end{bmatrix} \tag{89}$$

The second expression is a ratio of two determinants (the leading minus sign compensates the sign of the bordered determinant; for r = 1 the formula reduces to b_L = (b · h_1) h_1 / (h_1 · h_1)),

$$b_{L(g_0, g_1, \ldots, g_n)} = -\,\frac{\begin{vmatrix} 0 & h_1 & \cdots & h_r \\ b \cdot h_1 & h_1 \cdot h_1 & \cdots & h_r \cdot h_1 \\ \vdots & \vdots & \ddots & \vdots \\ b \cdot h_r & h_1 \cdot h_r & \cdots & h_r \cdot h_r \end{vmatrix}}{\begin{vmatrix} h_1 \cdot h_1 & \cdots & h_r \cdot h_1 \\ \vdots & \ddots & \vdots \\ h_1 \cdot h_r & \cdots & h_r \cdot h_r \end{vmatrix}} \tag{90}$$

where the determinant at the denominator, det M(h_1, ..., h_r), is always strictly positive because the vectors h_1, ..., h_r are linearly independent.
In the paper, our rate equations are expressed in terms of vectors of the form

$$b_{\perp L(g_0, g_1, \ldots, g_n)} = b - b_{L(g_0, g_1, \ldots, g_n)} = \frac{\begin{vmatrix} b & h_1 & \cdots & h_r \\ b \cdot h_1 & h_1 \cdot h_1 & \cdots & h_r \cdot h_1 \\ \vdots & \vdots & \ddots & \vdots \\ b \cdot h_r & h_1 \cdot h_r & \cdots & h_r \cdot h_r \end{vmatrix}}{\begin{vmatrix} h_1 \cdot h_1 & \cdots & h_r \cdot h_1 \\ \vdots & \ddots & \vdots \\ h_1 \cdot h_r & \cdots & h_r \cdot h_r \end{vmatrix}} \tag{91}$$

where in writing Equation 91 we make use of Equation 90. The vector b_⊥L(g_0, g_1, ..., g_n) is orthogonal to the manifold L(g_0, g_1, ..., g_n); indeed, the vector represented by Equation 91 has the relevant property

$$g_k \cdot b_{\perp L(g_0, g_1, \ldots, g_n)} = 0 \qquad k = 0, 1, \ldots, n \tag{92}$$

which follows directly from Relation 87, and hence the relation

$$b = b_{L(g_0, g_1, \ldots, g_n)} + b_{\perp L(g_0, g_1, \ldots, g_n)} \tag{93}$$

represents the unique orthogonal decomposition of vector b with respect to the manifold L(g_0, g_1, ..., g_n).
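These decompositions translate directly into a few lines of linear algebra (a hedged sketch; the function name is hypothetical):

```python
import numpy as np

def perp_component(b, H):
    """b_perpL = b - b_L (Eqs. 88 and 91): orthogonal complement of the
    projection of b onto the span of the linearly independent columns of
    H = [h1 ... hr]."""
    M = H.T @ H                              # Gram matrix, Eq. 89
    b_L = H @ np.linalg.solve(M, H.T @ b)    # projection, Eq. 88
    return b - b_L
```

One can verify numerically that H.T @ perp_component(b, H) vanishes (Eq. 92) and that b @ perp_component(b, H) is nonnegative (Eq. 94).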
Moreover, we have the other obvious, but relevant, properties

$$b \cdot b_{\perp L(g_0, g_1, \ldots, g_n)} = b_{\perp L(g_0, g_1, \ldots, g_n)} \cdot b_{\perp L(g_0, g_1, \ldots, g_n)} \ge 0 \tag{94}$$

where the strict inequality applies whenever b is not in L(g_0, g_1, ..., g_n), and, for any a and b,

$$a \cdot b_{\perp L(g_0, g_1, \ldots, g_n)} = a_{\perp L(g_0, g_1, \ldots, g_n)} \cdot b_{\perp L(g_0, g_1, \ldots, g_n)} \tag{95}$$
An important formula, which derives from Eq. 91 and the usual properties of determinants, is

$$b_{\perp L(g_0, g_1, \ldots, g_n)} \cdot b_{\perp L(g_0, g_1, \ldots, g_n)} = \frac{\det M(b, h_1, \ldots, h_r)}{\det M(h_1, \ldots, h_r)} \tag{96}$$

Moreover, choosing the set of linearly independent vectors h_1, ..., h_r so that h_r = 2x is the gradient of the normalization constraint, and using the notation introduced in Section 9, i.e., defining the “mean” functionals H_j = h_j · h_r / 4 and B = b · h_r / 4, and the “deviation” vectors ∆h_j = (h_j − h_r H_j)/2 and ∆b = (b − h_r B)/2, it is easy to show that

$$\frac{\det M(b, h_1, \ldots, h_r)}{\det M(h_1, \ldots, h_r)} = \frac{\det M(\Delta b, \Delta h_1, \ldots, \Delta h_{r-1})}{\det M(\Delta h_1, \ldots, \Delta h_{r-1})} \tag{97}$$

As a final remark, we write the following generalized form of the Cauchy-Schwarz inequality

$$\det M(\Delta b, \Delta h_1, \ldots, \Delta h_{r-1}) \le \det M(\Delta h_1, \ldots, \Delta h_{r-1})\; \Delta b \cdot \Delta b \tag{98}$$
In the paper, we make extensive use of the notation and relations discussed in this Appendix [24, 25].

References

  1. Gyftopoulos, E.P.; Beretta, G.P. Thermodynamics. Foundations and Applications; Dover: Mineola, 2005. [Google Scholar]
  2. Zanchini, E.; Beretta, G.P. Rigorous Axiomatic Definition of Entropy Valid Also For Non-Equilibrium States. In Meeting the Entropy Challenge; Beretta, G.P., Ghoniem, A., Hatsopoulos, G.N., Eds.; AIP Book; CP1033, in press (2008).
  3. Clausius, R. Über Verschiedene für die Anwendungen Bequeme Formen der Hauptgleichungen der Mechanischen Wärmetheorie. Memoir read at the Philos. Soc. Zürich on April 24. Pogg. Ann. 1865, 125, 353–400. [Google Scholar] English translation in: The Second Law of Thermodynamics; Kestin, J., Ed.; Dowden, Hutchinson and Ross: Stroudsburg, 1976; pp. 162–193.
  4. von Neumann, J. Beweis des Ergodensatzes und des H-Theorems in der neuen Mechanik. Zeitschrift für Physik 1929, 57, 30–70. [Google Scholar] [CrossRef]
  5. Shannon, C.E. The Mathematical Theory of Communication. Bell System Technical J. 1948, 27, 379–423; 623–656. [Google Scholar] [CrossRef]
  6. Margenau, H. The Nature of Physical Reality; McGraw-Hill, 1950. [Google Scholar]
  7. Levine, R.D.; Tribus, M. (Eds.) The Maximum Entropy Formalism; The M.I.T. Press: Cambridge, Mass, 1979.
  8. Alhassid, Y.; Agmon, N.; Levine, R.D. Collision Experiments with Partial Resolution of Final States: Maximum Entropy Procedure and Surprisal Analysis. Phys. Rev. C 1978, 18, 1775–1788. [Google Scholar]
  9. Levine, R.D. Entropy and Macroscopic Disequilibrium. II. The Information Theoretic Characterization of Markovian Relaxation Processes. J. Chem. Phys. 1976, 65, 3302–3315. [Google Scholar] [CrossRef]
  10. Keck, J.C. Rate-Controlled Constrained-Equilibrium Method for Treating Reactions in Complex Systems. In The Maximum Entropy Formalism; Levine, R.D., Tribus, M., Eds.; The M.I.T. Press: Cambridge, Mass, 1979; pp. 219–245. [Google Scholar]
  11. Beretta, G.P.; Keck, J.C. The Constrained Equilibrium Approach to Nonequilibrium Dynamics. In Computer-Aided Engineering of Energy Systems, Second Law Analysis and Modeling; Gaggioli, R.A., Ed.; ASME Book H0341C-AES, 1986; Vol. 3, pp. 135–139. [Google Scholar]
  12. Beretta, G.P. On the General Equation of Motion of Quantum Thermodynamics and the Distinc-tion between Quantal and Nonquantal Uncertainties. Sc.D. thesis, MIT, Cambridge, MA, 1981. [Google Scholar]
  13. Beretta, G.P.; Gyftopoulos, E.P.; Park, J.L.; Hatsopoulos, G.N. Quantum Thermodynamics. A New Equation of Motion for a Single Constituent of Matter. Nuovo Cimento B 1984, 82, 169–191. [Google Scholar] [CrossRef]
  14. Beretta, G.P.; Gyftopoulos, E.P.; Park, J.L. Quantum Thermodynamics. A New Equation of Motion for a General Quantum System. Nuovo Cimento B 1985, 87, 77–97. [Google Scholar] [CrossRef]
  15. Beretta, G.P. Steepest Entropy Ascent in Quantum Thermodynamics. In The Physics of Phase Space; Kim, Y.S., Zachary, W.W., Eds.; Lecture Notes in Physics; Springer-Verlag, 1986; Vol. 278, pp. 441–443. [Google Scholar]
  16. Beretta, G.P. A General Nonlinear Evolution Equation for Irreversible Conservative Approach to Stable Equilibrium. In Frontiers of Nonequilibrium Statistical Physics; Moore, G.T., Scully, M.O., Eds.; Plenum Press: New York, 1986; pp. 193–204. [Google Scholar]
  17. Beretta, G.P. Quantum Thermodynamics of Nonequilibrium. Onsager Reciprocity and Dispersion-Dissipation Relations. Found. Phys. 1987, 17, 365–381. [Google Scholar] [CrossRef]
  18. Beretta, G.P. Nonlinear Model Dynamics for Closed-System, Constrained, Maximal-Entropy-Generation Relaxation by Energy Redistribution. Phys. Rev. E 2006, 73, 026113. [Google Scholar] [CrossRef] [PubMed]
  19. Hatsopoulos, G.N.; Gyftopoulos, E.P. A Unified Quantum Theory of Mechanics and Thermodynamics. Found. Phys. 1976, 6, 15–31; 127–141; 439–455; 561–570. [Google Scholar] [CrossRef]
  20. Maddox, J. Uniting Mechanics and Statistics. Nature 1985, 316, 11. [Google Scholar]
  21. Hatsopoulos, G.N.; Keenan, J.H. Principles of General Thermodynamics; Wiley, 1965. [Google Scholar] [CrossRef]
  22. Gheorghiu-Svirschevski, S. Nonlinear Quantum Evolution with Maximal Entropy Production. Phys. Rev. A 2001, 63, 022105. [Google Scholar] and Addendum. Phys. Rev. A 2001, 63, 054102.
  23. Lemanska, M.; Jaeger, Z. A Nonlinear Model for Relaxation in Excited Closed Physical Systems. Physica D 2002, 170, 72–76. [Google Scholar] [CrossRef]
  24. Beretta, G.P. A New Approach to Constrained-Maximization Nonequilibrium Problems. In Computer-Aided Engineering of Energy Systems: Second Law Analysis and Modeling; Gaggioli, R.A., Ed.; ASME Book H0341C-AES, 1986; Vol. 3, pp. 129–134. [Google Scholar]
  25. Beretta, G.P. Dynamics of Smooth Constrained Approach to Maximum Entropy. Second Law Analysis of Thermal Systems, Moran, M.J., Sciubba, E., Eds.; ASME Book I00236, 1987; 17–24. [Google Scholar]
  26. Caticha, A. Change, Time and Information Geometry. AIP Conf. Proc. 2001, 568, 72. [Google Scholar]
  27. Wootters, W.K. Statistical Distance and Hilbert Space. Phys. Rev. D 1981, 23, 357–362. [Google Scholar] [CrossRef]
  28. Salamon, P.; Nulton, J.D.; Berry, R.S. Length in Statistical Thermodynamics. J. Chem. Phys. 1985, 82, 2433–2436. [Google Scholar] [CrossRef]
  29. Braunstein, S.L.; Caves, C.M. Statistical Distance and the Geometry of Quantum States. Phys. Rev. Lett. 1994, 72, 3439–3443. [Google Scholar] [CrossRef] [PubMed]
  30. Beretta, G.P. Time-Energy and Time-Entropy Uncertainty Relations in Dissipative Quantum Systems. 2005; arXiv:quant-ph/0511091. [Google Scholar]
  31. Onsager, L. Reciprocal Relations in Irreversible Processes I. Phys. Rev. 1931, 37, 405. [Google Scholar] [CrossRef] Reciprocal Relations in Irreversible Processes II. 1931, 38, 2265.
  32. Casimir, H.B.G. On Onsager’s Principle of Microscopic Reversibility. Rev. Mod. Phys. 1945, 17, 343. [Google Scholar] [CrossRef]
  33. Callen, H.B. Phenomenological Theory of Irreversible Processes. Thesis, MIT, 1947; unpublished. [Google Scholar]
  34. Kubo, R. Statistical-Mechanical Theory of Irreversible Processes. I. General Theory and Simple Applications to Magnetic and Conduction Problems. J. Phys. Soc. Jpn. 1957, 12, 570–586. [Google Scholar] [CrossRef]
  35. Callen, H.B.; Barasch, M.L.; Jackson, L.L. Statistical Mechanics of Irreversibility. Phys. Rev. 1952, 88, 1382. [Google Scholar] [CrossRef]
  36. Callen, H.B.; Welton, T.A. Irreversibility and Generalized Noise. Phys. Rev. 1952, 83, 34. [Google Scholar] Greene, R.F.; Callen, H.B. On the Formalism of Thermodynamic Fluctuation Theory. Phys. Rev. 1951, 83, 1231. [Google Scholar] Callen, H.B.; Greene, R.F. On a Theorem of Irreversible Thermodynamics. Phys. Rev. 1952, 86, 702. [Google Scholar] Greene, R.F; Callen, H.B. On a Theorem of Irreversible Thermodynamics. II. Phys. Rev. 1952, 88, 1837. [Google Scholar]
  37. Onsager, L.; Machlup, S. Fluctuations and Irreversible Processes. Phys. Rev. 1953, 91, 1505. [Google Scholar] [CrossRef]
  38. Prigogine, I. Introduction to Thermodynamics of Irreversible Processes; Thomas, C.C., Ed.; Springfield: Illinois, 1965. [Google Scholar]
  39. De Groot, S.R.; Mazur, P. Non-Equilibrium Thermodynamics; Dover: Mineola, 1984. [Google Scholar]
  40. Beretta, G.P.; Gyftopoulos, E.P. Thermodynamic Derivations of Conditions for Chemical Equilibrium and of Onsager Reciprocal Relations for Chemical Reactors. J. Chem. Phys. 2004, 121, 2718–2728. [Google Scholar]
  41. Casas-Vazquez, J.; Jou, D. Temperature in Non-equilibrium States: a Review of Open Problems and Current Proposals. Rep. Progr. Phys. 2003, 66, 1937. [Google Scholar] [CrossRef]
