Next Article in Journal
An Improved Incremental Procedure for the Ground Reaction Based on Hoek-Brown Failure Criterion in the Tunnel Convergence-Confinement Method
Previous Article in Journal
On Hermite–Hadamard–Fejér-Type Inequalities for η-Convex Functions via Quantum Calculus
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Invariant Finitely Additive Measures for General Markov Chains and the Doeblin Condition

by
Alexander Zhdanok
Institute for Information Transmission Problems (A.A. Kharkevich Institute) of Russian Academy of Sciences, 19/1 Bolshoy Karetny Per., 127051 Moscow, Russia
Mathematics 2023, 11(15), 3388; https://doi.org/10.3390/math11153388
Submission received: 26 June 2023 / Revised: 28 July 2023 / Accepted: 31 July 2023 / Published: 2 August 2023
(This article belongs to the Section Probability and Statistics)

Abstract

:
In this paper, we consider general Markov chains with discrete time in an arbitrary measurable (phase) space. Markov chains are given by a classical transition function that generates a pair of conjugate linear Markov operators in a Banach space of measurable bounded functions and in a Banach space of bounded finitely additive measures. We study sequences of Cesaro means of powers of Markov operators on the set of finitely additive probability measures. It is proved that the set of all limit measures (points) of such sequences in the weak topology generated by the preconjugate space is non-empty, weakly compact, and all of them are invariant for this operator. We also show that the well-known Doeblin condition ( D ) for the ergodicity of a Markov chain is equivalent to condition ( ) : all invariant finitely additive measures of the Markov chain are countably additive, i.e., there are no invariant purely finitely additive measures. We give all the proofs for the most general case.

1. Introduction

We study general Markov chains understood as random Markov processes with discrete time in an arbitrary phase space and homogeneous in time. Markov chains are given by a transition probability (function) that is countably additive by its second argument, i.e., we consider only “classical” Markov chains. Transition probabilities generate integral Markov operators acting in spaces of bounded measurable functions, bounded countably additive, and bounded finitely additive measures.
In our papers [1,2], we proved several theorems about the ergodicity of Markov chains in one sense or another depending on the properties of subspaces of invariant finitely additive measures. In the present paper, we continue to investigate these problems within the framework of the operator approach proposed by Kryloff and Bogoliouboff [3,4] in 1937, developed by Yosida and Kakutani [5] in 1941, that is used in many works on this subject (see, for example, Revuez’s book [6]).
In Section 2 and Section 3, we introduce the basic information and constructions used below, which are not fully represented in the known and available literature. The content and form of presentation of this preliminary information are approximately the same as in our previous articles on this topic, i.e., we repeat them for the convenience of readers.
The main results of this paper are contained in Section 4, Section 5 and Section 6. We present these results here too.
In Theorem 5, we consider sequences of Cesàro means of powers of a Markov operator in the space of finitely additive measures. It is proved that the set of all limit measures of such sequences in the ∗-weak topology is non-empty, ∗-weakly compact, and all of them are invariant for this Markov operator. Here, the ∗-weak topology τ B is the topology in the space of finitely additive measures generated by the preconjugate space of measurable functions.
The main statement of the whole article is proved in Theorem 7: the Doeblin condition ( D ) of ergodicity for general Markov chains is equivalent to the condition ( ) which states that all invariant finitely additive measures of a Markov chain are countably additive. Condition ( ) (Corollary 1), in turn, is equivalent to the condition ( ˜ ) which states that the Markov chain does not have invariant purely finitely additive measures.
In Section 6, Theorem 9, on the finite dimension of the set of invariant finitely additive measures for Markov chains satisfying the condition ( ) is given. Thereafter, in Theorems 10 and 11, we also give an inversion of Theorem 9 for the case when the invariant finitely additive measure is unique.
We present some theorems and definitions from other publications used in Section 4, Section 5 and Section 6 in exact and complete formulations (and with their new numbering) in order to make the text of this article more autonomous.
In Section 7, we discuss the results of Lin [7] and Horowitz [8], which are close to our theorems in the present paper. We also comment on recently published papers [9,10], in which the authors use finitely additive measures for classical Markov chains.

2. Finitely Additive Measures

2.1. Definitions, Designations and Some Information

Here are some of the basic definitions and concepts used by us, as well as their symbolism, focusing on [11,12].
Let us begin with notation and definitions.
Let X be an arbitrary infinite set and Σ be some sigma-algebra of its subsets. The pair ( X , Σ ) is called measurable space and each set A Σ is called measurable. We shall always (by default) assume that the sigma-algebra Σ contains all one-point sets { x } Σ , x X . Everywhere below, R = R 1 is the set of real numbers (number line).
We denote by B ( X , Σ ) the Banach space of bounded Σ -measurable functions f : X R with s u p -norm || f || = s u p | f ( x ) | .
Any function φ : Σ R will be called a set function on Σ . The set function φ ( E ) , E Σ , can take both positive and negative values. In this paper, we use only bounded set functions, i.e., such that s u p | φ ( E ) | < , where the supremum is taken over all sets E Σ .
Definition 1. 
Let ( X , Σ ) be an arbitrary measurable space. A bounded set function μ : Σ R is called finitely additive measure if μ ( ) = 0 and μ ( E 1 E 2 ) = μ ( E 1 ) + μ ( E 2 ) for any sets E 1 , E 2 Σ such that E 1 E 2 = .
Definition 2. 
Let ( X , Σ ) be an arbitrary measurable space. A bounded set function μ : Σ R is called countably additive measure if it is a finitely additive measure and the following condition is satisfied: if E 1 , E 2 , Σ , E i E j = for i j then
μ ( n = 1 E n ) = n = 1 μ ( E n ) .
Definition 3 
(see [12]). A finitely additive non-negative measure μ : Σ R is called purely finitely additive (pure charge, pure mean) if any countably additive measure λ satisfying the condition 0 λ μ is identically zero. The alternating measure μ is called is purely finitely additive if both non-negative measures μ + and μ of its Jordan decomposition μ = μ + μ are purely finitely additive.
If the measure μ is purely finitely additive, then it is equal to zero on every one-point set: μ ( { x } ) = 0 , x X (see [13], Lemma 1). The converse, generally speaking, is not true (for example, for the Lebesgue measure on the segment [ 0 , 1 ] ).
Theorem 1 
(Yosida-Hewitt decomposition, see [12]). Any finitely additive measure μ uniquely decomposes into the sum μ = μ 1 + μ 2 , where μ 1 is a countably additive measure and μ 2 is a purely finitely additive measure.
The first versions of such a decomposition of finitely additive measures on some topological spaces were constructed in the articles of Alexandroff ([14], (Chapter III, Section 13, Theorems 1–5)).
In this article, we also consider Banach spaces of bounded measures μ : Σ R with a norm equal to the total variation in the measure μ :
b a ( X , Σ ) is the space of finitely additive measures, and
c a ( X , Σ ) is the space of countably additive measures.
If μ 0 , then the norm is || μ || = μ ( X ) . These measure spaces are studied in detail in [11].
We can formally consider a measure that is identically equal to zero to be both countably additive and purely finitely additive.
The purely finitely additive measures also form a Banach space p f a ( X , Σ ) with the same norm, and b a ( X , Σ ) = c a ( X , Σ ) p f a ( X , Σ ) .
We denote the sets of measures:
  • S b a = { μ b a ( X , Σ ) : μ 0 , | | μ | | = 1 } ,
  • S c a = { μ c a ( X , Σ ) : μ 0 , | | μ | | = 1 } ,
  • S p f a = { μ p f a ( X , Σ ) : μ 0 , | | μ | | = 1 } .
All measures from these sets will be called probabilistic.

2.2. Order Properties of Measure Spaces

We present here some order properties of measure spaces that we will use in the next sections.
The measure space b a ( X , Σ ) is semi-ordered with respect to the natural order relation: for μ 1 , μ 2 b a ( X , Σ ) we write μ 1 μ 2 if μ 1 ( E ) μ 2 ( E ) for all E Σ .
In one paper ([12], Theorem 1.11), it was proved that the space of finitely additive measures b a ( X , Σ ) is a vector lattice and formulas were given for finding ( μ 1 μ 2 ) = i n f { μ 1 , μ 2 } and ( μ 1 μ 2 ) = s u p { μ 1 , μ 2 } , where μ 1 , μ 2 b a ( X , Σ ) .
In one paper ([12], Theorem 1.14 and Theorem 1.17), it was proved that the subspaces c a ( X , Σ ) and p f a ( X , Σ ) are also vector lattices.
Definition 4. 
Two non-negative elements x 1 and x 2 from the vector lattice X are called disjoint if x 1 x 2 = 0 , and denoted by x 1 d x 2 . We also use this term for the corresponding pairs of measures μ 1 , μ 2 b a ( X , Σ ) .
Definition 5 
(see, for example, ([11],  Chapter III, Item 4.12)). Nonnegative measures μ 1 and μ 2 are called singular if there exist sets D 1 , D 2 Σ such that μ 1 ( D 1 ) = μ 1 ( X ) , μ 2 ( D 2 ) = μ 2 ( X ) and D 1 D 2 = , and denoted by μ 1 μ 2 .
It is easy to check that if the measures μ 1 and μ 2 are singular then they are also disjoint, i.e., μ 1 μ 2 μ 1 d μ 2 . However, the converse, generally speaking, is not true (in [15] there are corresponding examples).
It follows from ([12], Theorem 1.21) that the countably additive measures μ 1 and μ 2 are disjunctive if and only if they are singular. In one paper ([12], Theorem 1.16), it is also stated that every purely finite additive measure is disjoint with any countably additive measure.
Let us give examples of families of purely finitely additive measures.
Example 1. 
Let X = [ 0 , 1 ] R and Σ = B , where B is the Borel sigma-algebra. There exists a finitely additive measure μ : B R , μ S b a such that, for any ε > 0 , the following holds: μ ( ( 0 , ε ) ) = 1 , μ ( [ ε , 1 ] ) = 0 , μ ( { 0 } ) = 0 .
It is easy to check that such a measure is purely finitely additive but it is not unique.
It is known (see [12]) that the cardinality of a family of such measures located “near zero (on the right)” is not less than 2 2 0 = 2 c (hypercontinuum).
Example 2. 
Let X = R and Σ = B . There exists a finitely additive measure μ : B R , μ S b a such that for any x R the following holds: μ ( ( x , ) ) = 1 , μ ( ( , x ) ) = 0 , μ ( { x } ) = 0 .
This measure is also purely finitely additive. And there are also a lot of such measures.
A detailed exposition of the foundations of the general theory of finitely additive measures is contained in the monograph K.P.S.B. Rao and M.B. Rao [15], in which such measures are called charges.

3. Markov Operators and Invariant Measures

Markov chains on a (phase) measurable space ( X , Σ ) are given by their transition function (probability) p ( x , E ) , for x X and E Σ under ordinary conditions:
  • 0 p ( x , E ) 1 , p ( x , X ) = 1 , x X , E Σ ;
  • p ( · , E ) B ( X , Σ ) , E Σ ;
  • p ( x , · ) c a ( X , Σ ) , x X .
The numerical value of the function p ( x , E ) is the probability that the system will move from the point x X to the set E Σ in one step (per unit of time).
We emphasize that transition function is a countably additive measure with respect to the second argument, i.e., we consider classical Markov chains.
The transition function generates two Markov linear bounded positive integral operators:
T : B ( X , Σ ) B ( X , Σ ) , ( T f ) ( x ) = T f ( x ) = X f ( y ) p ( x , d y ) ,
f B ( X , Σ ) , x X ;
A : c a ( X , Σ ) c a ( X , Σ ) , ( A μ ) ( E ) = A μ ( E ) = X p ( x , E ) μ ( d x ) ,
μ c a ( X , Σ ) , E Σ .
Let μ 0 S c a be the initial measure. Then the iterative sequence of countably additive probability measures μ n = A μ n 1 S c a , n N , is usually identified with the Markov chain. We will call { μ n } a Markov sequence of measures.
Topologically conjugate to the space B ( X , Σ ) is the (isomorphic) space of finitely additive measures: B * ( X , Σ ) = b a ( X , Σ ) (see, for example, [11]). In this case, the operator T * : b a ( X , Σ ) b a ( X , Σ ) serves as topologically conjugate to the operator T.
The operator T * is the only bounded continuation of the operator A to the entire space b a ( X , Σ ) while preserving its analytic form, i.e.,
T * μ ( E ) = X p ( x , E ) μ ( d x ) , μ b a ( X , Σ ) , E Σ .
The operator T * has its own invariant subspace c a ( X , Σ ) , i.e., T * [ c a ( X , Σ ) ] c a ( X , Σ ) , on which it matches the original A operator. The construction of the Markov operators T and T * is now functionally closed. Where it does not cause misunderstandings, we will continue to denote the operator T * as A.
In such a setting, it is natural to admit to consideration the Markov sequences of probabilistic finitely additive measures: μ 0 S b a , μ n = T * μ n 1 S b a , n N , keeping the countable additivity of the transition function p ( x , · ) with respect to the second argument.
Such a Markov chain can have cycles consisting of finitely additive measures. The properties of such cycles Markov chain are considered in detail in our paper [16].
It is permissible and cardinally to change the formulation of the problem: to allow the transition function p ( x , · ) itself to be only a finitely additive measure with respect to the second argument. Such Markov chains are also studied (see ([1], (Chapter II, Section 5)) and [13,17]), and they are called “finitely additive Markov chains”. In this paper, we do not consider such Markov chains.
Thus, in our case, the following terminology is appropriate: we study countably additive Markov chains with operators defined on the space of finitely additive measures.
Definition 6. 
If A μ = μ for some measure μ S b a , then we will call such a measure invariant for the operator A (and for the Markov chain). An invariant countably additive measure is often called the stationary distribution of the Markov chain.
We denote the sets of invariant probability measures of the Markov chain for the operator A as follows:
  • Δ b a = { μ S b a : μ = A μ } ,
  • Δ c a = { μ S c a : μ = A μ } , and
  • Δ p f a = { μ S p f a : μ = A μ } .
Let M b a be the linear subspace of invariant measures of the Markov chain in the space b a ( X , Σ ) . Obviously, M b a is generated by the set Δ b a : M b a = S p Δ b a . We will also use the notation M c a and M p f a with a similar meaning.
The linear dimension of a set Δ b a is the algebraic dimension of the linear space M b a generated by it, and we will denote it d i m Δ b a , d i m Δ b a = d i m M b a . Similarly, we will talk about the dimension of the sets Δ c a and Δ p f a .
The classical countably additive Markov chain may or may not have invariant probability countably additive measures, i.e., possibly Δ c a = (for example, for a symmetric walk on Z).
Šidak was one of the first to extend the Markov operator A to the space of finitely additive measures in the framework of the operator approach and proved the following two important theorems in [18] (1962).
Theorem 2 
(Šidak ([18], Theorem 2.2)). Any countably additive Markov chain on an arbitrary measurable space ( X , Σ ) has at least one invariant finitely additive probability measure, i.e., always Δ b a .
This result was then briefly proved in our paper [1] as a simple corollary of the Krein-Rutman theorem ([19], Theorem 3.1).
Theorem 3 
(Šidak ([18], (Theorem 2.5))). If a finitely additive measure μ for an arbitrary Markov chain is invariant A μ = μ , and μ = μ 1 + μ 2 is its decomposition into countably additive and purely finitely additive components, then each of them is also invariant: A μ 1 = μ 1 , A μ 2 = μ 2 .
Therefore, in many cases, it is sufficient to study invariant measures from Δ c a and from Δ p f a separately.
Remark 1. 
In [20], (1966), Foguel considered the same operator construction of a general Markov chain as Šidak in [18] (1962). However, in the main part of [20] it is assumed that X is a locally compact space. In one study [20], Foguel also studied the properties of invariant finitely additive measures of Markov chains. In this paper, we use the same construction of general Markov chains and their operators as discussed in the referenced papers by Šidak and Foguel. However, here we are solving our own problems not covered in the works of Šidak [18] and Foguel [20].
We note that in several other papers Foguel (see, for example, [21]) develops an operator approach for studying Feller Markov chains on locally compact topological spaces X with a Baire σ -algebra Σ .
Remark 2. 
Hernández-Lerma and Lasserre proved in ([22], Theorem 6.3.1) (2003) that for a Markov chain defined on a separable metrizable phase space ( X , Σ ) , under certain assumptions, there exists an invariant finite additive measure. It is also shown that if a finitely additive measure is invariant, then both its countably additive and purely finitely additive components are invariant. In this article, we consider general Markov chains and do not separately single out the particular case of the topological phase space.

4. Doeblin Condition and Invariant Measures

4.1. Doeblin Condition, Its Modification and Condition ( )

In 1937, Doeblin published a large work in two parts [23,24]. In one paper [24], [Chapter 2], for general Markov chains, condition ( D ) is formulated under which the Markov chain has the maximum set of ergodic properties.
In subsequent works, different authors constructed and used different versions of the Doeblin condition  ( D ) . We use the version suggested by Doob ([25], Chapter 5):
( D ) There is a bounded measure φ c a ( X , Σ ) , φ 0 , ε > 0 and k N , k 1 , such that , if φ ( E ) ε for E Σ , then p k ( x , E ) 1 ε for all x X .
The superscript k in p k denotes the order of the integral convolution (iteration) of the transition function, not its degree. Let us call the number k the parameter of the condition ( D ) .
Another ergodicity condition for Markov operators was introduced by Kryloff and Bogoliouboff in their two studies, also in 1937, already mentioned above, refs. [3,4]. In these studies, quasi-compactness condition ( K ) for the Markov operator T : B ( X , Σ ) B ( X , Σ ) are formulated and it is shown very briefly that this condition is sufficient for a certain uniform (mean) ergodic theorem to hold for Cesaro means of powers T n of the operator T.
Let us give this condition ( K ) for the operator T.
A Markov operator T : B ( X , Σ ) B ( X , Σ ) is called quasi-compact (quasi-completely continuous) if the following condition is satisfied
( K ) There is a compact ( completely continuous ) operator T 1 and an integer k 1 such that | | T k T 1 | | < 1 .
In these cases, the Markov chain itself will be called quasi-compact.
In one paper [5], (1941) it is noted that the given condition of quasi-compactness, denoted by ( K ) , can be used not only for the Markov operator T on the space of functions, as is performed in [3,4], but for the “dual” Markov operator A on the space of countably additive measures, and also for arbitrary linear operators in Banach spaces.
After the publication of [5], studies on the comparison of the conditions ( K ) and ( D ) under certain assumptions or in other formulations of the problem were continued.
Today, the equivalence of the conditions ( K ) and ( D ) in the general case can be considered established.
However, we will not actively use the condition ( K ) in this paper.
Let an arbitrary Markov chain with a transition function p ( x , E ) and Markov operators T and A be given on ( X , Σ ) . For any fixed m N , we define a new Markov chain with a transition function q m ( x , E ) and Markov operators T m and A m according to the rules for constructing Cesaro means:
q m ( x , E ) = 1 m k = 1 m p k ( x , E ) , T m = 1 m k = 1 m T k , A m = 1 m k = 1 m A k .
We will call such a Markov chain a finitely averaged Markov chain (by the original Markov chain).
In [2], we formulate one more ergodicity condition ( D ˜ ) for a finitely averaged Markov chain.
( D ˜ ) There is bounded measure φ c a ( X , Σ ) , φ 0 , ε > 0 and m N , such that , if φ ( E ) ε , E Σ , then q m ( x , E ) 1 ε for all x X .
Obviously, the condition ( D ˜ ) is Doeblin condition ( D ) for a finitely averaged Markov chain (for fixed m 1 ) with parameter k = 1 . In one paper ([2], Theorem 12.1) it is shown that if condition ( D ) is satisfied, then condition ( D ˜ ) is also satisfied. However, the converse statement was not proved in [2], so this condition ( D ˜ ) had to be introduced. We have obtained such a converse statement in this paper and it will be shown below.
In one author’s paper [2], the following statement was proved:
Theorem 4 
([2], Theorem 12.2). For an arbitrary Markov chain, the condition ( D ˜ ) is equivalent to the condition ( ) :
( ) Δ b a c a ( X , Σ ) ,
it means that all invariant finitely additive measures of the original Markov chain are countably additive.
In Section 5, we will prove a new statement that the classical Doeblin condition ( D ) is also equivalent to the condition ( ) . But for this, we need some information from functional analysis (see, for example, [11]) and the two preceding theorems.

4.2. Theorem on Invariance of All Weakly Limiting Measures for Cesàro Means

The space of measures b a ( X , Σ ) is topologically conjugate to the space of functions B ( X , Σ ) , i.e., B * ( X , Σ ) = b a ( X , Σ ) (up to isomorphism), as we have already noted. Therefore, in Banach space b a ( X , Σ ) , we can consider not only the strong (metric) topology, but also the ∗-weak topology τ B generated by the preconjugate space B ( X , Σ ) .
This topology is given by the Tikhonov base of neighborhoods of the point (measure) η b a ( X , Σ ) of the form
V ( η , f 1 , f 2 , , f k , ε )
= { μ b a ( X , Σ ) : | f i , μ f i , η | < ε , i = 1 , 2 , , k ; k N , ε > 0 } ,
where ε and k are arbitrary, and f 1 , f 2 , , f k B ( X , Σ ) . The notation f i , μ denotes the value of the function f i , as a linear functional on the measure μ , calculated by the formula
f i , μ = X f i ( x ) μ ( d x ) , i = 1 , 2 , , k .
We will also use the notation f ( μ ) = f , μ .
If a certain sequence of measures { λ n } = { λ 1 , λ 2 , } S b a is given, then the symbol M { λ n } denote the set of all limit points (measures) in the τ B -topology of the sequence of measures { λ n } . Note that the fact that the measure η is the limit point of the sequence of measures { λ n } , generally speaking, does not imply that in { λ n } there is a subsequence converging to η in the τ B -topology.
Let a Markov chain with operator A be given on ( X , Σ ) . We introduce the notation for Cesaro means for some initial measure μ S b a :
λ n = λ n μ = 1 n k = 1 n A k μ , n N .
The proofs of further theorems are based on Theorem 7.2 from [1], its proof in [1] is not given. It only says that it can be carried out by analogy with the proof of another theorem for a Feller–Markov chain on a topological space. However, the difference between general Markov chains and topological Markov chains is very large, and we restore below the unpublished proof of Theorem 7.2 from [1].
Theorem 5. 
Let an arbitrary Markov chain and an initial finitely additive measure μ S b a be given on an arbitrary ( X , Σ ) . Then each τ B -limit point (in the topology τ B ) of the sequence { λ n μ } in b a ( X , Σ ) will be the fixed point of the operator A, i.e., M { λ n } Δ b a . The set of such measures is nonempty, i.e., M { λ n } and M { λ n } is τ B -compact.
Proof. 
Let us choose some measure μ S b a . It is obvious that || λ n || = λ n ( X ) = 1 , n = 1 , 2 , , i.e., the set { λ n } is metrically bounded in b a ( X , Σ ) . Hence, the τ B -closure of the set { λ n } is compact in the τ B -topology (see [11], Chapter V, Item 4, Corollary 3).
By Kelley ([26], Chapter V, Theorem 5), any subsequence of λ n i , including λ n , in a compact set has an ω -limit point η = η { λ n i } such that any of its neighborhood contains infinitely many elements of the sequence. It means that for each τ B -neighborhood V ( η , f 1 , f 2 , , f k , ε ) , where ε > 0 , and f i B ( X , Σ ) , i = 1 , 2 , , k , the set { i : λ n i V ( η , f 1 , f 2 , , f k , ε ) } is infinite, i.e., there is a subsequence { λ n i j } , λ n i j V ( η , f 1 , f 2 , , f k , ε ) , j = 1 , 2 ,
Let the measure η be τ B -limit for the sequence { λ n } .
Now, let us do the following transformation.
A λ n = 1 n k = 1 n A k + 1 μ + 1 n [ A n + 1 μ A μ ] = λ n + 1 n [ A n + 1 μ A μ ] , n N .
Let f B ( X , Σ ) . Then for any ε > 0 there is a strictly increasing sequence of natural numbers { n i } = { n i } ( f , ε ) such that λ n i V ( η , f , T f , ε ) , i = 1 , 2 ,
Now
| f ( η ) f ( A η ) | = | f ( η ) T f ( η ) |
| f ( η ) f ( λ n i ) | + | f ( λ n i ) T f ( λ n i ) | + | T f ( λ n i ) T f ( η ) |
ε + | f ( λ n i ) f ( A λ n i ) | + ε = 2 ε + | f ( λ n i ) f ( λ n i ) f ( 1 n i [ A n i + 1 μ A μ ] ) |
2 ε + 1 n i | f ( A n i + 1 μ A μ ) | 2 ε + 2 || f || n i .
Since n i for i , then | f ( η ) f ( A η ) | 2 ε . As ε is arbitrary, then | f ( η ) f ( A η ) | = 0 .
So, for each f B ( X , Σ ) , the equality f ( η ) = f ( A η ) holds. The set B ( X , Σ ) ) is total on b a ( X , Σ ) . Therefore, η = A η . Moreover, η b a ( X , Σ ) .
Let us show that η S b a , i.e., the measure η is normalized and positive.
Consider τ B -neighborhood of the point η of the form V ( η , f , ε ) , where ε > 0 is arbitrary, f B ( X , Σ ) and f ( x ) 1 . Then there is n i such that λ n i V ( η , f , ε ) , i.e.,
| f , η f , λ n i | = | η ( X ) λ n i ( X ) | < ε .
Since λ n ( X ) = 1 for all n N , then | η ( X ) 1 | < ε for any ε > 0 , whence η ( X ) = 1 .
Suppose there is E Σ and r > 0 such that η ( E ) = r < 0 .
Let us take as f B ( X , Σ ) the characteristic function f = χ E of the set E and the number ε = r 2 . Then f , η = η ( E ) = r , f , λ n = λ n ( E ) and
| f , λ n f , η | = λ n ( E ) + r r > ε for all n N .
Therefore, λ n V ( η , f , ε ) for n N , i.e., the measure η is not τ B -limit for the sequence { λ n } . The resulting contradiction proves that η ( E ) 0 for all E Σ .
Summing up all the previous conclusions, we obtain as a result that M { λ n } Δ b a , M { λ n } and M { λ n } is τ B -compact. The theorem is proved. □
We also need one more theorem ([1], Theorem 7.3). Note that in this theorem the means are taken for Markov chains with different initial measures μ n for each n N .
This is a generalization of Theorem 5. Its proof follows the same scheme as the proof of Theorem 5.
Theorem 6 
([1], Theorem 7.3). Suppose that we have an arbitrary Markov chain on an arbitrary ( X , Σ ) , a sequence of measures μ n b a ( X , Σ ) , μ n S b a , and
λ n = λ n μ n = 1 n k = 1 n A k μ n , n = 1 , 2 , .
Then each τ B -limit measure of the sequence { λ n } is invariant for the operator A, i.e., M { λ n } Δ b a , the set of such measures is nonempty, M { λ n } and M { λ n } is τ B -compact.

5. Equivalence of Doeblin Condition to Condition ( ) and to Condition ( ˜ )

The aim of this section is to obtain the main result of the paper.
At the beginning, let us prove one technical lemma, that we need in the proof of the second part (necessity) of Theorem 7.
Lemma 1. 
For any Markov chain, for any set G Σ , and for all m N , the following equality holds
sup x X p m ( x , G ) = sup η S b a A m η ( G ) .
Proof. 
It is easy to see that the transition function for any m N , x X , and G Σ can be represented in the following form
p m ( x , G ) = A m δ x ( G ) ,
where δ x is the Dirac measure at the point x X .
Hence, it follows that
sup x X p m ( x , G ) = sup x X A m δ x ( G ) .
Then, it is obvious, that with an increase in the class of measures over which the supremum is sought, this supremum can only increase:
sup x X A m δ x ( G ) sup η S b a A m η ( G ) .
Let us show that, in fact, equality holds here. Suppose the above inequality is strict. Let us introduce numbers α , β > 0 such that α < β and
sup x X p m ( x , G ) = sup x X A m δ x ( G ) = α < β = sup η S b a A m η ( G ) .
Then there is a measure η 0 S b a such that
α < α + β α 2 A m η 0 ( G ) β .
But, at the same time,
A m η 0 ( G ) = X p m ( x , G ) η 0 ( d x ) X α · η 0 ( d x ) = α .
The resulting inequality contradicts the previous inequality: A m η 0 ( G ) α + β α 2 . Therefore, the equality in the formulation of the Lemma 1 is true. The lemma is proven. □
The main result of our paper reads as follows:
Theorem 7. 
For an arbitrary general Markov chain, the Doeblin condition ( D ) is equivalent to the condition ( ) : Δ b a c a ( X , Σ ) , which means that all invariant finitely additive measures of the Markov chain are countably additive.
Proof. 
First, let us prove that the fulfillment of condition ( ) implies the fulfillment of condition ( D ) (sufficiency).
Let the condition ( ) be satisfied. Then, by Theorem 8.2 from [1], we have the dimension d i m Δ b a = d i m Δ c a = n < . Let H b a = H c a = { μ 1 , , μ n } a singular basis of the space of invariant measures M b a = M c a , existing by Theorem 6.3 from [1].
Let φ = μ 1 + + μ n be a countably additive measure (this measure may be not normalized). We want to prove that for this measure φ and for some numbers m 1 ( m N ) and ε > 0 , the Doeblin condition ( D ) is satisfied.
Suppose that the given measure φ does not satisfy condition ( D ) . Then for each m 1 ( m N ) and for each 0 < ε < 1 , there is a set E m , ε Σ and there is a point x m , ε X , such that
φ ( E m , ε ) ε and p m ( x m , ε ; E m , ε ) > 1 ε .
For each m = 1 , 2 , and for some fixed δ ( 0 , 1 ) we choose ε = ε ( m ) = δ 2 m , and E δ = m = 1 E m , ϵ ( m ) .
Since the measure φ is countably additive, and therefore it is also countably semi-additive, then
φ ( E δ ) = φ ( m = 1 E m , ϵ ( m ) ) m = 1 φ ( E m , ε ( m ) ) m = 1 ε ( m ) = δ m = 1 1 2 m = δ ,
i . e . , φ ( E δ ) δ .
At the same time, for all m = 1 , 2 , we have:
p m ( x m , ε ( m ) ; E δ ) p m ( x m , ε ( m ) ; E m , ε ( m ) ) > 1 ε ( m ) = 1 δ 2 m .
Now we construct a sequence of Dirac measures η m S c a concentrated at the points x m , ε ( m ) , i.e., η m = δ x m , ε ( m ) . Then for powers of the Markov operator A s for s = 1 , 2 , , m we have
A s η m ( E ) = p s ( x m , ε ( m ) ; E )
for arbitrary sets E Σ .
Accordingly, for the Cesaro means for E = E δ and m = 1 , 2 , , the following will be true:
λ m η m ( E δ ) = 1 m s = 1 m A s η m ( E δ ) = 1 m s = 1 m p s ( x m , ε ( m ) ; E δ )
> 1 m s = 1 m ( 1 δ 2 s ) = 1 m ( m s = 1 m δ 2 s ) = 1 δ m ( 1 δ 2 m ) > 1 δ m ,
i . e . , λ m η m ( E δ ) > 1 δ m , m = 1 , 2 ,
Now we turn to the τ B -weak topology in the space b a ( X , Σ ) generated by the preconjugate space B ( X , Σ ) . By Theorem 6, the set M { λ m η m } of all τ B -limit points (measures) of the sequence λ m η m is nonempty and is contained in the set of all finitely additive measures invariant for the Markov chain, M { λ m η m } Δ b a = Δ c a , i.e., all τ B -limit points (measures) of the sequence { λ m η m } are invariant and, by the condition ( ) , countably additive.
Let the measure μ M { λ m η m } be arbitrary. Then, if for some E Σ there is a limit of the numerical sequence λ m η m ( E ) for m , then, obviously, μ ( E ) = lim m λ m η m ( E ) .
Take the set E = E δ . Then from the above estimate λ m η m ( E δ ) > 1 δ m , m = 1 , 2 , it follows that λ m η m ( E δ ) 1 for m . Consequently, for the τ B -limit (invariant) measure μ we have μ ( E δ ) = 1 . Moreover, as we obtained in the above, φ ( E δ ) δ < 1 .
Since the measure μ is invariant, it is decomposable on a singular basis H b a = { μ 1 , , μ n } , μ = i = 1 n α i μ i , where 0 α i 1 , i = 1 , 2 , , n . Hence, μ = i = 1 n α i μ i i = 1 n μ i = φ . Therefore, μ ( E δ ) φ ( E δ ) δ < 1 . But in the above, we obtained that μ ( E δ ) = 1 . From this contradiction, we can see that the condition ( ) implies the fulfillment of condition ( D ) for the measure φ = μ 1 + μ 2 + + μ n . The first part of the theorem is proved.
Now let us prove that the fulfillment of condition ( D ) implies the fulfillment of condition ( ) (necessity).
Let condition ( D ) be satisfied for some φ c a ( X , Σ ) , φ 0 , ε > 0 and m 1 ( m N ) .
Assume that condition ( ) is not satisfied. Then the Markov chain has invariant purely finitely additive measure λ S p f a , λ = A λ .
Any purely finitely additive measure is disjoint with any countably additive measure ([12], Theorem 1.16), whence λ φ = 0 . Then, by another theorem ([12], Theorem 1.19), for any number ε > 0 (and hence for our ε from the “triple” ( φ , ε , m )) there is a set G Σ such that φ ( G ) < ε and λ ( X G ) = 0 .
Hence, λ ( G ) = 1 .
Now we use Lemma 1. Let us make the following transformations and estimates:
1 sup x X p m ( x , G ) = sup η S b a A m η ( G ) A m λ ( G ) = λ ( G ) = 1 .
Hence,
sup x X p m ( x , G ) = 1 .
Therefore, there is at least one point x 0 X such that
1 = sup x X p m ( x , G ) p m ( x 0 , G ) > 1 ε 2 > 1 ε .
But by the condition ( D ) , which we assume to be true, there must be p m ( x , G ) 1 ε for all x X , including for x = x 0 . The resulting contradiction proves that condition ( ) is also satisfied. The theorem is proved. □
We introduce one more condition:
( ˜ ) Δ p f a = .
The fulfillment of the condition ( ˜ ) means that the Markov chain has no invariant purely finitely additive probability measures. Then all its invariant finitely additive measures (and there are always such) are countably additive, i.e., condition ( ) is satisfied.
Obviously, the converse is also true. Thus, we obtain the next corollary.
Corollary 1. 
For an arbitrary Markov chain, the conditions ( ) and ( ˜ ) are equivalent.
From the proved Theorem 7 and from Theorem 4, as a consequence, we obtain the following promised and psychologically important statement for us.
Theorem 8. 
For an arbitrary general Markov chain, the Doeblin condition ( D ) is equivalent to the condition ( D ˜ ) .
Remark 3. 
Obviously, the condition ( ) is satisfied if and only if Δ b a = Δ c a .
Now, let us collect all the results obtained in this section into one statement.
Proposition 1. 
For any general Markov chain on an arbitrary measurable space (X,Σ), the following conditions are equivalent:
(i) 
Doeblin condition (D) for the original Markov chain.
(ii) 
Doeblin condition ( D ˜ ) for a finitely averaged Markov chain.
(iii) 
Condition (∗): all invariant finitely additive probability measures of a Markov chain (for a Markov operator T * ) are countably additive.
(iv) 
Condition ( ˜ ): The Markov chain (the Markov operator T * ) has no invariant purely finitely additive probability measures.
(v) 
Condition: Δ b a = Δ c a .

6. Dimension of the Set of Invariant Measures

Theorem 7 and other results obtained above show that invariant finitely additive measures play a big role in the ergodic theory of Markov chains. However, this picture will be incomplete if we do not point to our already published results on the dimension of spaces of invariant measures. We present here two of our theorems on the dimension of spaces of invariant measures with new additions.
In one paper [1], the following statement was proved.
Theorem 9 
(see [1], Theorem 8.2). For an arbitrary Markov chain, if condition ( ) is satisfied, i.e., if Δ b a c a ( X , Σ ) , then the following condition is also satisfied
( ) d i m Δ b a = n , where 1 n < .
Corollary 2. 
For an arbitrary Markov chain, if the condition ( D ) or the condition ( D ˜ ) is satisfied, then the condition ( ) is also satisfied.
Corollary 3. 
For an arbitrary Markov chain, if the condition ( ) is satisfied, then
d i m Δ b a = d i m Δ c a = n , where 1 n < .
An intuitive assumption arose that the statement of Theorem 9 can be reversed. In the same paper, such a reversal was proved, but only for the case of dimension n = 1 . We present this theorem below.
Theorem 10 
(see ([1], (Theorem 8.3)) and ([2], (Theorem 12.3))). Let an arbitrary Markov chain be given on some ( X , Σ ) . If d i m Δ b a = 1 , i.e., if the Markov chain has a unique invariant measure μ in S b a , Δ b a = { μ } , then the condition ( ) is satisfied: Δ b a c a ( X , Σ ) , i.e., the invariant measure μ is countably additive.
We recall that, by Theorem 7, the condition ( ) is equivalent to Doeblin condition ( D ) .
This implies the following theorem.
Theorem 11. 
Let for an arbitrary Markov chain d i m Δ b a = 1 . Then the conditions ( D ) and ( D ˜ ) are satisfied.
Please note that if d i m Δ b a = 1 then the Markov chain can have only one cycle of any dimension consisting of countably additive measures.

7. Discussion

1. In the second part of [7] (1975), M. Lin considered an arbitrary measurable space ( X , Σ ) and investigated the properties of the Markov operator P defined on the space of functions B ( X , Σ ) and the properties of the conjugate operator P * defined on the conjugate space. It should be noted that the symbols B * ( X , Σ ) , b a ( X , Σ ) , and also the isomorphism B * ( X , Σ ) = b a ( X , Σ ) were not used explicitly in the text of article [7].
We are also using this formulation of the problem with the corresponding addition.
In Theorem 5 of [7], a number of assertions under some hard a priori condition of “ergodicity” on the Markov chain are considered. In particular, it is proved that the Doeblin condition (viii) is equivalent to the condition (vi): “The space of P * invariant functionals (i.e., finitely additive measures) is one-dimensional”. But this is true only due to the a priori “ergodicity” condition in ([7], Theorem 5). In the general case, only the finite-dimensionality of the space of invariant finitely additive measures, and hence also of invariant countably additive measures, follows from the Doeblin condition (see, for example, ref. [25] and also our theorems in Section 4 and Section 5).
In the same Theorem 5 from [7] it was proved (under the same a priori ergodicity condition) that the Doeblin condition (viii) and the condition (vi) are equivalent to the condition ( v ) : “Every P * -invariant functionals is a measure” (here it is countably additive measures) (viii).
If we translate these statements into the language we use, then we obtain a special case of our Theorem 7, but for a one-dimensional space of invariant finitely additive measures.
Our Theorems 10 and 11 also generalize the corresponding assertions from ([7], Theorem 5), since they do not assume that the Markov chain is ergodic.
Additionally, we would like to note that during the preparation of our articles [1,2] the author was not familiar with this article by Michael Lin [7]. Therefore, we did not cite this article in our works [1,2]. We apologize to Michael Lin and fill this gap.
2. In the work of S. Horowitz [8] (1972), some problems are solved that are close to our subject, but in a different formulation. Horowitz considered Markov chains defined on a measurable space ( X , Σ , m ) with a given fixed bounded non-negative countably additive measure m. The transition probability of the Markov chain p ( x , · ) is assumed to be absolutely continuous with respect to the measure m for each x X .
The Markov operator P generated by the function p ( x , E ) acts (left and right) in the spaces L 1 ( X , Σ , m ) and in L ( X , Σ , m ) , respectively.
In one paper ([8], Theorem 4.1), it is proved that if the Markov chain is ergodic and conservative (the definitions were given at the beginning of the article [8], we will not repeat them here), then the quasi-compactness of the Markov operator P on space L ( m ) (condition { h } ) is equivalent to the absence of an invariant purely finitely additive measure (condition { a } ) (pure charge) and is equivalent to the condition: the operator P has a unique invariant countably additive measure μ = μ P and μ is a measure equivalent to m (condition { d } ).
Thus, in Theorem 4.1 [8] a particular case of the one-dimensional space of invariant measures was considered. Our Theorem 7 and Corollary 1, as well as Theorems 10 and 11 contain similar statements, but for a different formulation of the problem and for other types of Markov operators.
In particular, in this paper and in our papers [1,2], it is not supposed to specify a priori some fixed countably additive measure to which all other objects are attached.
We also do not assume a priori ergodicity or conservativeness for the considered Markov chain.
It can be said that we consider a more general case of defining a Markov chain than Horowitz [8]. But it is better to talk about the parallel formulation of the problem.
The above Horowitz theorem from [8] is given in transformed form in Revuze’s book (see [6], (Chapter 6, Section 3)) with new additions, conditions, and corollaries.
3. Let us point out the possibilities of applying our main theorems in some physical problems using the article [9] as an example. Its authors develop a special mathematical apparatus, which they call “Thermodynamic Formalism”, to study the corresponding physical processes. In particular, a special classical Markov chain is constructed on some infinite-dimensional topological space, but finitely additive measures (purely finitely additive) are used as invariant measures.
In one paper [9], it is proved that an invariant finitely additive probability measure for a given special Markov chain exists, it is unique, and it does not have a purely finitely additive component. Based on these facts, the authors prove the “asymptotic stability” of the corresponding Markov operators, i.e., the convergence of a Markov sequence of countably additive measures to an invariant countably additive measure in some special metric.
We see here the points of contact of the construction developed in [9] with our results from the articles [1,2] (and from this article), since the main theorems from these articles are proved for any Markov chain on an arbitrary phase space, including on an arbitrary topological space, i.e., they are also applicable to the Markov chain studied in [9].
4. Questions of quasi-compactness of general linear operators are studied in [10]. In one paper ([10], Section 5), the results obtained are applied to Markov chains. In this case, finitely additive measures and the Doeblin condition are used. However, there is no analog of our Theorem 7.

8. Conclusions

Thus, if condition ( ) or condition ( ˜ ) is satisfied for the Markov chain, then Doeblin condition ( D ) is satisfied, and the condition of quasi-compactness ( K ) is also satisfied. Consequently, for such a Markov chain, the corresponding ergodic theorems are valid, including uniform ones. Such theorems are studied well and in detail in the literature in different versions.

Funding

This research was partially funded by Russian Foundation of Basic Research, RFBR project number 20-01-00575-a.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Zhdanok, A.I. Finitely additive measures in the ergodic theory of Markov chains I. Sib. Adv. Math. 2003, 13, 87–125. [Google Scholar]
  2. Zhdanok, A.I. Finitely additive measures in the ergodic theory of Markov chains II. Sib. Adv. Math. 2003, 13, 108–125. [Google Scholar]
  3. Kryloff, N.; Bogoliouboff, N. Sur les probabilités en chaine. C. R. Acad. Sci. Paris 1937, 204, 1386–1388. [Google Scholar]
  4. Kryloff, N.; Bogoliouboff, N. Les propriétés ergodiques des suites de probabilités en chaine. C. R. Acad. Sci. Paris 1937, 204, 1454–1456. [Google Scholar]
  5. Yosida, K.; Kakutani, S. Operator-theoretical treatment of Markoff’s processes and mean ergodic theorem. Ann. Math. Second Ser. 1941, 42, 188–228. [Google Scholar] [CrossRef]
  6. Revuz, D. Markov Chains, 2nd ed.; North-Holland Mathematical Library: Amsterdam, The Netherlands, 1984. [Google Scholar]
  7. Lin, M. Quasi-compactness and uniform ergodicity of Markov operators. Ann. Inst. Henri Poincaré Probab. Stat. 1975, 11, 345–354. [Google Scholar]
  8. Horowitz, S. Transition probabilities and contractions of L. Z. Wahrscheinlichkeitstheorie Verw Geb. 1972, 24, 263–274. [Google Scholar] [CrossRef]
  9. Cioletti, L.; Silva, E.; Stadlbauer, M. Thermodynamic formalism for topological Markov chains on standard Borel spaces. Discret. Contin. Dyn. Syst.-Ser. A 2019, 39, 6277–6298. [Google Scholar] [CrossRef] [Green Version]
  10. Mebarki, L.; Messirdi, B.; Benharrat, M. Quasi-compactness of linear operators on Banach spaces: New properties and application to Markov chains. Asian-Eur. J. Math. 2021, 14, 2150060. [Google Scholar] [CrossRef]
  11. Dunford, N.; Schwartz, J.T. Linear Operatiors, Part I: General Theory; Interscience Publishers, Inc.: New York, NY, USA; London, UK, 1958. [Google Scholar]
  12. Yosida, K.; Hewitt, E. Finitely Additive Measures. Trans. Amer. Math. Soc. 1952, 72, 46–66. [Google Scholar] [CrossRef]
  13. Zhdanok, A.; Khuruma, A. Decomposition of Finitely Additive Markov Chains in Discrete Space. Mathematics 2022, 10, 2083. [Google Scholar] [CrossRef]
  14. Alexandroff, A.D. Additive set-functions in abstract spaces. Mat. Sb. 1941, 9, 563–628. [Google Scholar]
  15. Rao, K.P.S.B.; Rao, M.B. Theory of Charges: A Study of Finitely Additive Measures; Academic Press: London, UK, 1983. [Google Scholar]
  16. Zhdanok, A.I. Cycles in Spaces of Finitely Additive Measures of General Markov Chains. In Recent Developments in Stochastic Methods and Applications. Springer Proceedings in Mathematics Statistics; Shiryaev, A.N., Samouylov, K.E., Kozyrev, D.V., Eds.; Springer: New York, NY, USA, 2021; pp. 131–143. [Google Scholar]
  17. Ramakrishnan, S. Finitely Additive Markov Chains. Trans. Am. Math. Soc. 1981, 265, 247–272. [Google Scholar] [CrossRef]
  18. Šidak, Z. Integral representations for transition probabilities of Markov chains with a general state space. Czechoslov. Math. J. 1962, 12, 492–522. [Google Scholar] [CrossRef]
  19. Krein, M.G.; Rutman, M.A. Linear operators leaving invariant a cone in a Banach space. Uspekhi Mat. Nauk 1948, 3, 3–95. [Google Scholar]
  20. Foguel, S.R. Limit theorems for Markov processes. Trans. Am. Math. Soc. 1966, 121, 200–209. [Google Scholar] [CrossRef]
  21. Foguel, S.R. Existence of Invariant Measures for Markov Processes. Proc. Am. Math. Soc. 1966, 17, 387–389. [Google Scholar] [CrossRef]
  22. Hernández-Lerma, O.; Lasserre, J.B. Markov Chains and Invariant Probabilities; Birkhäuser Verlag: Basel, Switzerland, 2003. [Google Scholar]
  23. Doeblin, W. Sur les propriétés asymptotiques de mouvements régis par certains types de chaines simples. Bull. Math. Soc. Roum. Sci. 1937, 39, 57–115. [Google Scholar]
  24. Doeblin, W. Sur les Propriétés Asymptotiques de Mouvements Régis par Certains Types de Chaines Simples. Ph.D. Thesis, Faculté de Sciences, Université Paris Cité, Paris, France, 1938. [Google Scholar]
  25. Doob, J.L. Stochastic Processes; John Wiley and Sons: New York, NY, USA, 1953. [Google Scholar]
  26. Kelley, J.L. General Topology; Van Nostrand Comp., Inc.: Princeton, NJ, USA, 1957. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhdanok, A. Invariant Finitely Additive Measures for General Markov Chains and the Doeblin Condition. Mathematics 2023, 11, 3388. https://doi.org/10.3390/math11153388

AMA Style

Zhdanok A. Invariant Finitely Additive Measures for General Markov Chains and the Doeblin Condition. Mathematics. 2023; 11(15):3388. https://doi.org/10.3390/math11153388

Chicago/Turabian Style

Zhdanok, Alexander. 2023. "Invariant Finitely Additive Measures for General Markov Chains and the Doeblin Condition" Mathematics 11, no. 15: 3388. https://doi.org/10.3390/math11153388

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop