Next Article in Journal
Using SNAP to Analyze Policy Measures in e-Learning Roadmaps
Previous Article in Journal
A Note on Finite Dimensional Odd Contact Lie Superalgebra in Prime Characteristic
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Describing Conditional Independence Statements Using Undirected Graphs

Statistics Program, Department of Mathematics, Statistics and Physics, College of Arts and Sciences, Qatar University, Doha 2713, Qatar
Axioms 2023, 12(12), 1109; https://doi.org/10.3390/axioms12121109
Submission received: 1 November 2023 / Revised: 28 November 2023 / Accepted: 1 December 2023 / Published: 8 December 2023
(This article belongs to the Special Issue Graphical Models)

Abstract

:
This paper investigates the capability of undirected graphs (UGs) to represent a set of Conditional Independence (CI) statements derived from a given probability distribution of a random vector. While it is established that certain axioms can govern this set, providing sufficient conditions for UGs to capture specific CI statements, our focus is on covariance and concentration graphs. These remain the only known families of UGs capable of describing CI statements. We explore the issue of complete representation of CI statements through their corresponding covariance and concentration graphs. Two parameters are defined, one each from the covariance and concentration graphs, to determine the limitations concerning the cardinality of the conditioning subset that the graph can represent. We establish a relationship between these parameters and the cardinality of the separators in each graph, providing a straightforward computational method to evaluate them. In conclusion, we enhance the aforementioned procedure and introduce criteria to ascertain, without additional computations, whether the graphs can fully represent a given set of CI statements. We demonstrate that either the concentration or the covariance graph forms a cycle, and when considered in conjunction, they can represent the entire relation. These criteria also enable us, in specific cases, to deduce the covariance graph from the concentration graph and vice versa.
MSC:
62H05; 05C90; 60E05; 62H20; 05C75

1. Introduction

In the realm of statistics and graphical estimation, conditional independence is fundamental in simplifying complex multivariate relationships and plays a crucial role in the construction and interpretation of graphical models, including Bayesian networks and Markov random fields. These models, which are pivotal in fields such as bioinformatics, epidemiology, and machine learning, leverage conditional independence to efficiently represent variable dependencies, thus enabling a more intuitive understanding of the underlying data structure and facilitating model simplification by reducing the number of parameters required (see [1,2,3]). This approach is particularly valuable in high-dimensional data analysis for identifying relevant variables and constructing sparse models, essential for effective prediction and inference. In the estimation of these models, learning conditional independence relationships directly from data is a key aspect, especially in high-dimensional settings with complex mean structures and random effects, where constraint-based methods are employed to estimate these relationships and form Bayesian networks [4]. The use of Bayesian networks in analyzing gene expression data underscores the connection between these networks and the concept of direct causal influence, characterized by probabilities and conditional independence statements [5]. Furthermore, in epidemiology, Bayesian network modeling is recognized as a well-suited approach for studying messy and highly correlated datasets typical in systems epidemiology, while the application of probabilistic graphical models, such as conditional random fields, hypergraph convolution networks, and large margin models, highlights the prevalence of these methods in learning with graphical structure [6,7,8].
This paper uses undirected graphs as a mathematical framework for representing sets of Conditional Independence (CI) statements. These graphs, commonly called graphical models, have been extensively studied in [9,10,11]. A central question in this field is whether a simple mathematical surrogate, like a graph, can effectively represent most of the CI statements associated with a random vector without requiring additional computations. Given the simplicity and mathematical rigor of graphs (see [12]), they appear to be suitable candidates for this role.
Graphical models are constructed according to specific rules, associating them with a probability distribution. These models are derived from a subset of the CI statements and can subsequently be used to infer additional CI statements within the distribution. To formalize this, consider a finite set V and denote u v as the pair ( u , v ) of two elements in V = V × V { ( u , u ) V × V ; u V } .
Given a random vector X V = ( X v , v V ) indexed by V and associated with a probability distribution P, we can define an undirected graph G = ( V , E ) based on P and a fixed family P of subsets of V. Here, V serves as the set of vertices, and the set of edges E is a subset of V as defined in Equation (1).
E V = V × V { ( u , u ) V × V ; u V }
The graph G satisfies the following condition:
u v E S P   such   that   u v S
Here, u v S is shorthand for the statement “ X u is independent of X v given the random vector X S indexed by S, i.e., X S = ( X s , s S ) ”. When this property (2) is met, we say that P is pairwise Markov to G according to P .
This paper focuses on two well-known types of undirected graphical models. The first type is the concentration graph, denoted by G = ( V , E ) , which is constructed from P when the family P = P | V | 2 = { S V   such   that   | S | = | V | 2 } . The second type is the covariance graph, denoted by H = ( V , F ) , and is constructed from P when the family P = P 0 = { } .
Under certain conditions satisfied by the probability distribution P [10,13,14,15], both types of graphs can be used to infer a wide range of CI statements, although not exhaustively. Specifically, for a triplet ( A , B , C ) of pairwise disjoint subsets of V, if A is separated from B by C in G, then A B C . In this context, P is said to be globally Markov to G.
To formalize, consider the following sets of triplets ( A , B , S ) of pairwise disjoint subsets of V:
S ( G ) = ( A , B , S ) S   separates   A   and   B   in   G
and
C ( P ) = ( A , B , S ) A B S .
P is globally Markov to G if and only if S ( G ) C ( P ) .
Similarly, for the covariance graph H, if A and B are separated by V ( A B C ) in H, then A B C in P (see [15,16,17]). In this case, P is also considered globally Markov to H. Therefore, consider the following set:
S ¯ ( H ) = ( A , B , S ) V ( A B S )   separates   A   and   B   in   H ,
P is globally Markov to H if and only if S ¯ ( H ) C ( P ) .
In this paper, we aim to identify the families of probability distributions that the concentration and covariance graphs can fully represent. The central question is to ascertain the conditions on P such that
C ( P ) = S ( G ) S ¯ ( H )
where G and H are the concentration and covariance graphs associated with P, respectively.
It has been previously established in [18] that when the CI statements are fully captured by the concentration graph, i.e., S ( G ) = C ( P ) , the covariance graph must be a union of complete connected graphs. In such cases, P is said to be faithful to its concentration graph. Conversely, using a dual argument, it was shown that if the covariance graph can fully represent the CI statements, i.e., S ¯ ( H ) = C ( P ) , then the concentration graph must also be a union of complete connected graphs. Under these conditions, P is faithful to its covariance graph (see [19]).
In [19], the authors provided a sufficient condition for P to be faithful to its concentration graph. They demonstrated that if all connected components of a concentration graph are trees and P satisfies certain conditions, then the graph can fully represent the CI statements. This result holds when all connected components of the covariance graph are also trees. In the Gaussian case, it was further established in [20] and generalized in [17] that these graphs represent all sets of CI statements.
In this paper, we investigate whether the concentration and covariance graphs, when considered together, can fully describe a set of Conditional Independence (CI) statements. We introduce the term CC-faithful to describe a probability distribution P that can be completely represented by these graphs when considered in tandem. While it is unlikely that this holds universally, we aim to identify necessary conditions for P to be CC-faithful.
Our first main result reveals that the conditions depend on the size of the separators in both the covariance and concentration graphs. We then proceed to identify families of probability distributions that are CC-faithful, working within a general framework that utilizes the set of relations previously defined in [14,21].
We show that the study of CI statements can be restricted to those of the form u v S , where u , v V and S V { u , v } . These CI statements, denoted by ( u v S ) , are termed couples. We then consider the set of such couples as a relation on V.
We demonstrate that when a given relation L satisfies an axiom known as the pseudographoid axiom, a global Markov property holds. This implies that the set of relations constructed from the covariance and concentration graphs are subsets of L . We also establish a one-to-one mapping between the covariance and concentration graphs, simplifying many proofs. For instance, we show that less restrictive conditions are needed for the global Markov property on the covariance graph, contrary to the assumptions made in [17].
In the final section, we introduce a graphical criterion for identifying new dependencies from the covariance and/or concentration graphs, extending the work in [17]. We provide examples of CC-faithful probability distributions, showing that when either the covariance or concentration graph forms a cycle, P is CC-faithful.
The paper is organized as follows: Section 2 introduces the notations and is divided into two subsections focusing on basic graph theory and properties of relations, respectively. Section 3 discusses how relations derived from the concentration and covariance graphs are subsets of the originating relation, provided they satisfy the pseudographoid axiom. Section 4 introduces separatoids, a generalization of separators. Section 5 examines the role of separator size in the ability of the graphs to describe CI statements. Finally, Section 6 presents a graphical criterion for reading dependencies and provides examples of CC-faithful distributions.

2. Preliminaries

In this section, we delineate the essential notations utilized throughout this paper. It is systematically segmented into two subsections: the first focuses on the characteristics and nuances of undirected graphs, while the second meticulously details the assembly of relations, representing a comprehensive method to articulate the set of Conditional Independence (CI) statements within a specified probability distribution.

2.1. Undirected Graphs

An undirected graph G = ( V , E ) consists of sets V representing the set of vertices and E V where V is the set defined in (1).
u v E v u E
For u , v V , we write u G v when u v E and we say that u and v are adjacent in G.
Let us first recall some basic definitions of undirected graphs.
  • A path connecting two distinct vertices u and v in a graph G is a sequence of distinct vertices u 0 , , u n , where u 0 = u and u n = v and, for each i { 0 , , n 1 } , u i G u i + 1 . Such a path is denoted by p = p ( u , v , G ) , establishing that p ( u , v , G ) connects u 0 and u n . Alternatively, we can state that u 0 and u n are connected by p ( u , v , G ) . The length of this path, denoted by | p ( u , v , G ) | , is defined as the number of edges connecting the vertices within p, | p ( u , v , G ) | = n . Moreover, we denote by P ( u , v , G ) the set of all possible paths between u and v. Furthermore, we define two paths p 1 and p 2 in P ( u , v , G ) to be disjoint if their intersection is u , v , clearly indicating that u and v are the sole common vertices between p 1 and p 2 .
  • A subgraph of G induced graph by a subset U V is denoted by G U = ( U , E U ) , U V and E U = E U × U
  • A connected component of G is the largest subgraph G U = ( U , E U ) of G such that each pair of vertices can be connected by at least one path in G U . Equivalently we can define G U to be a connected component of G if and only if G U satisfies the following two conditions
    i.
    u v U × U , P ( u , v , G U ) .
    ii.
    u U w V U , P ( u , w , G ) = .
    It is straightforward to see that the two definitions above are equivalent.
  • For a connected graph, a separator is a subset S of V such that there exists a pair of non-adjacent vertices u and v such that u , v S and
    p P ( u , v , G ) ,   p S
    In that case, we write u G v S . It is also easily verified that every S S such that S V { u , v } is also a separator. We are thus led to the notion of minimal separator. The separator S is defined to be a minimal separator between two non-adjacent vertices u and v if for any w S , the subsets S { w } is not a separator of u and v.
  • For any triplet ( A , B , S ) of pairwise disjoint subsets of V, we say that S separates A and B in G and we denote by A G B S if for any pair of vertices u v A × B we have u G v S .

2.2. The Set of Relations

Let V be a finite set. We don’t distinguish in the next of this paper between singletons and elements and we denote by A B a union of two subsets A and B of V. We call a couple an object ( u v S ) where u and v are two distincts elements of V and S V u v . We denote by C ( V ) the set of all the couples
C ( V ) = ( u v S )   where   u v V × V , u v   and   S V u v ,
where V u v = V { u , v } .
Any subset of C ( V ) will be called a relation, and it will be generally denoted by L . We associate a given relation L the set V ( L ) the of subsets of V defined as follows
V ( L ) = S V   such   that     u v V   and   ( u v S ) L
Let us now give three examples of well-known relations. We will see later that these relations satisfy the pseudographoid axiom.
Example 1.
Let X V = ( X v , v V ) be a random vector with a probability distribution P. Let us define the relation c ( P ) as follows
c ( P ) = ( u v S ) C ( V )   s u c h   t h a t   u v S
where u v S denotes that the variables X u and X v are independent given the sub-random vector X S indexed by S.
Example 2.
Let G = ( V , E ) be an undirected graph and where we denote by u G v S when the set S V u v separates u and v in G.
s ( G ) = ( u v S ) C ( V )   s u c h   t h a t   u G v S
Let us notice the following remarks
i.
G is complete if and only if s ( G ) = .
ii.
G is connected if and only if V ( s ( G ) ) , where V ( s ( G ) ) is the set associated with s ( G ) according to (6).
iii.
G has n connected components U 1 , , U n then s ( G ) = i = 1 n s ( G U i ) s ( G ) where for i = 1 n the graph G U i is the subgraph induced from U i on G and s is the relation defined as follows
( u v S ) s ( G ) S =   a n d   i j { 1 , , n } s . t .   u U i   a n d   v U j .
Example 3.
Let us consider Σ a positive definite matrix, i.e., Σ = σ u v u , v V . We denote by Σ u S , v S the sub-matrix of Σ with rows in u S and columns in v S . We denote then the following relation
d ( Σ ) = ( u v S ) C ( V )   s u c h   t h a t   d e t ( Σ u S , v S ) = 0
Note that if Σ is considered as the covariance matrix of the probability distribution P then the relations d ( Σ ) and c ( P ) defined respectively (9) and in (7) are equal.
We also know that from any probability distribution P we can define two undirected graphs called covariance and concentration graphs. We will show in the next section that the set of relations constructed from these two graphs is included in the set of relations composed from the probability distribution. We will prove this result remains true for any relation satisfying an axiom called the pseudographoid relation. This fact is known for concentration graphs (see [14]). However, by using a dual relationship between the covariance and the concentration graph, we show that this inclusion can also be satisfied by covariance graphs, unlike what is proved in [17] where the author assumes a further axiom called the weak transitivity axiom.

3. Markov Relations

Let us denote by R ( V ) the set of relations defined on V and we define on R ( V ) the application T : R ( V ) R ( V ) such that the relation T ( L ) is defined as follows
T ( L ) = ( u v S )   such   that   ( u v V S u v ) L .
The relation T ( L ) will be called the dual of L . It is easily seen that for any L R ( V ) we have T ( T ( L ) ) = L , i.e., T T = id R ( V ) . In this section and in the next of this paper, we say that L satisfies the pseudographoid axiom if the following axiom is satisfied: for any { u , v , w } V and S V u v w the following axiom
{ ( u v S w ) , ( u w S v ) } L   then   { ( u v S ) , ( u w S ) } L
It was proved in [21] that all the relations c ( P ) , s ( G ) and d ( Σ ) defined in the Examples 1–3 satisfy the pseudographoid axiom (11).
Lemma 1.
Let L and L be two relations in R ( V ) . The following assertions are then satisfied
i.
If L is satisfying (11) then T ( L ) is a pseudographoid too.
ii.
If L L then T ( L ) T ( L ) .
Proof. 
The proof of (i) can be found in [21] (see Lemma 2).
Let us now prove (ii). We have
( u v S ) T ( L ) ( u v V S u v ) L L ( u v V S u v ) L ( u v S ) T ( L )
Let us now show how to associate to any relation L and a fixed set P of subsets of V, i.e.,
P = S , S V .
We will see later two kinds of such graphs constructed using pairwise relationships but their role will be larger. Indeed, they will help us determine all the elements of L from reading separation statements on those graphs. Let us consider the following application G : R ( V ) × P 2 ( V ) G ( V ) such that if G = G ( L , P ) then
u G v S P   such   that   ( u v S ) L .
We denote here by P 2 ( V ) the power set of the power set P ( V ) of V, i.e.,
P 2 ( V ) = { P   such   that   P P ( V ) } .
Let us fix k { 0 , , | V | 2 } and consider the following family of subsets of V, P k P 2 ( V ) , i.e.,
P k = { S V   such   that   | S | = k }
and we consider then the following pair of undirected graphs ( G k , H k ) defined as follows
G k = G ( L , P k )   and   H k = G ( L , P | V | k 2 )
We are now interested in the special case where k = | V | 2
Case 1
G = G | V | 2 = G ( L , P | V | 2 ) where P | V | 2 = { S V   such   that   | S | = | V | 2 } we say that G is the concentration graph associated with L .
Case 2
H = H | V | 2 = G ( L , P 0 ) where P 0 = { } . We say that H is the covariance graph associated with L .
Instead of considering the application G ( · , · ) we will rather consider the following two applications g and h and defined as follows
g = G ( · , P | V | 2 ) and h = G ( · , P 0 )
The applications g and h will be called respectively that concentration and the covariance map. It is easily seen that g = h T and h = g T . It is easily seen then that G can be viewed as the covariance graph of the dual of L , i.e., T ( L ) and vis versa for the graph H.
This section aims to show that a part of the element of a given relation can be described using the concentration and the covariance graphs. The theorem that will be proved below is a generalization of a result well known as the global Markov property to a distribution probability to its covariance or concentration graph.
Theorem 1.
Let L R ( V ) a pseudographoid. Let G = g ( L ) and H = h ( L ) be respectively the concentration and the covariance graphs associated with L . Then if L is a pseudographoid then
s ( G ) L
and
T ( s ( H ) ) L
Before we begin the proof of Theorem 1, it is important to note that the equalities s ( G ) L and T ( s ( H ) ) L may occur. These are related to the concept of faithfulness of the graphical structure to the pseudographoid structure. The definition of faithfulness will be discussed later in Definition 3.
Proof. 
Let us first proof (16). The proof below can also be found in [10,13,21].
Let then ( u v S ) S ( G ) , i.e., u G v S and let us show that ( u v S ) L . Let us prove this by backward induction on n = | S | . If n = | V | 2 there is nothing to prove.
Assume now that | S | = n < | V | 2 and that relations in s ( G ) are contained in L when | S | > n .
Since S u v V , let w V S u v . Hence ( u v S w ) s ( G ) and by induction we can say that ( u v S w ) L . Further more since { ( u v S ) , ( u v S w ) } s ( G ) then either ( u w S v ) s ( G ) or ( v w S u ) s ( G ) . Otherwise, a path exists between u and w that does not intersect S v , and another one between v and w that does not intersect S u . Hence we can construct a path between u and v that contains w and does not intersect S. It is a contradiction with ( u v S ) s ( G ) .
Let us then assume that ( u w S v ) s ( G ) . Hence ( u w S v ) L . Since L is a pseudographoid we can deduce from { ( u v S w ) , ( u w S v ) } L that ( u v S ) L .
To prove (17) it will be enough to apply (16) to the relation T ( L ) . According to Lemma 1 the relation T ( L ) satisfyies the pseudographoid axiom (see (11)). Then
s ( g ( T ( L ) ) T ( L ) s ( h ( L ) ) T ( L ) ( 17 ) is   satisfied
We can deduce then from Theorem 1 that if L is a pseudographoid the following inclusion holds
s ( G ) T ( s ( H ) ) L
Since it is well known that the relation defined from a given probability distribution P as in Example 1 is also satisfying the pseudographoid axiom (11). Let us then define the CC-faithful probability distributions.
Definition 1.
Let P be a probability distribution of a random vector X = ( X v , v V ) and let
i.
c ( P ) be the relation associated to P according to (7)
ii.
G = g ( c ( P ) ) be the concentration associated with P.
iii.
H = h ( c ( P ) ) be the covariance associated with P.
We say that P is a CC-faithful if c ( P ) = s ( G ) T ( s ( H ) ) .
Our main question now for the next of this paper is to determine which conditions should be satisfied by the undirected graph G and/or H to obtain the equality in (18)?

4. Separatoids on L

Let L be a non empty relation in R ( V ) and let us denote by V ( L ) the subset of pairs u v belonging in V × V such that u v and there exists at least one S V u v such that ( u v S ) L . Let us consider the following definition
Definition 2.
Let u and v be two distincts elements of V. Let S and T be two subsets of V u v such that ( u v S ) and ( u v T ) in L . We say that
i.
S is a minimum separatoid between u and v if either S = or w S , ( u v S w ) L .
ii.
T is a maximum separatoid between u and v if either T = V u v or w V T u v , ( u v T w ) L .
Let us then denote by S ( u , v , L ) and by T ( u , v , L ) be respectively the set of all the minimal and maximal separatoids defined between u and v. We define the following two parameters
s ( u , v , L ) = min S S ( u , v , L ) | S |
and
t ( u , v , L ) = max T T ( u , v , L ) | T |
Lemma 2.
Let L R ( V ) and let u v V and S ( u , v , L ) and S ( u , v , T ( L ) ) be the set of minimal separatoids associated respectively to L and T ( L ) . Then
S S ( u , v , L ) V S u v T ( u , v , T ( L ) ) .
Furthermore, we have
s ( u , v , L ) + t ( u , v , T ( L ) ) = | V | 2 .
Proof. 
Let S S ( u , v , L ) and let us prove that T = V S u v T ( u , v , T ( L ) ) . It is equivalent to show the following statements
i.
( u v T ) T ( L )
ii.
either T = V u v or w V T u v , ( u v T w ) L
Since ( u v S ) L , then ( u v V S u v ) T ( L ) . Hence ( u v T ) = ( u v V S u v ) T ( L ) . Then (i) is proved.
Let us show (ii). If S = , then T = V u v . Let w V T u v we know that ( u v S w ) L . Hence ( u v V ( ( S w ) u v ) T ( L ) since V ( ( S w ) u v ) = ( V S u v ) w = T w . Hence, (ii) is completely proved.
We can use the same argument to show the other way in the equivalence (21).
Let us now prove the equality (22). Let s = s ( u , v , L ) and t = t ( u , v , T ( L ) ) . let us show that s + t = | V | 2 .
Let S S ( u , v , L ) , then | S | s . Since V S u v T ( u , v , T ( L ) ) , then | V S u v | t . Hence | S | | V | t 2 . Then s | V | t 2 and s + t | V | 2 .
Let now T T ( u , v , T ( L ) ) , then | T | t . Since V T u v S ( u , v , L ) , then | V T u v | s . Hence | T | | V | s 2 . Then t | V | s 2 and t + s | V | 2 .
We deduce then that s + t = | V | 2 . □
Let us consider the case of a relation a type s ( G ) where G is an undirected graph. Since ( u v S ) belongs to S ( G ) means that S separates u and v in G then for any w V S u v we have also ( u v S w ) s ( G ) . Let us recall that if S separates u and v in G, then any subset containing S is also a separator of u and v in G. Then it is easily seen that the set T ( u , v , s ( G ) ) = { V u v } and consequently by applying Lemma 2 we deduce that t ( u , v , s ( G ) ) = | V | 2 , S ( u , v , T ( s ( G ) ) ) = { } and s ( u , v , T ( s ( G ) ) ) = 0 .
We have then proved the lemma below.
Lemma 3.
Let G be an undirected graph and let s ( G ) be the relation associated with G according to (2). Then T ( u , v , s ( G ) ) = { V u v } , S ( u , v , T ( s ( G ) ) ) = { } , t ( u , v , s ( G ) ) = | V | 2 and s ( u , v , T ( s ( G ) ) ) = 0 .
Let us define then the separability of an undirected graph.
Definition 3.
Let G = ( V , E ) be an undirected graph. If G is not complete, we define for any non adjacent pair of vertices u v in G the separability order of u v as
so ( u , v , G ) = s ( u , v , s ( G ) )
we define the separability of G by
so ( G ) = max u G v so ( u , v , G )
if G is not-complete and equal to + is G is complete.
In the following section, we will explore how understanding the separability order of the covariance graph aids in determining the capacity of the concentration graph to represent the elements of L , and vice versa. We will revisit a finding from a previous paper (see [18]), which elucidates the effects on the covariance graph when the concentration graph fully represents L . This was initially demonstrated for concentration and covariance graphs under specific conditions in [18]. Here, we expand upon this result under much broader conditions.
However, let’s make some remarks about the parameters introduced in Definition 3. Indeed, this parameter was defined and used in [22]. But determining the value of so ( u , v , G ) for a given non-adjacent vertices are known to be called Menger’s problem (see [23]), and it was proved in many references (see [24,25,26]), and many others) this number equals the maximal number of disjoint paths between u and v.
Hence if u and v are note belonging to the same connected component of G, then so ( u , v , G ) = 0 . Furthermore, so ( G ) = 0 if and only if G comprises complete connected components and if so ( G ) = + if and only if G is complete.

5. Ability of the Concentration and the Covariance Graphs to Represent L

In this section, we are interested in showing how much-undirected graphs can describe a pseudographoid relation L . We have already seen in Theorem 1 that the concentration graph G = g ( L ) and the covariance graph H = h ( L ) are both of them able to determine elements of L by reading separation statements on G and H. Our main question in this section is which of the elements of L can be missed by G and H. We will show that the couples ( u v S ) L that G can miss having necessary an S with a cardinality smaller than a number depending on the separability order of H. By using a “dual technique”, we show that the couples ( u v S ) L that H can miss having necessary an S with a cardinality smaller than a number depending on the separability order of G.
Before giving the proof of the main result of this section, we would like to introduce some notations. Let us recall that by the construction of G, the following equivalence is satisfied for any pair u v of non-adjacent vertices in G
( u v V u v ) s ( G ) ( u v V u v ) L
Hence if we denote for any l { 0 , , | V | 2 } an application ϕ l : R ( V ) R ( V ) such that for any relation L
ϕ l ( L ) = ( u v S ) L   such   that   | S | = l
From the equivalence in (25) it can be easily deduced that ϕ | V | 2 ( L ) = ϕ | V | 2 ( s ( G ) ) . Let us thus define the integer μ = μ ( L ) as the smallest integer such that
ϕ l ( L ) = ϕ l ( s ( G ) )
is satisfied. Hence we deduce that when μ is known, the undirected graph G allows us to determine all the couples ( u v S ) of L where | S | μ .
Similarly and by construction of H, the following equivalence is satisfied for any pair u v of non-adjacent vertices in G
( u v ) T ( s ( H ) ) ( u v ) L
Hence the equivalence in (28) can be written ϕ 0 ( L ) = ϕ 0 ( T ( s ( H ) ) ) . We can also define the integer ν = ν ( L ) as the largest integer such that
ϕ l ( L ) = ϕ l ( T ( s ( H ) ) )
Hence we deduce that when ν is known, the undirected graph H allows us to determine the couples ( u v S ) of L where | S | ν .
The two lemma below help us to give a relationship between μ ( L ) and ν ( T ( L ) ) .
Lemma 4.
Let L be a relation in R ( V ) and ϕ l an application defined as in (26) where l { 0 , , | V | 2 } . Then
T ϕ l = ϕ | V | l 2 T .
Proof. 
Let L L ,
T ϕ l ( L ) = ( u v S )   such   that ( u v V S u v ) ϕ l ( L ) = ( u v S )   such   that   ( u v V S u v ) L   and   | V S u v | = l = ( u v S )   such   that   ( u v S ) T ( L )   and   | V | | S | 2 = l = ( u v S )   such   that   ( u v S ) T ( L )   and   | S | = | V | l 2 = ϕ | V | l 2 ( T ( L ) )
Lemma 5.
Let L R ( V ) a pseudographoid and let G and H be, respectively the concentration and the covariance graph associated to L . Then
μ ( L ) = | V | ν ( T ( L ) ) 2
Proof. 
Let l μ ( L ) , then
ϕ l ( L ) = ϕ l ( s ( G ) ) .
But ϕ l ( L ) = ϕ | V | l 2 ( T ( L ) ) according to Lemma 4 and to the fact that G is the covariance graph associated with T ( L ) . Then
ϕ | V | l 2 ( T ( L ) ) = ϕ l ( s ( h ( T ( L ) ) ) ) = ϕ | V | l 2 ( T ( s ( h ( T ( L ) ) ) ) )
Hence if we denote L = T ( L ) and, we have
ϕ | V | l 2 ( L ) = ϕ | V | l 2 ( T ( s ( h ( L ) ) ) )
Hence | V | l 2 ν ( L ) l | V | ν ( L ) 2 . Since μ is the smallest l satisfying (30) then
| V | ν ( L ) 2 μ ( L ) ν ( L ) | V | μ ( L ) 2 .
Similarly if we consider l ν ( L ) and let us write H = h ( L ) = G which is the covariance graph associated to L . Then
ϕ l ( L ) = ϕ l ( T ( s ( H ) ) ) .
But ϕ l ( L ) = ϕ | V | l 2 ( L ) and
ϕ l ( T ( s ( H ) ) ) = ϕ | V | l 2 ( s ( H ) ) = ϕ | V | l 2 ( s ( G ) )
Hence (32) becomes
ϕ | V | l 2 ( L ) = ϕ | V | l 2 ( s ( G ) ) .
This implies that | V | l 2 μ ( L ) l | V | μ ( L ) 2 . Since ν ( L ) is the largest integer. Then
ν ( L ) | V | μ ( L ) 2
Hence the Lemma is proved by combining (31) and (33). □
Let us then prove the main result of this section.
Theorem 2.
Let L R ( V ) be a pseudographoid. Let G = g ( L ) and H = h ( L ) be respectively the concentration and the covariance graph associated with L . Let μ = μ ( L ) , ν = μ ( L ) , s G = so ( G ) and s H = so ( H ) . Assume that G and H are connected. Then, the following two inequalities hold
μ | V | s H 1
ν s G 1
Before starting the proof of Theorem 2, we must first prove the following lemma.
Lemma 6.
Let L R ( V ) be a pseudographoid. Let G = g ( L ) and H = h ( L ) be the concentration and the covariance graph associated with L . Let μ = μ ( L ) and s H = so ( H ) . Assume that
so ( H ) | V | μ ( L ) 2
then E F and for any u v F there exists S V u v such that ( u v S ) s ( H ) and ( u v V S u v ) s ( G ) .
Proof. 
Let us assume that (36) holds. Since μ ( L ) is an integer that belongs to { 0 , , | V | 2 } then (36) implies that H is necessarily not complete and the set of pairs on non-adjacent vertices in H V F is not empty. Let u v be a pair of non-adjacent vertices in H, i.e., u v F . According to (36) there exists a subset S V such that ( u v S ) s ( H ) and | S | | V | μ ( L ) 2 .
We can also write S as follows
S = V ( T u v )
where T = V S u v . By applying the Markov property (17) in Theorem 1, we deduce that ( u v T ) L . Secondly as | S | | V | μ 2 and | T | = | V | | S | 2 , then | T | = l μ and by using the definition of μ we deduce that ( u v T ) ϕ l ( L ) = ϕ l ( s ( G ) ) ) . Hence ( u v T ) s ( G ) . So u G v T and then u G v .
We can deduce then that E F .
We have also proved that for any non-adjacent pair of vertices u v in H we can find a subset S V u v such that ( u v S ) s ( H ) and ( u v V S u v ) s ( G ) . □
Lemma 7.
Let L R ( V ) be a pseudographoid. Let G = g ( L ) and H = h ( L ) be respectively the concentration and the covariance graph associated with L . Let ν = μ ( L ) and s G = so ( G ) . Assume that
s G ν
then F E and for any u v E there exists S V u v such that ( u v S ) s ( G ) and ( u v V S u v ) s ( H ) .
Proof. 
We will show that the proof of Lemma 7 can be obtained by Lemma 6 to L = T ( L ) . Let us first assume that (36) holds.
Let us remaind that g ( L ) = g ( T ( L ) ) = h ( L ) . By using also Lemma 5 we have μ ( L ) = | V | μ ( L ) 2 . Hence
so ( G ) ν ( L ) so ( h ( L ) ) ν ( T ( L ) ) so ( h ( L ) ) | V | μ ( L ) 2
Hence (36) holds for L . By applying Lemma 6 for L we deduce that H = g ( L ) is an undirected graph where the set of edges F is included in E the set of edges of G = h ( L ) . The remaining conclusion of the lemma and can be obtained obviously. □
Proof of Theorem 2.
Note that if G and/or H are complete, there is nothing to prove since in that case, the separability order is equal to + , and the inequalities remain satisfied in that case.
Let us first prove that (34) and (35) are equivalent.
Assume that (34) is true for any relation L . Since for any pseudographoid L , the dual of L which is T ( L ) is also a pseudographoid and Hence it satisfies (34):
μ ( T ( L ) ) | V | so ( h ( T ( L ) ) 1
Bu using Lemma 5 we deduce that
= μ ( T ( L ) ) = | V | ν ( T ( T ( L ) ) 2 = | V | ν 2
Then by using (40) and as h T = g the inequality (39) becomes
| V | ν 2 | V | so ( g ( L ) ) 1 = | V | so ( G ) 1 = | V | s 1 ν s 1
Hence we obtain (35). We have proved that (39) ⇒ (40). We similarly also prove that (40) ⇒ (39).
Let us now assume that (34) does not hold. Then by equivalence (35) does not hold. We can then assume that (36) and (38). Note that once (36) is satisfied by equivalence (38) is also satisfied. These former inequalities are the principal hypothesis in Lemmas 6 and 7. Let us then apply these lemmas. We deduce that H = G and for any non-adjacent pair of vertices u v in G there exists S V u v where ( u v S ) and ( u v V S u v ) are in S ( G ) and where neither S is empty nor V S u v .
Hence we can find four vertices u 1 , u 2 , u 3 and u 4 in G such that u 1 G G u 2 G u 3 G u 4 where u 2 S , u 3 V S u 1 u 4 and u 1 G u 4 . The vertices u 1 and u 3 are also not adjacent in G. Otherwise, a path exists between u 1 and u 4 which is u 1 G u 3 u 4 that does not intersect S. Similarly, we can say that u 2 and u 4 are not adjacent in G. Hence, ( u 1 , u 3 ) is a pair of non-adjacent vertices in G such that u 1 G u 2 G u 3 is a path connecting these two vertices. Hence any subset T separating u 1 and u 3 contains u 2 . However, it will not be possible to V T u v to separate u 1 and u 3 in G. We obtain then a contradiction. □
The second part now of this section is devoted to device an algorithm allowing us to compute the parameters μ and ν for a giving L . It will then be an algorithm that may help us to verify if a given L is at the same satisfying the pseudographoid axiom.
The following result concerns relations that satisfy the following two axioms:
  • the semigraphoid axiom: for any triplet ( u , v , w ) of distinct elements of V and S V u v w such that
    { ( u w S ) , ( u v S w ) } L { ( u v S ) , ( u w S v ) } L .
  • the weak transitive axiom: for any triplet ( u , v , w ) of distinct elements of V and S V u v w such that
    { ( u v S ) , ( u v S w ) } L ( u w S ) L   or   ( v w S ) L
Lemma 8.
Let L R ( V ) be a relation satisfying the axioms (11), (42) and (41). Let l { 0 , , | V | 2 } . Let ( u v S ) L . Then the following assertions hold
i.
Assume that ϕ l + 1 ( L ) = ϕ l + 1 ( s ( G ) ) and | S | = l . Then
( u v S ) s ( G ) w V S u v   s u c h   t h a t   ( u v S w ) s ( G ) .
ii.
Assume that ϕ l ( L ) = ϕ l ( T ( s ( H ) ) ) and | S | = l + 1 . Then
( u v S ) T ( s ( H ) ) w S   s u c h   t h a t   ( u v S w ) s ( H ) .
Proof. 
Let us start by proving (i). The left-to-right of the equivalence (43) is indeed obvious. Let us now show the right-to-left way. Hence assume that there exists w V S u v such that ( u v S ) s ( G ) . Since | S w | = l + 1 and ϕ l + 1 ( L ) = ϕ l + 1 ( s ( G ) ) then
( u v S ) L   and   ( u v S w ) L .
Since L satisfies the weak transitivity axiom (42) we deduce that
( u w S ) L   or   ( v w S ) L .
Assume that ( u w S ) L . Since ( u v S w ) L and by applying (41) we deduce that ( u w S v ) L . Since also that ϕ l + 1 ( L ) = ϕ l + 1 ( s ( G ) ) we deduce that u G w S v .
As u G v S w and u G w S v then by applying (11) we have u G v S .
Let us now prove (ii). We will indeed apply (i) to the relation T ( L ) , which also satisfies the axioms (11), (42) and (41), and to its concentration graph which is H.
Since the equality ϕ l ( L ) = ϕ l ( T ( s ( H ) ) ) and by applying Lemma 4 we obtain
ϕ | V | l 2 ( T ( L ) ) = ϕ | V | l 2 ( s ( H ) ) .
Let l = | V | l 3 . Then (45) is equivalent to ϕ l + 1 ( T ( L ) ) = ϕ l + 1 ( s ( H ) ) . Let us then apply (i) to H and T ( L ) . Then
( u v S ) T ( s ( H ) ) ( u v V S u v ) s ( H ) w V ( ( V S u v ) u v ) = S   such   that   ( u v ( V S u v ) w ) s ( H )
Since | V S u v | = | V | ( l + 1 ) 2 = l . Furthermore the fact that ( u v ( V S u v ) w ) s ( H ) is equivalent to ( u v S w ) T ( s ( H ) ) . Hence, (ii) is proved. □
Lemma 9.
Let L R ( V ) be a relation satisfying the axioms (11), (42) and (41). Let μ and ν are parameters associated with L . Let l { 0 , , | V | 2 } .
i.
Assume that l | V | 3 . Assumes that ϕ l + 1 ( L ) = ϕ l + 1 ( s ( G ) ) and let ( u v S ) L where | S | = l , then
( u v S ) s ( G ) u G v
ii.
Assume that l 1 . Assumes that ϕ l 1 ( L ) = ϕ l 1 ( s ( G ) ) and let ( u v S ) L where | S | = l , then
( u v S ) T ( s ( H ) ) u G v
Proof of Lemma 9.
First, we prove (ii) by induction on l ν .
If l = 1 , then | S | = 1 and S = { w } where w V u v . By applying (44) in Lemma 8 we deduce that
( u v S ) = ( u v w ) T ( s ( H ) ) ( u v ) T ( s ( H ) ) u H v
Assume now that (49) is satisfied for any subset S with cardinality l. Let us prove it for | S | = l + 1 , ν .
Let us then apply (44) since the hypothesis of Lemma 8 are still satisfied:
( u v S ) T ( s ( H ) ) w S   such   that   ( u v S w ) s ( H ) .
Since | S w | = l and the hypothesis of Lemma 8 are still satisfied. Hence, by applying the induction hypothesis, we can deduce that (49) is still satisfied for | S | = l + 1 . Hence, (ii) is proved.
The proof of (ii) is easily deduced from ( i ) . It is obtained by applying (i) to G, which is the covariance associated with T ( L ) . □
Lemma 10.
Let L R ( V ) be a relation satisfying the axioms (11), (42) and (41). Let μ and ν are parameters associated with L . Let l { 0 , , | V | 2 } .
i.
Assume that l | V | 3 . If ϕ l + 1 ( L ) = ϕ l + 1 ( s ( G ) ) . Then
ϕ l ( L ) = ϕ l ( s ( G ) ) V ( s ( G ) , l ) = V ( L , l ) E ¯
ii.
Assume that l 1 . If ϕ l 1 ( L ) = ϕ l 1 ( T ( s ( H ) ) ) . Then
ϕ l ( L ) = ϕ l ( T ( s ( H ) ) ) V ( T ( s ( H ) ) , l ) = V ( L , l ) E ¯

6. Graphical Criteria for Not Belonging to L

In this section, we give a graphical criterion from either the concentration and/or the covariance graphs that can be used to determine now which of the couples in C ( V ) could not be in the relation L . In other terms, we are giving here the reciprocal way of the global Markov property. If L is, for example, a relation as the one associated with a random vector and defined in Example 1 this criteria will allow us to read now dependencies statements between a triplet of sub-random vectors. This criterion was also used in [17] but to read dependencies only from covariance graphs.
Definition 4.
Let u v be a pair of vertices in an undirected graph G and n N * . We say that u and v are n -connected given S if there exists exactly n paths p 1 , , p n in P ( u , v , G ) pairwise disjoints, i.e., i j ( p i p j ) { u , v } = such that i = 1 , , n p i S u v . We denote then that u ~ G n v S .
The lemma below is needed for the proof of the main result in this paper.
Lemma 11.
Let L R ( V ) be a pseudographoid. Let U V . Let us denote by L U the relation deduced from as following
L U = ( u v S ) L   a n d   S u v U
Let us denote by H and H ( U ) be the covariance graph associated respectively with L and L U . Then H ( U ) = H U where H U is the subgraph of H induced on U.
Proof. 
We know that the graph H ( U ) is defined as follows
u H ( U ) v ( u v ) L U ( u v ) L u H U v .
Hence, the lemma is proved. □
The following statements are all the possible contrapositive of the semigrpahoid axiom. Some of them will be needed in the proof of the main result. If L is a semigraphoid then it satisfies the following statements
( u w S ) L   and   ( u v S ) L , ( u w S v ) L
( u w S ) L   and   ( u w S v ) L , ( u v S ) L
( u v S w ) L   and   ( u v S ) L , ( u w S v ) L
( u v S w ) L   and   ( u w S v ) L , ( u v S ) L
We now define another family of relations: the graphoids.
Definition 5.
A relation L is called a graphoid if it is at the same a pseudographoid and a semigraphoid.
As we see in the prove above, we will rather use the contrepositive of the weak transitive axiom defined (42).
If   ( u w S ) L , ( v w S ) L   and   ( u v S ) L then   ( u v S w ) } L
Here is now the last lemma before giving the proof of the main result of this section.
Lemma 12.
Let L R ( V ) be a graphoid. Let H be the covariance associated with L . Let u , v V and S V u v . Assume that the following two hypotheses are satisfied
A.
i/ w S , any path between w and v contains u and ii/ u H v . Then ( u v S ) L
B.
i/ w V S u v , any path between w and v contains u and ii/ u G v . Then ( u v S ) L
Proof. 
Let us prove (A).
Let us first deduce from (i) that w S the vertices v and w can not be adjacent in H then ( v w ) L . We can also deduce from the condition (i) that ( u v ) L .
Secondly, according to (ii), we can deduce that the subset V ( ( S w ) w v ) = V S v contains u since u S and this later subset intersects any path between v and w. Then by using Theorem 1 we deduce that ( v w S w ) L .
Let prove Lemma 12 by induction on n = | S | .
Let us prove the lemma for n = 1 . Let us note that ( u v ) L and ( v w ) L . Hence by applying (50) we deduce that ( u v w ) L .
Let us assume that the lemma is true for subsets S with cardinality n. Let us prove it for n + 1 . Hence assume that | S | = n + 1 . Using the induction hypothesis for all w S , ( u v S w ) L . Assume that ( u v S ) L . Since S = ( S w ) w . Then by using the induction hypothesis we have ( u v S w ) L . Since we have already proved that ( v w S w ) L , then by using (50) we deduce that ( u v ( S w ) w ) L . Hence ( u w S ) L .
The part (B) of the lemma is deduced by applying (A) to T ( L ) and G which is the covariance graph associated with T ( L ) . □
Let us now give the criteria needed to determine if a couple ( u v S ) can not be in a relation L . Considering the relations that are derived from probability distributions, we can then be able to determine which relations do not correspond to Conditional Independence statements.
Theorem 3.
Let L R ( V ) be a relation satisfying the axioms (11), (41) and (42). Let G and H be the concentration and the covariance graph associated with L . Then
i.
if u H 1 v S then ( u v S ) L
ii.
if u G 1 v S then ( u v V S u v ) L
Proof of Theorem 3.
Let us first prove (i).
Since u H 1 v S then there exists a unique path between u and v such that p S { u , v } . Let us prove (i) by induction on n = | p | the length of p.
If n = 1 then u H v . We have to prove that
( u v S ) L .
Let us now prove (55) by induction on n = | S | .
When | S | = 1 , S = { w } . Hence, two cases are possible:
w H u H v   or   u H v H u .
Let us consider the case where w H u H v . Note that w H v otherwise it exists another path between u and v different from the path-edge u H v . Hence
( u v ) L   and   ( w v ) L
Let us then apply (50) in the case where ( u , w , v , S = ) then ( u v w ) L . Then the statement is proved when | S | = 1 .
Let us assume that ( u v S ) L when | S | = n and satisfies the hypothesis of Theorem 3. Let us assume now that | S | = n + 1 . Let us denote by H S u v the induced graph of H on S u v . Let us then define the following two subsets of S:
S ( u ) = w S p   such   that   P ( w , u , H S u v )   and   P ( w , v , H S ) =
and
S ( v ) = w S p   such   that   P ( w , u , H S u v ) =   and   P ( w , v , H S )
First we claim that S = S ( u ) S ( v ) and that S ( u ) S ( v ) = .
The second statement in our claim is obvious, i.e., S ( u ) S ( v ) = . let us show that S = S ( u ) S ( v ) .
Let w S . If P ( u , w , H S u v ) and P ( v , w , H S u v ) then there exists p P ( u , w , H S u v ) and p P ( v , w , H S u v ) , both of them are included in H S u v and p = p p which is the union of p and p is a path connecting u and v different from the edge path u H v . We obtain a contradiction with the fact that a single path exists between u and v in H.
Since S then either S ( u ) or S ( v ) .
Let us assume then that S ( u ) . Since for any w S ( u ) , w H v , then for all w S ( u ) we have ( v w ) L . However ( u v ) L . Hence by using Lemma 12 we deduce that
( u v S ( u ) ) L .
Similarly if we assume that S ( v ) we deduce that ( u v S ( u ) ) L .
Let w S . It is easily seen that S w is a set with cardinality n and satisfying the hypothesis of Theorem 3. Hence, by using the induction hypothesis, we deduce that ( u v S w ) L . Assume that w S ( u ) . Since there is no path between w and v that does not intersect u then V ( ( S w ) w v ) contains u. Hence any path between w and v intersect V ( ( S w ) u v ) in u. Then ( v w S w ) L . We have shown then that
( v w S w ) L   and   ( u v S w ) L .
Hence by using (50) we deduce that ( u v S ) L since S = ( S w ) w .
Let us assume that the result remains true when p is a path with length n. Assume now that | p | = n + 1 . Let then w p u v . The vertex w S as p u v S . As
V ( ( S w ) u v ) = ( V ( S u v ) ) w .
Hence p V ( ( S w ) u v ) = w . Since p is the unique path between u and v then
( u v S w ) L .
Since u H 1 w S w and v H 1 w S w then by applying the induction hypothesis we deduce that ( u w S w ) L and ( v w S w ) L .
As L is a relation satisfying the weak transitivity axiom (42), then by applying (54) we deduce that ( u v S ) L .
Finally, (ii) is deduced by applying (i) to T ( L ) . □
Let us now consider the following axiom
{ ( u v S ) , ( u w S ) } L { ( u v S w ) , ( u w S v ) } L
where u , v , w V and S V u v w . We will say that a relation L is a gaussoid if it is relation satisfying (11), (41), (42) and (57).
Let us now show there is also another criterion as the one shown in Theorem 3 that can help us to determine elements of L .
Theorem 4.
Let L R ( V ) be a gaussoid. Let u , v in V and S V S . Let us assume that there exists w S such that w H 1 v S and let p be the single path between v and w Then
( u v S ) L ( u v S p ) L
Proof. 
Let us write the proof by induction n = | p | .
Let n = 1 then w H v and S p = S v w = S w . We have then to prove (58).
Since any path between w and u should contain v and S w does not contain v. Since p is the only path between v and w. Then any path between w and v intersects S w in v. Then, by using Theorem 1 we deduce that ( u w S w ) L .
Let us assume that ( u v S w ) L and since ( u w S w ) L we apply (50) to ( u , w , v , S w ) we deduce that ( u v S ) L .
If we assume now that ( u v S w ) L and since L is a gaussoid then by applying (57) we deduce that ( u v S ) L . Hence the equivalence (58) is proved.
Let us assume that Theorem 4 is true when p is with length equal to n, i.e., | p | = n : there are n edges in the path p. Assume now that | p | = n + 1 .
Let w p , w H w . We have also easily noted that w H 1 v S w . Since p = p w is a path with n vertices we can apply (58) to S w and p :
( u v S w ) L ( u v S w p ) L
However let us note that ( S w ) p = ( S p ) w . By induction hypothesis, any path between w and u should contain { v } . Then S u v ( S w ) u w = { v } seprates u and v in H S u v . Hence by applying Theorem 1 and Lemma 11, we deduce that ( u w S w ) L .
Let us assume that ( u v S w ) L and since ( u w S w ) L , we apply (50) to ( u , w , v , S w ) and we deduce that ( u v S ) L .
If we assume now that ( u v S w ) L and since L is a gaussoid then by applying (57) we deduce that ( u v S ) L .
Hence we have shown that
( u v S w ) ( u v S ) L .
Hence by using (59) we deduce also that
( u v S ) L ( u v ( S p ) w ) L .
We can also note that S u v ( ( ( S p ) w ) w u ) = p w u . Then this former set separates u and w in H S w u . Hence ( u w ( S p ) w ) L .
Let us assume that ( u v ( S p ) w ) L and since ( u w ( S p ) w ) L we can apply (50) to ( u , w , v , ( S p ) w ) we deduce then that ( u v S p ) L .
If we assume now that ( u v ( S p ) w ) L and since L is a gaussoid then by applying (57) we deduce that ( u v S p ) . Hence the equivalence (58) is proved. □
Corollary 1.
Let L R ( V ) be a gaussoid and let G and H be respectively the concentration and the covariance graph associated with L . Let μ = μ ( L ) , ν = ν ( L ) , s = so ( G ) and t = so ( H ) . Then
i.
if H is a cycle, then ν = | V | 3 , s G | V | 2 , μ = | V | 2 and L = S ( G ) T ( S ( H ) ) .
ii.
if G is a cycle, then μ = 1 , s H | V | 2 , ν = 0 and L = S ( G ) T ( S ( H ) ) .
Proof. 
Let us prove (i). Assume that H is a cycle. Let u and v two vertices in H and S V u v such | S | = l | V | 3 and let us show that
( u v S ) L u H v V S u v
Firstly that note that if l | V | 3 then the cardinality of | V S u v | = | V | l 2 1 . Then V S u v .
Let us assume, to the contrary, that u H v V S u v . Then, there exists a path between u and v that does not intersect V S u v . Given that H is a cycle, this path p is unique and is contained within S u v . Consequently, u H 1 v S . According to Theorem 3, this implies that ( u v S ) L , leading to a contradiction. Since ν is defined as the largest integer for which ϕ l ( L ) = ϕ l ( T ( s ( H ) ) ) , it follows that ν | V | 3 .
Let us denote ϕ | V | 2 ( T ( s ( H ) ) ) , given that H is connected. It is important to note that ν | V | 2 , as H is a connected graph. This means that we cannot find edges that an empty set can separate.
Since ν s G 1 (as stated in Theorem 2), and we have proven that ν | V | 3 , it follows that s G | V | 2 . Consequently, if G is not a complete graph, then all separators in G have a cardinality equal to | V | 2 . Therefore, for all l = 0 , , | V | 3 , ϕ l ( s ( G ) ) = , and L can include pairs ( u v S ) L where | S | = | V | 3 . To establish this, it suffices to select u and v as non-adjacent in H. By applying Theorem 3, we can deduce that ( u v S ) must belong to L .
Let us now prove that L = s ( G ) T ( s ( H ) ) . Consider ( u v S ) L . If | S | = | V | 2 , then ( u v S ) ϕ | V | 2 ( L ) = ϕ | V | 2 ( s ( G ) ) s ( G ) . Given that s G | V | 2 , it follows that for all l = 0 , , | V | 3 , ϕ l ( s ( G ) ) = . Furthermore, as demonstrated in the first part of this proof, if ( u v S ) is valid, then ( u v V S u v ) s ( H ) . Consequently, we conclude that L = s ( G ) T ( s ( H ) ) . □

7. A Discussion about the Gieger and Pearl Conjecture

Geiger and Pearl conjectured that for any undirected graph GG, it is possible to construct a probability distribution PP such that
c ( P ) = L
(see [10,27]). This conjecture is particularly relevant in the context of concentration graphical models, where the faithfulness assumption plays a pivotal role. The concept of faithfulness in these models, as defined in relation to Equation (61), is critical, especially when considering the PC algorithms (see [18]). In faithful graphical models, the 0-1 graph is expected to include, at a minimum, all the edges present in the full conditional independence graph, also known as the concentration graph (see [28]). This relationship underscores the importance of the faithfulness assumption in accurately representing and analyzing the underlying probabilistic structures in graphical models.
It was also proved in [21] that from any undirected graph H = ( V , F ) and for a fixed non-negative ϵ “sufficiently close de zero” we can construct a real positive definite matrix A H , ϵ = ( a u v ) u v V × V as follows
a u v = 1 if   u = v ϵ if   u v E 0 otherwise ,
and they showed (see Corollaries 2 & 3 in [21]) that T ( s ( H ) ) = d ( A H , ϵ ) . Hence if we consider a random vector X with Gaussian distribution having A H , ϵ as a covariance matrix the undirected graph H is then the covariance graph associated with P the probability distribution of X . According to Theorem 2, the separability order of the concentration graph G associated with P is a complete graph, and then s ( G ) = .
On the other hand let now consider an undirected graph G and let us consider B G , ϵ = A G , ϵ 1 where A G , ϵ 1 is constructed as in (62) and let us consider the random vector Y as a Gaussian vector with covariance matrix B G , ϵ . According to the construction of A G , ϵ , the undirected graph G is the concentration graph associated to Q, the probability distribution of Y . Since T ( s ( G ) ) = d ( A H , ϵ ) , then s ( G ) = T ( d ( A G , ϵ ) ) = d ( B G , ϵ ) . We have applied here Lemma 1 in [21]). By applying also 2 the covariance graph H associated to Q is complete and hence T ( s ( H ) ) = .
We think then that Geiger and Pearl’s conjecture is completely proved.

8. Conclusions

In conclusion, this paper has comprehensively examined the role of undirected graphs in representing sets of Conditional Independence (CI) statements. Through rigorous exploration of properties and implications, we have established that specific axioms govern these sets, providing a mathematical framework for their representation. Our investigation has focused on covariance and concentration graphs, the only known families of undirected graphs capable of comprehensively describing CI statements. We have introduced parameters to assess the limitations of these graphs and provided a computational method for their evaluation. Additionally, we have offered criteria to ascertain the complete representation of CI statements through their corresponding graphs. These findings advance the theoretical understanding of graph theory in statistical contexts and offer practical tools for researchers in related fields. Future work could extend these insights to more complex graph structures or apply them in empirical network data studies. Overall, this research significantly enhances our understanding of the capabilities and limitations of undirected graphs in representing CI statements, making a substantial contribution to the field of graphical models.

9. Notations

  • V is a finite set
  • | V | is the cardinality of V
  • E is a subset of V × V { ( u , u ) , u V }
  • u v is the couple ( u , v ) in V × V
  • G = ( V , E ) is an undirected graph where V is the set of vertices and E is the set of edges that satisfies u v E v u E
  • P = { S , S V }
  • P k = { S , S V and | S | = k }
  • X V = ( X v , v V ) is a random vector with values in R | V |
  • X S = ( X s , s V ) is sub-random-vector of X V
  • u v S means that X u X v X S
  • A , B and S disjoint subsets of V, A B S means that X A X B X S
  • S ( G ) = ( A , B , S ) , S   separates   A   and   B   in   G
  • C ( P ) = ( A , B , S ) , A B S
  • ( u v S ) is a couple
  • A set of couples L is called a relation
  • If G = ( V , E ) is an undirect graph u G v means that u v E is an edge in E.

Funding

This research received no external funding.

Data Availability Statement

No data was used for this research.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. De Morais Andrade, P.; Stern, J.M.; De Bragança Pereira, C.A. Bayesian Test of Significance for Conditional Independence: The Multinomial Model. Entropy 2014, 16, 1376–1395. [Google Scholar] [CrossRef]
  2. Markowetz, F.; Spang, R. Inferring Cellular Networks—A Review. BMC Bioinform. 2007, 8, S5. [Google Scholar] [CrossRef] [PubMed]
  3. Shen, X.; Zhu, S.; Zhang, J.; Hu, S.; Chen, Z. Reframed GES with a Neural Conditional Dependence Measure. arXiv 2022, arXiv:2206.08531. [Google Scholar]
  4. Kasza, J.; Glonek, G.; Solomon, P.J. Estimating Bayesian Networks for High-dimensional Data With Complex Mean Structure and Random Effects. Aust. New Zealand J. Stat. 2012, 54, 169–187. [Google Scholar] [CrossRef]
  5. Friedman, N.; Linial, M.; Nachman, I.; Pe’er, D. Using Bayesian Networks to Analyze Expression Data. J. Comput. Biol. 2000, 7, 601–620. [Google Scholar] [CrossRef]
  6. Kratzer, G.; Furrer, R. Information-Theoretic Scoring Rules to Learn Additive Bayesian Network Applied to Epidemiology. arXiv 2018, arXiv:1808.01126. [Google Scholar]
  7. Fathony, R.; Rezaei, A.; Bashiri, M.A.; Zhang, X.; Ziebart, B.D. Distributionally Robust Graphical Models. arXiv 2018, arXiv:1811.02728. [Google Scholar]
  8. Xie, Y.; Li, S.; Wu, C.; Lai, Z.; Su, M. A novel hypergraph convolution network for wafer defect patterns identification based on an unbalanced dataset. J. Intell. Manuf. 2022, 1–14. [Google Scholar] [CrossRef]
  9. Whittaker, J. Graphical Models in Applied Multivariate Statistics; Wiley: Hoboken, NJ, USA, 1990. [Google Scholar]
  10. Lauritzen, S.L. Graphical Models; Oxford University Press: New York, NY, USA, 1996. [Google Scholar]
  11. Studenỳ, M. Probabilistic Conditional Independence Structures; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
  12. Diestel, R. Graph Theory, 3rd ed.; Graduate Texts in Mathematics; Springer: Berlin/Heidelberg, Germany, 2005; Volume 173. [Google Scholar]
  13. Pear, J.; Paz, A. Graphoids: A graph based logic for reasoning about relevancy relations. Adv. Artif. Intell. 1987, 2, 357–363. [Google Scholar]
  14. Matúš, F. On equivalence of Markov proerties over undirected graphs. J. Appl. Probab. 1992, 29, 745–749. [Google Scholar] [CrossRef]
  15. Kauermann, G. On a dualization of graphical Gaussian models. Scand. J. Stat. 1996, 23, 105–116. [Google Scholar]
  16. Banerjee, M.; Richardson, T. On a Dualization of Graphical Gaussian Models: A Correction Note. Scand. J. Stat. 2003, 30, 817–820. [Google Scholar] [CrossRef]
  17. Peña, J.M.; Nilsson, R.; Björkegren, J.; Tegnér, J. An Algorithm for Reading Dependencies from the Minimal Undirected Independence Map of a Graphoid that Satisfies Weak Transitivity. J. Mach. Learn. 2009, 10, 1071–1094. [Google Scholar]
  18. Malouche, D. Implication of faithfulness assumption. Sankhyā B Indian J. Stat. 2022, 84, 495–515. [Google Scholar]
  19. Becker, A.; Geiger, D.; Meek, C. Perfect Tree-like Markovian Distributions. Probab. Math. Stat. 2005, 25, 231–239. [Google Scholar]
  20. Malouche, D.; Rajaratnam, B. Gaussian covariance faithful markov trees. J. Probab. Stat. 2011, 2011, 152942. [Google Scholar] [CrossRef]
  21. Lňenička, R.; Matúš, F. On Gaussian Conditional Independence Structures. Kybernetika 2007, 43, 327–342. [Google Scholar]
  22. Malouche, D. Determining full conditional independence by low order conditioning. Bernoulli J. 2009, 15, 1179–1189. [Google Scholar] [CrossRef]
  23. Menger, K. Zur allgemeinen kurventheorie. Fund. Math. 1972, 10, 96–115. [Google Scholar] [CrossRef]
  24. Lovász, L. A remark on Menger’s theorem. Acta Math. Acad. Sci. Hung. 1970, 21, 365–368. [Google Scholar] [CrossRef]
  25. O’Neil, P.V. A new proof of Menger’s theorem. J. Graph Theory 1978, 2, 257–259. [Google Scholar] [CrossRef]
  26. McCuaig, W. A simple proof of Menger’s theorem. J. Graph Theory 1984, 8, 427–429. [Google Scholar] [CrossRef]
  27. Geiger, D.; Pearl, J. Logical and algorithmic properties of conditional independence and graphical models. Ann. Stat. 1993, 21, 2001–2021. [Google Scholar] [CrossRef]
  28. Wille, A.; Bühlman, P. Low-Order Conditional Independence Graphs for Inferring Genetic Network. Stat. Appl. Genet. Mol. Biol. 2006, 5, 1–32. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Malouche, D. Describing Conditional Independence Statements Using Undirected Graphs. Axioms 2023, 12, 1109. https://doi.org/10.3390/axioms12121109

AMA Style

Malouche D. Describing Conditional Independence Statements Using Undirected Graphs. Axioms. 2023; 12(12):1109. https://doi.org/10.3390/axioms12121109

Chicago/Turabian Style

Malouche, Dhafer. 2023. "Describing Conditional Independence Statements Using Undirected Graphs" Axioms 12, no. 12: 1109. https://doi.org/10.3390/axioms12121109

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop