Next Article in Journal
Solving a Novel System of Time-Dependent Nuclear Reactor Equations of Fractional Order
Previous Article in Journal
Design and Analysis of a Symmetric Joint Module for a Modular Wire-Actuated Robotic Arm with Symmetric Variable-Stiffness Units
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Correspondence Analysis for Assessing Departures from Perfect Symmetry Using the Cressie–Read Family of Divergence Statistics

by
Eric J. Beh
1,2,*,† and
Rosaria Lombardo
3,†
1
National Institute for Applied Statistics Research Australia (NIASRA), University of Wollongong, Wollongong, NSW 2522, Australia
2
Centre for Multi-Dimensional Data Visualisation (MuViSU), Stellenbosch University, Stellenbosch 7602, South Africa
3
Department of Economics, University of Campania “Luigi Vanvitelli”, 80100 Capua, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Symmetry 2024, 16(7), 830; https://doi.org/10.3390/sym16070830
Submission received: 3 June 2024 / Revised: 25 June 2024 / Accepted: 29 June 2024 / Published: 2 July 2024
(This article belongs to the Section Mathematics)

Abstract

:
Recently, Beh and Lombardo (2022, Symmetry, 14, 1103) showed how to perform a correspondence analysis on a two-way contingency table where Bowker’s statistic lies at the numerical heart of this analysis. Thus, we showed how this statistic could be used to visually identify departures from perfect symmetry. Interestingly, Bowker’s statistic is a special case of the symmetry version of the Cressie–Read family of divergence statistics. Therefore, this paper presents a new framework for visually assessing departures from perfect symmetry using a second-order Taylor series approximation of the Cressie–Read family of divergence statistics.

1. Introduction

The correspondence analysis of a two-way contingency table, denoted here by N, formed from the cross-classification of two categorical variables has a long history; see, for example, Beh and Lombardo [1], Greenacre [2] and Lebart, Morineau and Warwick [3]. When these variables each consist of S responses so that the contingency table is of dimension S × S , studying the symmetry (or lack of symmetry) between the variables has been a topic of research undertaken by, for example, Beh and Lombardo [4] and Greenacre [5]. Both approaches involve the partition of N such that
N = Y + K = 1 2 N + N T 1 2 N N T
where Y is the matrix that reflects the symmetric part of the table, and K reflects the skew-symmetric part. This partition was considered in various contexts by many, including but certainly not limited to Bove [6], Constantine and Gower [7] (Section 3), and Gower [8].
The methods of Greenacre [5] and Beh and Lombardo [4] approach the visualisation of the departure from perfect symmetry using correspondence analysis by partitioning the transformed contingency table into a skew matrix and a skew-symmetric matrix, as (1) does. While both correspondence analysis approaches have (1) as a common thread, they are quite different. Greenacre [5] uses Pearson’s chi-squared statistic, X 2 , that centres the elements of the contingency table with respect to the mean of the row and column marginal totals, yielding two low-dimensional displays; one depicting departures from perfect symmetry and the other depicting departures from skew-symmetry. On the other-hand, Beh and Lombardo [4] use Bowker’s chi-squared statistic, X B 2 [9], producing a single low-dimensional display depicting departures from perfect symmetry.
While the difference between X 2 and X B 2 is that the former assesses departures from complete independence while the latter assesses departures from perfect symmetry, both can be expressed as a special case of the Cressie–Read family of divergence statistics (Cressie and Read [10]) when viewed as a goodness-of-fit measure. For more on how correspondence analysis can be performed on a two-way contingency table using the Cressie–Read family of divergence statistics, refer to Beh and Lombardo [11]. Their method includes, as special cases, the classical approach to correspondence analysis [1,2,3], log-ratio analysis (LRA) [12,13], and the Hellinger Distance Decomposition (HDD) method [14,15]. Rather than using Pearson’s statistic as the numerical foundation, LRA and HDD use the modified log-likelihood ratio statistic [16] and the Freeman–Tukey statistic [17], respectively.
This paper shows how a correspondence analysis can be performed on N for assessing departures from perfect symmetry using a family of symmetry divergence statistics. To do so, this paper is divided into five further sections. Section 2 gives an overview of the classic test of the departure from perfect symmetry for a two-way contingency table. It also describes how the Cressie–Read family of divergence statistics can be used for performing such a test. This family is dependent on the power parameter δ , where changes in δ lead to special cases of the family. In this paper, we focus on the second-order approximation of this family which exactly yields Pearson’s chi-squared statistic ( δ = 1 ), the Freeman–Tukey statistic ( δ = 1 / 2 ), and the modified likelihood ratio statistic ( δ = 0 ). Section 3 describes the core interest of this paper; the development of a correspondence analysis framework that can be applied to a two-way contingency table to visualise sources of departure from perfect symmetry when using the Cressie–Read family of divergence statistics. As part of this discussion, we also show that when δ = 1 , Bowker’s statistic is the numerical foundation of the correspondence analysis method that is a special case of this new framework. Two examples are given that demonstrate the various features of this new framework. Section 4 studies a 4 × 4 artificial contingency table which exhibits perfect symmetry when a constant C = 0 is added to a cell frequency. As C increases, the artificial table exhibits features consistent with increasing departures from perfect symmetry and so this example examines the features of this correspondence analysis framework as C and δ change. Our second example (Section 5) examines the data of Wiepkema [18] that are concerned with 12 pre- and post-courtship behaviours of a small European fish called a bitterling (Rhodeus amarus Bloch). Some final comments on the framework outlined here are made in Section 6.

2. Test of Perfect Symmetry and the Cressie–Read Family of Divergence Statistics

2.1. Notation

Suppose we have an S × S contingency table, N , where the i , j th cell entry has a frequency of n i j for i = 1 , 2 , , S and j = 1 , 2 , , S . Let the grand total of N be n and let the matrix of relative frequencies be P so that its i , j th cell entry is p i j = n i j / n , where i = 1 S j = 1 S p i j = 1 . Define the ith row marginal proportion by p i = j = 1 S p i j . Similarly, define the jth column marginal proportion as p j = i = 1 S p i j .

2.2. Testing Departures from a Hypothesised p i j

Testing whether there is evidence of a statistically significant association between the row and column variables of N can be made by considering any member of the Cressie–Read family of divergence statistics
CR δ = 2 n δ δ + 1 i = 1 S j = 1 S p i j p i j p ^ i j δ 1 ,
for any δ , , where p ^ i j is some value of p i j under a well-defined null hypothesis. For example, as we discuss in Section 2.3 and Section 2.4, this null hypothesis can define p i j under complete independence or perfect symmetry between the variables. When assessing departures from complete independence, p ^ i j = p i p j for i , j = 1 , 2 , , S so that (2) is a chi-squared random variable with S 1 2 degrees of freedom. However, Cressie and Read [10] also presented a second-order approximation of (2) around p i j / p ^ i j δ = 1 that is very useful for the purposes of applying a correspondence analysis to N. This approximation is
CR δ CR * δ = n i = 1 S j = 1 S p ^ i j 1 δ p i j p ^ i j δ 1 2 .

2.3. Testing Departures from Complete Independence

Any reasonable choice of p ^ i j may be defined but we confine ourselves to briefly discussing its definition under complete independence and perfect symmetry. For the case where one is interested in assessing departures from complete independence of the variables of N, (2) is expressed as
CR A δ = 2 n δ δ + 1 i = 1 S j = 1 S p i j p i j p i p j δ 1 .
The subscript “A” has been added to the left-hand side to show that this family of statistics assesses departures from complete association. The general nature of (2) and (4) ensures that specific values of δ lead to well-defined and well-understood measures of association, all of which are chi-squared random variables. These include Pearson’s chi-squared statistic, the log-likelihood ratio statistic, and the Freeman–Tukey statistic which are X 2 = CR δ = 1 , G 2 = CR δ = 0 , and T 2 = CR δ = 1 / 2 , respectively. The modified chi-squared statistic, the modified log-likelihood ratio statistic, and the Cressie–Read statistic are also special cases such that N 2 = CR δ = 2 , M 2 = CR δ = 1 , and C 2 = CR δ = 2 / 3 , respectively.
Beh and Lombardo [11] showed that a correspondence analysis of N when assessing departures from independence can be undertaken by making use of the second-order Taylor series approximation of (2) around
p i j p ^ i j δ = p i j p i p j δ = 1
resulting in
CR A δ CR A * δ = n i = 1 S j = 1 S p i p j 1 δ p i j p i p j δ 1 2 .
This approximation may be obtained by substituting p ^ i j = p i p j into (3). See also Cressie and Read [10] (pp. 94–95) for a derivation of this approximation of (4). This family of statistics gives exactly the following commonly used chi-squared statistics: Pearson’s statistic X 2 = CR A 1 = CR A * 1 , the Freeman–Tukey statistic T 2 = CR A 1 / 2 = CR A * 1 / 2 , and the modified log-likelihood ratio statistic M 2 = CR A 1 = CR A * 0 . In the context of a correspondence analysis, X 2 serves as the numerical foundations of the traditional approach, T 2 is the foundations of the method described in Beh, Lombardo and Alberti [19] and Cuadras and Cuadras [14], while M 2 serves as the foundations of LRA, a variant of correspondence analysis described by Greenacre [12].

2.4. Testing for Departures from Perfect Symmetry

Sometimes it is the case that the two variables of N are, say, identical but measured over two different time periods. It may be that these same variables are collected between two different cohorts. In cases such as these, it is more typical to analyse the departures from perfect symmetry between the rows and columns of N. Therefore, when testing for departures from perfect symmetry, the null hypothesis is
H 0 : p i j = p j i
for i = 1 , 2 , , I and for j = 1 , 2 , , J .
When assessing whether there exists any evidence of symmetry between the variables of N in the population, Agresti [20] (p. 427) and Anderson [21] (p. 321) showed that the most appropriate choice of p ^ i j was
p ^ i j = p i j + p j i 2 .
Therefore, the Cressie–Read family of divergence statistics can be defined for testing departures from perfect symmetry so that (2) can be expressed as
CR S δ = 2 n δ δ + 1 i = 1 S j = 1 S p i j 2 p i j p i j + p j i δ 1 ,
and is a chi-squared random variable with S S 1 / 2 degrees of freedom. The subscript “S” has been added to the left-hand side of (7) to show that this family of statistics assesses departures from perfect symmetry. This statistic has been the topic of interest by Tomizawa, Seo and Yamamoto [22], Ando, Hoshi, Ishii and Tomizawa [23], and Altun and Saraçbaşi [24]. Our focus is to examine the role of a second-order approximation of CR S δ for performing a correspondence analysis to visually detect departures from perfect symmetry.

2.5. A Second-Order Approximation

A second-order Taylor series approximation of (7) around
p i j p ^ i j δ = 2 p i j p i j + p j i δ = 1
can be obtained by substituting (6) into (3). Doing so yields the family of asymptotically chi-squared random variables with S S 1 / 2 degrees of freedom under the null hypothesis of perfect symmetry, (6),
CR S * δ = n 2 i = 1 S j = 1 S p i j + p j i 1 δ 2 p i j p i j + p j i δ 1 2
or, alternatively but equivalently,
CR S * δ = n i > j S p i j + p j i 1 δ 2 p i j p i j + p j i δ 1 2 .
There are three special cases of this family of divergence statistics that we consider in our analysis of symmetry in a two-way contingency table. The first is when δ = 1 :
X S 2 = CR S * 1 = n 2 i = 1 S j = 1 S p i j p j i 2 p i j + p j i = n i > j S p i j p j i 2 p i j + p j i
which is just Bowker’s chi-squared statistic [9]. Beh and Lombardo [4] used this statistic as the basis for performing a correspondence analysis to assess departures from perfect symmetry in N.
Secondly, suppose that we consider the case where (8) is evaluated when δ = 1 / 2 . Then, we can show that
T S 2 = CR S * 1 2 = 4 n i = 1 S j = 1 S p i j p i j + p j i 2 2
is the Freeman–Tukey statistic when assessing departures from perfect symmetry.
The third special case of (8) is when δ = 0 . For this value of δ , (8) does not exist. However, we can obtain the limiting value of (8) as δ 0 . Doing so means that we can use the Box–Cox transformation so that
lim δ 0 1 δ 2 p i j p i j + p j i δ 1 = ln 2 p i j p i j + p j i .
Therefore,
CR S * 0 = lim δ 0 CR S * δ = n i = 1 S j = 1 S p i j + p j i 2 lim δ 0 1 δ 2 p i j p i j + p j i δ 1 2
simplifies to
M S 2 = CR S * 0 = n i = 1 S j = 1 S p i j + p j i 2 ln 2 p i j p i j + p j i
and is the modified version of the log-likelihood ratio statistic when testing for perfect symmetry in N . Note that Bishop, Fienberg and Holland [25] (Equation (8.2-11)), Haberman [26] (p. 489), and Ireland, Ku and Kullback [27] (Equation (1.3)) gave the (unmodified) log-likelihood ratio statistic
M ˜ S 2 = n i = 1 S j = 1 S p i j ln 2 p i j p i j + p j i
for assessing departures from perfect symmetry in an S × S contingency table.
When there is perfect symmetry between the variables of N so that (5) holds, (9)–(11) is zero. When there exists a statistically significant departure from perfect symmetry, we can then visually assess the statistical significance of this departure using a correspondence analysis. We now show how (8) can be used to perform a correspondence analysis on N when assessing these departures.

3. Correspondence Analysis and Perfect Symmetry

3.1. The Divergence Residual

To perform a correspondence analysis on N under the null hypothesis of perfect symmetry, we first define the S × S matrix of divergence residuals, S δ , where its i , j th element is
s i j δ = 1 δ p i j + p j i 2 2 p i j p i j + p j i δ 1 .
Therefore, the sum of squares of these residuals gives (8) so that
CR S * δ = n i = 1 S j = 1 S s i j 2 δ
= n trace S δ T S δ
= n trace S δ S δ T .
Note that when i = j (so that we are concerned with the diagonal elements of S δ ) these residuals are zero for all δ . Three examples of the form that (12) takes is when δ = 1 , 1 / 2 and (approaching) 0. Respectively, these values of δ give the residuals
s i j 1 = 1 2 p i j p j i p i j + p j i s i j 1 2 = 2 p i j p i j + p j i 2 s i j 0 = p i j + p j i 2 ln 2 p i j p i j + p j i .
The first of these, s i j 1 , is the i , j th Bowker residual described by Beh and Lombardo [4] (Equation (7)) so that n times its sum of squares produces Bowker’s statistic, (9).
The second and third residuals are akin to the Freeman–Tukey residual and modified log-likelihood ratio residual, respectively, described by Beh and Lombardo [11], but are used when assessing departures from perfect symmetry. Note that n times the sum of squares of these two residuals gives (10) and (11), respectively.
For s i j 0 , it is assumed that all cells of the contingency table have non-zero frequencies so that 0 < p i j < 1 for i = 1 , 2 , , S and j = 1 , 2 , , S . This is to avoid any problems with calculating the natural logarithm of zero. In the event that a zero cell frequency is observed, a simple remedy is to replace it with a small value, say 0.01. Alternatively, one may use more objective methods to accommodate for a zero cell frequency. Other residuals can also be obtained using alternative values of δ .

3.2. Is the Matrix of Divergence Residuals Skew-Symmetric?

One of the benefits of using Bowker’s statistic as the numerical basis on which to perform a correspondence analysis is that the resulting matrix of divergence residuals is skew-symmetric. That is, when δ = 1 , S 1 has the property that S 1 T = S 1 . Therefore, s i i 1 = 0 and s i j 1 = s j i 1 for i j , i , j . It also means that the singular values and the left and right singular vectors of S 1 can be calculated by applying an eigen-decomposition to S 1 T S 1 or, equivalently, S 1 2 . Ward and Gray [28] and Gower [8] (p. 113) discuss that for an S × S skew-symmetric matrix, like S 1 , if S is odd, then there will always be a zero eigen-value and S 1 positive eigen-values. If S is even, there will always be S eigen-values that exist in pairs; see Constantine and Gower [7].
When δ 1 , S δ is not a skew-symmetric matrix, since there is at least one cell where s i j δ s j i δ , i j , unless there is perfect symmetry between the variables of N .

3.3. Singular Value Decomposition and the Divergence Residual

When assessing departures from perfect symmetry in N , the correspondence analysis approach of Beh and Lombardo [4] involves applying a singular value decomposition (SVD) to the matrix S 1 . Since Bowker’s statistic is a special case of (8), this suggests that a more general family of correspondence analysis techniques can be developed for visualising departures from perfect symmetry. Such a general family can be developed using the family of statistics generated from (8). Therefore, a new general family of correspondence analysis techniques can be obtained by applying an SVD to S δ such that, for the i , j th cell,
1 δ p i j + p j i 2 2 p i j p i j + p j i δ 1 = m = 1 M a i m δ λ m δ b j m δ
where
i = 1 I a i m δ a i m δ = 1 m = m 0 m m , j = 1 J b j m δ b j m δ = 1 m = m 0 m m ,
and M is the maximum number of dimensions required to depict all of the association that exists between the variables of the contingency table. When δ = 1 , M = S if S is even, and M = S 1 if S is odd. For other values of δ , M = S . The quantities a i m δ and b j m δ are the ith and jth element, respectively, of the mth left and right singular vectors of the matrix of divergence residuals for a fixed δ . The mth largest singular value is λ m δ so that 1 > λ 1 δ > λ 2 δ > , λ M * δ > 0 .
The matrix form of (15) and (16) is
S δ = A δ Δ δ B δ T
with
A δ T A δ = I M and B δ T B δ = I M
being the matrix form of (16). Here, I M is an M × M identity matrix, A δ is the S × M matrix where the i , m th element is a i m δ , B δ is the S × M matrix where the j , m th element is b j m δ , and Δ δ is the M × M diagonal matrix of singular values with λ m as its m , m th element.
While the Cressie–Read family of divergence statistics can be expressed in terms of S δ —see (13) and (14)—it can also be expressed in terms of its singular values. To show this, substituting (17) into (13) leads to
CR * δ = n trace A δ Δ δ B δ T T A δ Δ δ B δ T = n trace B δ Δ δ 2 B δ T = n trace Δ δ 2
when B δ is of full rank so that B δ B δ T = B δ T B δ = I M . Therefore, the total inertia of N can be expressed as the sum of squares of the squared singular values so that
CR * δ n = m = 1 M λ m 2 δ .
Expressing the total inertia in this manner is analogous to the total inertia of Beh and Lombardo [11] (p. 22) when the Cressie–Read family of divergence statistics is used as the numerical basis of the correspondence analysis of a two-way contingency table.

3.4. The Principal Inertia Values

Beh and Lombardo [4] showed that when assessing departures from perfect symmetry when N is a 2 × 2 contingency table, S 1 has two equal singular-values whose squared values are
λ 1 2 = λ 2 2 = s 21 2 1 = 1 2 p 21 p 12 p 21 + p 12
and are the principal inertia values of the first two dimensions of the correspondence plot when analysing a two-way contingency table. Similarly, when symmetry is of concern for a 3 × 3 contingency table, the three principal inertia values are
λ 1 2 = λ 2 2 = CR * 1 2 n and λ 3 2 = 0 .
For both sizes of N , the sum of their squared singular values gives Bowker’s statistic.
When analysing the symmetry of a two-way table using the Cressie–Read family of divergence statistics we can consider values of δ 1 . For example, when S δ is of rank 2, then there are two unequal singular values whose squares are
λ 1 2 δ = max s 12 2 δ , s 21 2 δ
λ 2 2 δ = min s 12 2 δ , s 21 2 δ .
Note that when δ = 1 , these squared singular values simplify to (19).
When δ 1 , then (19) and (20) are satisfied only when there exists perfect symmetry between the variables of N (in which case all squared singular values are zero). This is because when δ 1 , S δ is not a skew-symmetric matrix.
We now turn our attention to the construction of the M-dimensional correspondence plot by defining and describing the principal coordinates for each row and column of N .

3.5. Principal Coordinates

When visually portraying the categories of N, define the metric matrix by
D ˜ = D I + D J 2
where D I = diag p i and D J = diag p j . Then, the matrices of row and column principal coordinates are
F δ = D ˜ 1 / 2 A δ Δ δ
G δ = D ˜ 1 / 2 B δ Δ δ ,
respectively. These provide a more general set of principal coordinates than those of Beh and Lombardo [4] (Equations (14) and (15)), who were concerned only with the case when δ = 1 , although their principal coordinates can be obtained by simply substituting δ = 1 into (23) and (24).
Defining the row and column principal coordinates by (23) and (24), respectively, means that the row and column spaces have the same metric that is based on the aggregation of p i j across the two variables, irrespective of the value of δ . Such an aggregation is performed since (7) relies only on the cell proportions p i j and p j i .
Post-multiplying both sides of (23) by B δ T and simplifying gives us an alternative expression for the row principal coordinates
F δ = D ˜ 1 / 2 S δ B δ .
Similarly, it can be shown that the column principal coordinates can be expressed in terms of S δ such that
G δ = D ˜ 1 / 2 S δ T A δ .
As we have already shown, S δ is not a skew-symmetric matrix unless δ = 1 . In the event that δ = 1 , then
G 1 = D ˜ 1 / 2 A 1 Δ 1 J M T = F 1 J M T
where J M is an M × M block-diagonal and orthogonal skew-symmetric matrix so that
J M T J M = J M J M T = M .

3.6. On the Total Inertia and the Origin

The total inertia of the two-way contingency table can be expressed in terms of the matrices of row and column principal coordinates given by (23) and (24). To show this, suppose we consider the total inertia in terms of the row principal coordinates. Then,
trace F δ T D ˜ F δ = trace D ˜ 1 / 2 A δ Δ δ T D ˜ D ˜ 1 / 2 A δ Δ δ = trace Δ δ A δ T A δ Δ δ = trace Δ δ 2 = CR * δ n .
Similarly, we can also show that the total inertia can be expressed in terms of the column principal coordinates so that
CR * δ n = trace G δ T D ˜ G δ .
Therefore, if there is perfect symmetry between the rows and columns of our two-way contingency table then the total inertia is zero. When this happens, the position of the row and column principal coordinates is located at the origin. Therefore, the origin is interpreted as the point in the low-dimensional space where there is perfect symmetry between the row and column variables. The further a point is away from this origin, the more deviation it has from the null hypothesis of perfect symmetry. When assessing departures from complete independence, assessing the contribution of a row and column point to the association structure can be undertaken using the closed-form equations that yield confidence regions for each point; see Beh [29] and Beh and Lombardo [30] when δ = 1 and Alzahrani, Beh and Stojanovski [31] for other values of δ . Such regions have not yet been developed for studying departures from perfect symmetry and so we leave this for future study.

4. Example 1: Artificial Data

4.1. The Data

To examine how the Cressie–Read family of divergence statistics can be used for the purposes of applying correspondence analysis to visually assess departures from perfect symmetry, we consider the artificial data set given in Table 1. Beh and Lombardo [4] used this contingency table to highlight the features obtained when using Bowker’s statistic and so we focus on showing the features of the correspondence analysis using (8) for δ = 1 , 1 / 2 , and 0. Thus, the numerical foundations of this variant of correspondence analysis uses Pearson’s chi-squared statistic, X S 2 , the Freeman–Tukey statistic T S 2 , and the modified log-likelihood ratio statistic, M S 2 , respectively. In Table 1 the 2 , 1 th cell frequency is 20 + C where C 20 is a constant. When C = 0 , the variables of Table 1 exhibit perfect symmetry, and as C , the departure from perfect symmetry between the variables becomes more apparent. The sample size of Table 1 is n = 680 + C .

4.2. The Family of Divergence Statistics

Since there exists perfect symmetry in all but two cells of Table 1, we only need to confine ourselves to examining the difference between the 2 , 1 th and 1 , 2 th cells. Of course, when there is perfect symmetry, then s 12 δ = s 21 δ and this happens only when C = 0 . We examine the changes in these two values as C and δ change. Therefore, to assess the departure from perfect symmetry in Table 1, we do so by comparing
n · s 12 δ = 1 δ 40 + C 2 40 40 + C δ 1
and
n · s 21 δ = 1 δ 40 + C 2 40 + 2 C 40 + C δ 1
for δ 0 , otherwise
n · s 12 0 = 40 + C 2 ln 40 40 + C
and
n · s 21 0 = 40 + C 2 ln 40 + 2 C 40 + C
using the Box–Cox transformation.
Therefore, when assessing departures from perfect symmetry for the data in Table 1, the Cressie–Read family of divergence statistics can be expressed in terms of C 20 and δ , so that
CR * δ = n s 12 2 δ + s 21 2 δ = 1 δ 2 40 + C 2 40 40 + C δ 1 2 + 40 + 2 C 40 + C δ 1 2 .
We can immediately see that this family of statistics can also be derived by substituting (21) and (22) into (18) (since M = 2 ), yielding the equivalent expression
CR * δ = n λ 1 2 δ + λ 2 2 δ .
Substituting δ = 1 into (27) for Table 1 of
X S 2 = CR * 1 = C 2 40 + C
which is Bowker’s statistic derived by Beh and Lombardo [4] (Equation (21)). Similarly, the Freeman–Tukey statistic, (10), and modified log-likelihood ratio statistic, (11), can be written in terms of C so that
T S 2 = CR * 1 2 = 8 40 + C 1 2 40 + C 40 + 40 + 2 C M S 2 = CR * 0 = 40 + C ln 40 40 + 2 C 40 + C 2 .
A visual representation of X S 2 , T S 2 , and M S 2 versus C 20 , 100 at unitary increments is given in Figure 1; the horizontal line is the quantile of the chi-squared distribution with six degrees of freedom for α = 0.05 , so that χ 0.95 2 6 = 12.5916 . Figure 1 shows that all three statistics are quite similar, especially for values of C 15 , 50 . Note that when C = 0 , these three statistics are all zero showing there is perfect symmetry in Table 1. When C 20 , 0 , all three statistics decrease to zero and then increase for C > 0 . Therefore, there is a minimum value of C that leads to the rejection of the null hypothesis of perfect symmetry. We now investigate what this value of C is for X S 2 , T S 2 , and M S 2 .

4.3. On the Departure from Perfect Symmetry

Beh and Lombardo [4] show that for Bowker’s statistic, CR * 1 , there is a statistically significant departure from perfect symmetry at the α level of significance, when
C > χ α 2 6 + χ α 2 6 2 + 160 χ α 2 6 2 ,
where χ α 2 6 is the 1 α quantile of the chi-squared distribution with S S 1 / 2 = 4 4 1 / 2 = 6 degrees of freedom. For example, the minimum value of C when α = 0.05 is 29.59, or 30 when rounded up to an integer value. Hence, the 2 , 1 th cell frequency must be at least 50 to detect any departure from perfect symmetry when performing the test at the 0.05 level of significance using Bowker’s statistic.
There is also a second solution to C that leads to a rejection of the null hypothesis of perfect symmetry. This is when
20 C < χ α 2 6 χ α 2 6 2 + 160 χ α 2 6 2
yielding an upper bound of this interval of C = 17.01 . Thus, there is a rejection of the null hypothesis of perfect symmetry when the 2 , 1 th cell frequency is less than 2.99, or 2 when rounding down to an integer value.
Figure 1 shows fairly similar values of C (and n 11 ) are required when assessing the test of perfect symmetry using T S 2 and M S 2 , although obtaining a simple expression to determine these cell counts, like (28) and (29) do for Bowker’s statistic, is not straightforward. However, numerical methods show what the values of C that produce a statistically significant T S 2 are when C < 15.99 and C > 28.50 , for α = 0.05 and for six degrees of freedom. Thus, when using T S 2 , the values of the 2 , 1 th cell frequency that ensure the null hypothesis of perfect symmetry is rejected at the 5% level of significance are n 11 49 and 0 n 11 4 .
Similarly, numerical methods show that when using M S 2 , the values of C that reject the null hypothesis of perfect symmetry are C < 15.52 and C > 27.94 . Therefore, the values of the 2 , 1 th cell frequency that ensure the null hypothesis of perfect symmetry is rejected at the 5% level of significance are n 11 48 and 0 n 11 4 .
To keep any further analysis of Table 1 simple, we now confine our attention to values of C 0 .

4.4. Features of Correspondence Analysis and Symmetry

4.4.1. The Matrix of Divergence Residuals

We can derive the matrix of divergence residuals, S δ , for Table 1. Based on (25) and (26), this matrix is
S δ = 1 δ 40 + C 2 680 + C 0 40 40 + C δ 1 0 0 40 + 2 C 40 + C δ 1 0 0 0 0 0 0 0 0 0 0 0
when δ 0 . For example, when δ = 1 , then
S 1 = C 2 680 + C 40 + C 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 0
which is a skew-symmetric matrix since s i j 1 s j i 1 , i , j = 1 , , 4 . When δ = 0 and δ = 1 / 2 , then the matrices of divergence residuals are
S 0 = 40 + C 2 680 + C 0 ln 40 40 + C 0 0 ln 40 + 2 C 40 + C 0 0 0 0 0 0 0 0 0 0 0
and
S 1 / 2 = 2 680 + C 0 40 40 + C 0 0 40 + 2 C 40 + C 0 0 0 0 0 0 0 0 0 0 0 ,
respectively. These two matrices are not skew-symmetric matrices since s 12 δ s 21 δ for δ = 0 and 1/2 unless C = 0 . In this case, there is perfect symmetry in Table 1 so that s 12 δ = s 21 δ = 0 .

4.4.2. The Singular Values

The structure of the 4 × 4 matrix S δ given by (30) is identical to the 2 × 2 matrix obtained by removing the zero rows and columns of the matrix. Beh and Lombardo [4] (Appendix A) derived the two singular values of S 1 for Table 1 and showed them to be
λ 1 1 = λ 2 1 = C 2 680 + C 40 + C
for C > 0 and both zero when C = 0 . When δ 1 , the two singular values are not equivalent since, for these δ values, S δ is not a skew-symmetric matrix. However, the two singular values are approximately equivalent when δ 1 . When δ 0 and C 0 , the two singular values of Table 1 are
λ 1 δ = s 12 δ n = 1 δ 40 + C 2 680 + C 40 40 + C δ 1
λ 2 δ = s 21 δ n = 1 δ 40 + C 2 680 + C 40 + 2 C 40 + C δ 1
so that λ 1 δ λ 2 δ when C 0 . Therefore, while the 4 × 4 diagonal matrix of eigen-values consists of zero values for the 3 , 3 th and 4 , 4 th elements, the 2 × 2 matrix of non-zero eigen-values is
Δ δ = 40 + C 1 / 2 δ δ 2 680 + C 40 + C δ 40 δ 0 0 40 + 2 C δ 40 + C δ
so that both singular values remain positive when C > 0 . For example, when δ = 1 these two singular values simplify to (31). Similarly, when δ = 1 / 2 ,
λ 1 1 2 = s 21 1 / 2 n = 2 680 + C 40 + C 40 λ 2 1 2 = s 21 1 / 2 n = 2 680 + C 40 + 2 C 40 + C .
When δ = 0 , applying the Box–Cox transformation to (32) and (33) yields, for C 0 , the two singular values
λ 1 0 = s 12 0 n = 40 + C 2 680 + C ln 40 40 + C λ 2 0 = s 21 0 n = 40 + C 2 680 + C ln 40 + 2 C 40 + C .
Figure 2 displays λ 1 δ versus C 0 , 100 for δ = 0 , 1 / 2 , and 1, while Figure 3 shows λ 2 δ versus C 0 , 100 ; the vertical axis of both figures are identically scaled to enable an easy comparison of the two singular values. These two figures show that for all values of C, λ 1 1 = λ 2 1 as expected since S 1 is a skew-symmetric matrix. It also shows that λ 1 δ = λ 2 δ = 0 when C = 0 . A comparison of Figure 2 and Figure 3 shows that λ 1 δ > λ 2 δ for C > 0 .
Figure 2 shows that for λ 1 δ , as δ moves from zero to one the singular value decreases in magnitude for all C. However, the values of λ 2 δ increase as δ goes from zero to one, although any difference between values of λ 2 δ for a given C > 0 is not as large as the differences observed between the λ 1 δ values.
Suppose we define λ Diff δ = λ 1 δ λ 2 δ , so that from (34),
λ Diff δ = 40 + C 1 / 2 δ δ 2 680 + C 2 40 + C δ 40 + 2 C δ 40 δ 0
for all values of δ ; note that this difference is zero when δ = 1 . This difference is also zero when C = 0 irrespective of the choice of δ . A plot of this difference versus C 0 , 100 is given in Figure 4. It confirms that λ 1 1 = λ 2 1 while the difference between the two singular values is at its largest when δ = 0 . Thus, LRA will produce a more heavily dominant first dimension than its second dimension when compared with the correspondence analysis approach that uses Bowker’s statistic. In fact, Figure 2 and Figure 3 show that when δ = 0 , the first singular value is larger than the first singular value when performing an HDD and a correspondence analysis to assess departures from perfect symmetry. Therefore, the first dimension of an LRA always accounts for a larger proportion of any departure from perfect symmetry than that of the HDD and correspondence analysis.

4.4.3. Principal Coordinates

To derive the row and principal coordinates, (23) and (24), we first need to determine D ˜ . Beh and Lombardo [4] (p. 11) showed that for Table 1
D ˜ = 200 + C 2 680 + C 0 0 0 0 400 + C 2 680 + C 0 0 0 0 150 680 + C 0 0 0 0 230 680 + C
so that
D ˜ 1 / 2 = 680 + C 2 200 + C 0 0 0 0 2 400 + C 0 0 0 0 1 150 0 0 0 0 1 230 .
We also have the matrices of left and right singular vectors which are
A δ = 1 0 0 1 0 0 0 0 and B δ = 0 1 1 0 0 0 0 0
when δ 0 , 1 , and when δ 1 ,
A δ = 0 1 1 0 0 0 0 0 and B δ = 1 0 0 1 0 0 0 0 .
When δ 0 , 1 , using (34), (35), and A δ from (36), the elements of the matrix of row principal coordinates, (23), can be expressed in terms of δ and C so that
F δ = 40 + C 1 / 2 δ δ 40 + C δ 40 δ 200 + C 0 0 40 + 2 C δ 40 + C δ 400 + C 0 0 0 0 .
Therefore, changing δ and C does not influence the position of the principal coordinates of the third and fourth rows of Table 1. This make sense since there is perfect symmetry for these two rows and thus their position in the correspondence plot is at the origin. Note that for δ 0 , 1 , the 1 , 1 th and 2 , 2 t h elements of F δ , denoted by f 11 C , δ and f 22 C , δ , respectively, are both negative for C > 0 . The link between them is
f 11 C , δ f 22 C , δ = K C , δ 400 + C 200 + C > 400 + C 200 + C
where
K C , δ = 1 40 40 + C δ 40 + 2 C 40 + C δ 1 > 1
for δ 0 , 1 and C > 0 . Thus, the magnitude of f 11 C , δ is always at least 400 + C / 200 + C times larger than the magnitude of f 22 C , δ for all δ 0 , 1 . For example, when C = 50 in Table 1, the lower bound of this ratio is 450 / 250 = 3 / 5 = 1.3416 and this occurs as δ 1 . Therefore, when C > 0 , f 11 C , δ always lies at least 1.3416 times further from the origin than f 22 C , δ . Thus, row 1 of Table 1 contributes more to any departure from perfect symmetry than row 2, irrespective of the choice of δ . When δ = 1 , K C , 1 = 1 . Also, when δ = 1 , the link between f 12 C , 1 and f 21 C , 1 can be established using (37) instead of (36) and is
f 12 C , 1 f 21 C , 1 = 400 + C 200 + C .
This is identical to the ratio derived by Beh and Lombardo [4] (Section 6.3) when using Bowker’s statistic to assess departures from perfect symmetry.
We can also obtain similar expressions for the column principal coordinates. Substituting (34), (35), and B δ from (36), into (24) leaves us with
G δ = 40 + C 1 / 2 δ δ 0 40 + 2 C δ 40 + C δ 200 + C 40 + C δ 40 δ 400 + C 0 0 0 0 0 .
Note that the choice of δ and C does not influence the position of the third and fourth columns in the two-dimensional correspondence plot, where they lie at the origin. This makes sense since these columns of Table 1 are perfectly symmetrical with the third and fourth rows of the contingency table. Something else to note is that the 1 , 2 th element of G δ , denoted by g 12 δ , is negative for δ 0 , 1 . Also, the 2 , 1 th element of G δ , denoted by g 21 δ , is always positive for these values of δ . Therefore, the ratio of these two coordinates is always negative and is
g 12 C , δ g 21 C , δ = 1 K C , δ 400 + C 200 + C .
Therefore,
K C , δ = f 11 C , δ f 22 C , δ · g 21 C , δ g 12 C , δ
showing that the relationship between the first and second row and column principal coordinates remains constant for some given value of C and δ .

4.5. The Correspondence Plots

Figure 5 gives the correspondence plot of Table 1 for δ = 1 , 1 / 2 , and 0; these were constructed with X S 2 , T S 2 , and M S 2 , respectively, as their numerical foundation, with C = 50 .
Suppose we consider first the correspondence plot, (Figure 5a). It shows that R3, R4, C3, and C4 are located at the origin. This should not be surprising for two related reasons: (1) there is perfect symmetry between R3 and C3, and between R4 and C4, and (2) these rows and columns are not influenced by the magnitude of C. Thus, these four categories of Table 1 play no part in determining the magnitude of Bowker’s statistic. Instead, X B 2 is influenced solely by the row categories R1 and R2, and the column categories C1 and C2, since C impacts on the symmetry (or lack thereof) of the 1 , 2 th and 2 , 1 th cell frequencies of Table 1. However, there is a noticeable difference in the position of R1 and C1 showing that there is a large departure from perfect symmetry between these categories; a feature present because C = 50 . Similarly, R2 and C2 are situated at quite a distance from each other, showing the influence of C on their position in the correspondence plot. However, since this distance appears shorter than between R1 and C1, this shows the influence of C impacts more on the symmetry between R1 and C1 than it does on the symmetry between R2 and C2.
The configurations of points in Figure 5b,c are quite similar, although they appear quite different when compared with the configuration of points in Figure 5a. However, since λ 1 1 = λ 2 1 , the configuration of points in Figure 5a remains unchanged if it is rotated clockwise 90 degrees and reflected along the first dimension. Doing so produces a configuration of points that is comparable to that in Figure 5b,c and, since M = 2 for our three values of δ , the three correspondence plots in Figure 5 depict all of the departures that exist from perfect symmetry. The only noticeable difference between the three plots is the percentage of the total inertia accounted for by the two dimensions. While all three plots display 100% of the departures from perfect symmetry (and are therefore excellent visual depictions) the first dimension is very much the most dominant when δ = 0 , accounting for 77.1% of M S 2 , while 64.5% of T S 2 is accounted for along this dimension when δ = 1 / 2 . This confirms the findings in our discussion of Figure 4.

5. Example 2: Pre- and Post-Courtship Behaviour of Bitterlings

5.1. The Data

We now move away from the analysis in Section 4 of the artificial contingency table and turn our attention to a more practical application. Consider Table 2, where S = 12 , which originally comes from the extensive study of Wiepkema [18] (Table II). The data concern the pre- and post-courtship behaviour of male bitterlings (Rhodeus amarus Bloch), a small European fish where the behaviour was classified according to 12 traits. Here, we used the (pre/POST)-courtship labelling convention that is an adaptation of the one used by Wiepkema [18] and van der Heijden, de Vries and van Hooff [32]: jerking (jk/JK), turning beats (tu/TU), head butting (hb/HB), chasing (cs/CS), fleeing (fl/FL), quivering (qu//QU), leading (le/LE), head-down posture (hd/HD), skimming (sk/SK), snapping (sn/SN), chafing (cf/CF), and fin-flickering (ff/FF).
Table 2 was the subject of a classical correspondence analysis performed by van der Heijden, de Vries and van Hooff [32], where departures from complete independence were assessed. Given the symmetric nature of the variables, we perform a correspondence analysis using (8) to assess any departures from perfect symmetry that may exist in the data.

5.2. Test of the Departure from Perfect Symmetry

Of the 144 cells in Table 2, there are 22 zero cell frequencies (or 15.3% of the cells). The effect of this is that there are 16 values of p i j + p j i that are zero, which means that (8) involves 16 instances where a division by zero occurs. To overcome this problem, 0.01 was added to each cell of the contingency table. Doing this led to a Bowker’s statistic, (9), of 277.801, while (10) and (11) were 333.9 and 671.0, respectively. With 12 12 1 / 2 = 66 degrees of freedom, these three statistics have a p-value that is less than 0.0001. Therefore, there is enough evidence in Table 2 to conclude that there is a statistically significant departure from perfect symmetry. That is, there is at least one of the 12 pairings of the pre- and post-courtship behaviour that is statistically different.

5.3. On the Divergence Residuals

One may evaluate where these departures from perfect symmetry lie by observing the elements of S δ . Table 3 gives these residuals for δ = 1 , 1 / 2 , and 0. Note that since all diagonal elements of S δ are zero, they have been omitted from Table 3. Those residuals designated “<0.001” are residuals lying within the interval ± 0.0000001 , 0.0001 .
The largest (negative and positive) divergence residuals for our three values of δ appear in bold text in Table 3. We can see that the largest positive residuals are for the pre- and post-courtship pairs (hd, LE), (sk, HD), and (qu, SK). These combinations reflect that there are more observations in these cells than what would be expected if there were perfect symmetry between the variables. For example, for (LE, hd), the observed cell count is 167 while the expected number of observations under perfect symmetry is 167 + 7 / 2 = 87 . On the other hand, the largest negative residuals are for the pairs (QU, sk), (HD, le), and (SK, hd) and reflect those cells where the observed cell count is smaller than what is expected under perfect symmetry. This can be see with the (HD, le) pairing, where the observed cell count is 7 and the expected number of observations under perfect symmetry is 87 (as we showed above). Therefore, these three pairs of pre- and post-courtship behaviour are the reverse of those pairings with a large positive divergence residual.
Table 3 also shows that s i j 1 = s j i 1 for all i , j = 1 , 2 , , 12 . For our other two values of δ , there is either perfect or near-perfect symmetry since the i , j th and j , i th divergence residuals are of the same or similar magnitude (differing only in their sign). However, there are clear differences in the magnitude of some of these residuals. For example s 87 0 = 0.132 (corresponding to the (HD/le) pair) while s 78 0 = 0.046 (corresponding to the (LE/hd) pair). Comparing these divergence residuals shows that the negative interaction between HD and le is about three times greater than the positive interaction between hd and LE.
The similarities, and differences, in these divergence residuals can be visualised using a correspondence plot. We now turn our attention to the correspondence plot of Table 2 when δ = 1 , 1 / 2 , and 0.

5.4. Visualising the Departures from Perfect Symmetry

To visualise where the departures from perfect symmetry exist, we constructed the correspondence plot using the principal coordinates of (23) and (24) for δ = 1 , 1 / 2 , and 0. These plots are given in Figure 6, where departures from perfect symmetry were assessed using the statistics (9)–(11) for δ = 1 , 1 / 2 , and 0, respectively. These three correspondence plots provide an excellent visual depiction of departures from perfect symmetry in Table 2 since they all account for about 84% of the total inertia calculated using X S 2 , T S 2 , and M S 2 .
The first thing to note about the configuration of points in the three correspondence plots of Figure 6 is that there is a large cluster of points that lie close to the origin. In fact, most of the categories of Table 2 lie at or near the origin with only a few categories that lie at a distance from the origin. Therefore, the three plots of Figure 6 show that most of the categories of Table 2 are fairly consistent with what is expected under perfect symmetry. Note that we are not saying here that all of the categories located in close proximity to the origin are perfectly symmetric. This can be achieved by determining the 100 1 α % confidence region for each category (for some level of significance α ), which is beyond the scope of this paper. However, when assessing departures from complete independence, such regions were recently developed by Alzahrani, Beh and Stojanovski [31] and were based on those described in Beh [29] and Beh and Lombardo [30].
We now turn our attention to those categories that are located relatively far from the origin. The three plots of Figure 6 show these to be le/LE (pre- and post-courtship leading), sk/SK (pre- and post-courtship skimming), and hd/HD (pre- and post-courtship head-down posture). Therefore, it is these three behaviours that deviate the most from what would be expected if there were perfect symmetry in Table 2 and are the dominant source for why the p-value of (9)–(11) is very small. Interestingly, these are three of the four behaviours that Wiepkema [18] (p. 131) and van der Heijden, de Vries and van Hooff [32] (p. 56) note as being the sexual factors that underlay bitterling courtship behaviour. The fourth trait they identified was quivering (qu/QU). Note that there is a relatively large negative divergence residual between sk and QU in Table 3, which suggests that a pre-courtship skimming behaviour is unlikely to lead to a quivering post-courtship behaviour. While QU lies relatively close to the origin for all four values of δ , it does lie at a distance from sk. However, there are many other post-courtship behaviours that lie close to the origin of their correspondence plot, and hence at a distance from sk, and have a relatively small divergence residual. So is there really an under-count of pre-courtship skimming behaviour and post-courtship quivering? While adding a third dimension does not add a great deal to our visual display of the departures from perfect symmetry, they do show, for our three δ values, that the proximity of sk from the origin is matched by the proximity of le and/or hd (depending on the choice of δ ). Therefore, the third dimension does add additional context to the differences highlighted in Table 3 between sk and QU.
Suppose we now discuss other courtship behaviours and pairs that are located relatively far from each other. The first thing to point out here is that for our three values of δ , LE, HD, and SK are all located in different parts of their correspondence plot. This suggests that the post-courtship behaviours of leading, head-down posture, and skimming all contribute differently to the lack of perfect symmetry in Table 2. So too are their pre-courtship behaviours le, hd, and sk. Interestingly, each of these three pre-courtship behaviours is not followed by their post-courtship behaviour. That is, for example, a pre-courtship display of leading is not followed by a post-courtship display of leading. In fact, Figure 6 shows that the differences between these three courtship behaviours is quite consistent.
While there are differences in pre- and post-courtship behaviours there are also some clearly defined pairings that can be identified by observing where departures from perfect symmetry exist. These are for the pairings of (hd, LE) and (sk, HD) when δ = 1 and 1/2; recall that the divergence residuals for these pairs in Table 3 are relatively large and positive. This suggests that when assessing the departures from perfect symmetry using the statistics (9) and (10), a pre-courtship display of head-down posture is followed by a leading post-courtship display, while a pre-courtship display of skimming is followed by a post-courtship display of head-down posture. Only when δ = 0 does there appear to be quite a difference between sk and HD; in fact, Figure 6(a) shows that a pre-courtship display of skimming is equally likely to lead to a post-courtship behaviour of head-down posture and skimming, although the link between the (sk, SK) and (sk, HD) pairs is not strong when δ = 0 .

6. Discussion

When numerically assessing departures from perfect symmetry one need not be confined to Bowker’s statistic [9], defined here by (9). There are a range of alternative statistics that can be considered and have been available for many decades; here, we focused our attention on the Freeman–Tukey statistic, T S 2 , and the modified log-likelihood ratio statistic M S 2 . These statistics are special cases of the Cressie–Read family of divergence statistics, defined by (2), as well as the second-order Taylor series approximation of this family; see (3).
This paper demonstrates how (3) can be used as the numerical foundations for performing a correspondence analysis to visualise departures from perfect symmetry. A special case of this family is when δ = 1 , leading to the correspondence analysis technique recently described by Beh and Lombardo [4]. While we have discussed how any value of δ can be considered when performing this analysis, there are advantages in considering δ = 1 , δ = 1 / 2 , and δ = 0 . With such flexibility in the choice of δ , one may well ask what is the most appropriate choice of δ to use? We discussed this issue [11] (Section 8) when showing the links between (4) and the correspondence analysis when assessing departures from complete independence. We described in that paper that the choice of δ may depend on many factors, including “the structure of the data, the output that is generated from the analysis or the ease and interpretability that a value of δ provides” (p. 38). However, there are other factors that may impact on the choice of δ . As the applications have shown, one may wish to choose the value of δ that yields the greatest percentage of the total inertia in a two-dimensional, say, correspondence plot. This was also advocated by Cuadras and Cuadras [14] (p. 72) but depends greatly on the data structure that is being assessed for departures from perfect symmetry. One may consider δ = 1 to be an ideal choice for numerous reasons including that (1) it leads to the more traditional correspondence analysis, (2) the total inertia is measured using the well-known and well-understood Bowker’s statistic, and (3) the first two dimensions account for the same percentage of the total inertia. This third reason also means that the analyst is provided with flexibility to rotate and/or reflect the configuration of points around either dimension without affecting the general interpretability of the configuration. As the application to Table 2 also shows, of the three values we considered, δ = 1 also led to the greatest percentage of the total inertia being visualised. However, this value of δ does not always produce a two-dimensional (for example) correspondence plot that reflects the greatest percentage of the total inertia. When the cell frequencies are small, there may be alternatives to δ = 1 that are more appropriate, such as δ = 1 / 2 . Beh and Lombardo [11] recommended this choice of δ when assessing departures from complete independence for several reasons including that (1) it is the ideal choice of δ when dealing with the presence of over-dispersion in the contingency table, (2) it leads to a well-known and well-documented measure of association (the Freeman–Tukey statistic), and (3) it is guaranteed to lead to non-negative estimates of n i j when reconstituting these values based on the information reflected in a low-dimensional correspondence plot (unlike δ = 1 ). However, the need to detect the presence of over-dispersion when assessing departures from perfect symmetry is rare as is the need to reconstitute the cell frequencies. Therefore, when assessing departures from perfect symmetry, there is greater flexibility in how to choose δ . Since the choice of δ does not impact on the properties of the correspondence analysis performed, a sensible choice of δ is the value that visually depicts the greatest percentage of the association; in many practical cases, this value may well be close to δ = 1 / 2 or δ = 1 , although this does depend on the data being analysed.
The next step in the evolution of this method of correspondence analysis is to derive the confidence regions alluded to in Section 3.6 for visualising those categories that are statistically significant contributors to the global measure of the departure from perfect symmetry. Such regions expand upon those describe by Alzahrani, Beh and Stojanovski [31] and complement the correspondence analysis framework which uses the Cressie–Read family of divergence statistics. Another step in this evolution is to recognise that this family is a special case of a more general family discussed by Csisźar [33]. These involves information theoretic measures, such as the Kullback–Leibler divergence; see also Kharazmi and Contreras-Reyes [34] who introduced various new information measures. Therefore, visually assessing departures from perfect symmetry using a correspondence analysis may be expanded to incorporate these types of measures. We shall leave these, and other further developments of the method of correspondence analysis outlined in this paper, for future work.

Author Contributions

Methodology, E.J.B. and R.L.; Software, E.J.B.; Validation, E.J.B. and R.L.; Formal analysis, E.J.B.; Investigation, R.L.; Writing—original draft, E.J.B. and R.L.; Writing—review & editing, E.J.B. and R.L.; Visualization, E.J.B. and R.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data in Table 1 are artificial and come from Beh and Lombardo [4]. The data in Table 2 are from Wiepkema [18] and also appear in van der Heijden, de Vries and van Hooff [32].

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Beh, E.J.; Lombardo, R. Correspondence Analysis: Theory, Practice and New Strategies; Wiley: Chichester, UK, 2014. [Google Scholar]
  2. Greenacre, M.J. Theory and Applications of Correspondence Analysis; Academic Press: London, UK, 1984. [Google Scholar]
  3. Lebart, L.; Morineau, A.; Warwick, K.M. Multivariate Descriptive Statistical Analysis: Correspondence Analysis and Related Techniques for Large Matrices; Wiley: New York, NY, USA, 1984. [Google Scholar]
  4. Beh, E.J.; Lombardo, R. Visualising departures from symmetry and Bowker’s X2 statistic. Symmetry 2022, 14, 1103. [Google Scholar] [CrossRef]
  5. Greenacre, M. Correspondence analysis of square asymmetric matrices. J. R. Stat. Soc. Ser. C Appl. Stat. 2000, 49, 297–310. [Google Scholar] [CrossRef]
  6. Bove, G. Asymmetric multidimensional scaling and correspondence analysis for square tables. Stat. Appl. 1992, 4, 587–598. [Google Scholar]
  7. Constantine, A.G.; Gower, J.C. Graphical representation of asymmetry. Appl. Stat. 1978, 27, 297–304. [Google Scholar] [CrossRef]
  8. Gower, J.C. The analysis of asymmetry and orthogonality. In Recent Developments in Statistics; Barra, J.R., Brodeau, F., Romer, G., van Cutsem, B., Eds.; North-Holland: Amsterdam, The Netherlands, 1977; pp. 109–123. [Google Scholar]
  9. Bowker, A.H. A test for symmetry in contingency tables. J. Am. Stat. Assoc. 1948, 43, 572–598. [Google Scholar] [CrossRef] [PubMed]
  10. Cressie, N.A.C.; Read, T.R.C. Multinomial goodness-of-fit tests. J. R. Stat. Soc. Ser. B Methodol. 1984, 46, 440–464. [Google Scholar] [CrossRef]
  11. Beh, E.J.; Lombardo, R. Correspondence analysis and the Cressie-Read family of divergence statistics. Int. Stat. Rev. 2024, 92, 17–42. [Google Scholar] [CrossRef]
  12. Greenacre, M. Power transformations in correspondence analysis. Comput. Stat. Data Anal. 2009, 53, 3107–3116. [Google Scholar] [CrossRef]
  13. Greenacre, M. Log-ratio analysis is a limiting case of correspondence analysis. Math. Geosci. 2010, 42, 129–134. [Google Scholar] [CrossRef]
  14. Cuadras, C.M.; Cuadras, D. A parametric approach to correspondence analysis. Linear Algebra Its Appl. 2006, 417, 64–74. [Google Scholar] [CrossRef]
  15. Cuadras, C.M.; Cuadras, D.; Greenacre, M.J. A comparison of different methods of representing categorical data. Commun. Stat.-Simul. Comput. 2006, 35, 447–459. [Google Scholar] [CrossRef]
  16. Neyman, J. Contributions to the theory of the χ2 test. In Proceedings of the Berkeley Symposium on Mathematical Statistics and Probability; Neyman, J., Ed.; Statistical Laboratory of the University of California: Berkeley, CA, USA, 1949; pp. 239–273. [Google Scholar]
  17. Freeman, M.F.; Tukey, J.W. Transformations related to the angular and square root. Ann. Math. Stat. 1950, 21, 607–611. [Google Scholar] [CrossRef]
  18. Wiepkema, P.R. An ethological analysis of the reproductive behaviour of the bitterling (Rhodeus amarus Bloch). Arch. NÉerlandaises Zool. 1961, 14, 103–199. [Google Scholar] [CrossRef]
  19. Beh, E.J.; Lombardo, R.; Alberti, G. Correspondence analysis and the Freeman-Tukey statistic: A study of archaeological data. Comput. Stat. Data Anal. 2018, 128, 73–86. [Google Scholar] [CrossRef]
  20. Agresti, A. Categorical Data Analysis, 3rd ed.; Wiley: Hoboken, NJ, USA, 2013. [Google Scholar]
  21. Anderson, E.B. The Statistical Analysis of Categorical Data; Springer: Berlin/Heidelberg, Germany, 1991. [Google Scholar]
  22. Tomizawa, S.; Seo, T.; Yamamoto, H. Power-divergence-type measure of departure from symmetry for square contingency tables that have nominal categories. J. Appl. Stat. 1998, 25, 387–398. [Google Scholar] [CrossRef]
  23. Ando, S.; Hoshi, H.; Ishii, A.; Tomizawa, S. A generalized two-dimensional index to measure the degree of deviation from double symmetry in square contingency tables. Symmetry 2021, 13, 2067. [Google Scholar] [CrossRef]
  24. Altun, G.; Saraçbaşi, T. Determination of model fitting with power-divergence-type measure of departure from symmetry for sparse and non-sparse contingency tables. Commun. Stat.-Simul. Comput. 2022, 51, 4087–4111. [Google Scholar] [CrossRef]
  25. Bishop, Y.M.; Fienberg, S.E.; Holland, P.W. Discrete Multivariate Analysis: Theory and Practice; MIT Press: Cambridge, MA, USA, 1975. [Google Scholar]
  26. Haberman, S.J. Analysis of Qualitative Data, Volume 2: New Developments; Academic Press: New York, NY, USA, 1979. [Google Scholar]
  27. Ireland, C.T.; Ku, H.H.; Kullback, S. Symmetry and marginal homogeneity of an r × r contingency table. J. Am. Stat. Assoc. 1969, 64, 1323–1341. [Google Scholar] [CrossRef]
  28. Ward, R.C.; Gray, L.J. Eigensystem computation for skew-symmetric matrices and a class of symmetric matrices. ACM Trans. Math. Softw. 1978, 4, 278–285. [Google Scholar] [CrossRef]
  29. Beh, E.J. Elliptical confidence regions for simple correspondence analysis. J. Stat. Plan. Inference 2010, 140, 2582–2588. [Google Scholar] [CrossRef]
  30. Beh, E.J.; Lombardo, R. Confidence regions and approximate p-values for classical and non symmetric correspondence analysis. Commun. Stat.-Theory Methods 2015, 44, 95–114. [Google Scholar] [CrossRef]
  31. Alzahrani, A.; Beh, E.J.; Stojanovski, E. Confidence regions for simple correspondence analysis using the Cressie-Read family of divergence statistics. Electron. J. Appl. Stat. Anal. 2023, 16, 423–448. [Google Scholar]
  32. van der Heijden, P.G.M.; de Vries, H.; van Hooff, J.A.R.A.M. Correspondence analysis of transition matrices, with special attention to missing entries and asymmetry. Anim. Behav. 1990, 40, 49–64. [Google Scholar] [CrossRef]
  33. Csisźar, I. Information-type measures of difference of probability distributions and indirect observations. Stud. Sci. Math. Hung. 1967, 2, 229–318. [Google Scholar]
  34. Kharazmi, O.; Contreras-Reyes, J.E. Deng-Fisher information measure and its extensions: Application to Conway’s Game of Life. Chaos Solitons Fractals 2023, 174, 113871. [Google Scholar] [CrossRef]
Figure 1. X S 2 , T S 2 , and M S 2 versus C 20 , 100 at unitary increments for Table 1.
Figure 1. X S 2 , T S 2 , and M S 2 versus C 20 , 100 at unitary increments for Table 1.
Symmetry 16 00830 g001
Figure 2. λ 1 δ versus C 0 , 100 for Table 1; δ = 1 , 1 / 2 , and 0.
Figure 2. λ 1 δ versus C 0 , 100 for Table 1; δ = 1 , 1 / 2 , and 0.
Symmetry 16 00830 g002
Figure 3. λ 2 δ versus C 0 , 100 for Table 1; δ = 1 , 1 / 2 , and 0.
Figure 3. λ 2 δ versus C 0 , 100 for Table 1; δ = 1 , 1 / 2 , and 0.
Symmetry 16 00830 g003
Figure 4. λ Diff δ versus C 0 for Table 1; δ = 1 , 1 / 2 , and 0.
Figure 4. λ Diff δ versus C 0 for Table 1; δ = 1 , 1 / 2 , and 0.
Symmetry 16 00830 g004
Figure 5. Correspondence plot for Table 1 with C = 50 , where (a) δ = 1 , (b) δ = 1 / 2 , and (c) δ = 0 .
Figure 5. Correspondence plot for Table 1 with C = 50 , where (a) δ = 1 , (b) δ = 1 / 2 , and (c) δ = 0 .
Symmetry 16 00830 g005
Figure 6. Correspondence plot for Table 2 where (a) δ = 0 , (b) δ = 1 / 2 , and (c) δ = 1 .
Figure 6. Correspondence plot for Table 2 where (a) δ = 0 , (b) δ = 1 / 2 , and (c) δ = 1 .
Symmetry 16 00830 g006
Table 1. A near-symmetric artificial contingency table where C is a non-negative integer.
Table 1. A near-symmetric artificial contingency table where C is a non-negative integer.
Columns
RowsC1C2C3C4Total
R110203040100
R220 + C506070200 + C
R330602040150
R440704080230
Total100 + C200150230680 + C
Table 2. The pre- and post-courtship behaviour of bitterlings (Rhodeus amarus Bloch; source: [18]).
Table 2. The pre- and post-courtship behaviour of bitterlings (Rhodeus amarus Bloch; source: [18]).
Pre-Courtship Behaviour
Post-jktuhbcsflqulehdsksncfffTotal
JK65412817256272512804614181169
TU1011326227511110859362
HB1716219713002505014181412693
CS6022152135080431615124467
FL192004191902017511494
QU3611851278911929526701141386
LE4000057167730800309
HD22940375245717128753813897
SK3273801208134192849363
SN42217162070116792251212503
CF18310136508024979193
FF273651013018010829129
Total115736668146250413773149003715221801316965
Table 3. The matrix of divergence residuals, S δ , for δ = 1 , 1 / 2 , and 0.
Table 3. The matrix of divergence residuals, S δ , for δ = 1 , 1 / 2 , and 0.
Pre-Courtship Behaviour
Post- δ jktuhbcsflqulehdsksncfff
1 0.015<0.001−0.0030.010−0.012−0.0110.007−0.0150.004−0.006−0.011
JK1/2 0.015<0.001−0.0030.010−0.013−0.0140.007−0.0270.004−0.006−0.012
0 0.014<0.001−0.0030.009−0.013−0.0170.007−0.0740.004−0.006−0.013
1−0.015 00.0060.01000.0080.004−0.0120.0160.0060.015
TU1/2−0.016 00.0060.00900.0070.004−0.0220.0140.0060.013
0−0.016 00.0060.00800.0060.004−0.0560.0130.0050.012
1<0.0010 −0.01100.00900.0090.0130.0010.0070.012
HB1/2<0.0010 −0.01100.00900.0090.0120.0010.0070.011
0<0.0010 −0.01200.00800.0080.0110.0010.0060.010
10.003−0.0060.011 00.00700.006−0.025−0.002−0.002−0.003
CS1/20.003−0.0060.011 00.00700.006−0.029−0.002−0.002−0.003
00.003−0.0060.011 00.00600.005−0.033−0.002−0.002−0.003
1−0.010−0.01000 0.0110−0.0100−0.004−0.0030.002
FL1/2−0.010−0.01100 0.0100−0.0110−0.004−0.0030.002
0−0.011−0.01300 0.0100−0.0130−0.004−0.0030.002
10.0120−0.009−0.007−0.011 0.0400.0180.0660−0.0140.002
QU1/20.0110−0.009−0.008−0.011 0.0370.0180.0830−0.0170.002
00.0110−0.010−0.008−0.012 0.0340.0170.1060−0.0230.002
10.011−0.008000−0.040 0.063−0.024−0.00600
LE1/20.010−0.015000−0.044 0.053−0.046−0.00600
00.0090.034000−0.049 0.0460.144−0.00600
1−0.007−0.004−0.009−0.0060.010−0.0180.063 0.063−0.0110−0.008
HD1/2−0.007−0.004−0.009−0.0060.009−0.0190.088 0.058−0.0110−0.008
0−0.008−0.004−0.009−0.0060.008−0.0190.132 0.054−0.0120−0.008
10.0150.012−0.0130.02500.0660.0240.063 0.0260.0170
SK1/20.0120.010−0.0140.02300.0580.0200.070 0.0240.0140
00.0100.008−0.0160.02100.0510.0170.079 0.0210.0120
1−0.004−0.016−0.0010.0020.00400.0060.011−0.026 −0.0170.004
SN1/2−0.004−0.020−0.0010.0020.00400.0060.011−0.031 −0.0190.004
0−0.004−0.024−0.0010.0010.00400.0050.010−0.037 −0.0210.003
10.006−0.006−0.0070.0020.0030.01400−0.0170.017 0.002
CF1/20.006−0.006−0.0070.0020.0020.01200−0.0320.016 0.002
00.006−0.007−0.0080.0020.0020.011000.0900.015 0.002
10.011−0.015−0.0120.003−0.002−0.00200.0080−0.004−0.002
FF1/20.011−0.017−0.0130.003−0.002−0.00200.0070−0.004−0.002
00.010−0.020−0.0150.003−0.002−0.00200.0070−0.004−0.002
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Beh, E.J.; Lombardo, R. Correspondence Analysis for Assessing Departures from Perfect Symmetry Using the Cressie–Read Family of Divergence Statistics. Symmetry 2024, 16, 830. https://doi.org/10.3390/sym16070830

AMA Style

Beh EJ, Lombardo R. Correspondence Analysis for Assessing Departures from Perfect Symmetry Using the Cressie–Read Family of Divergence Statistics. Symmetry. 2024; 16(7):830. https://doi.org/10.3390/sym16070830

Chicago/Turabian Style

Beh, Eric J., and Rosaria Lombardo. 2024. "Correspondence Analysis for Assessing Departures from Perfect Symmetry Using the Cressie–Read Family of Divergence Statistics" Symmetry 16, no. 7: 830. https://doi.org/10.3390/sym16070830

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop