Next Article in Journal
Hybrid Management Strategy for Outsourcing Electromechanical Maintenance and Selecting Contractors in Taipei MRT
Previous Article in Journal
The Wave Equation for a Moving Source and a Moving Receiver
Previous Article in Special Issue
On V-Geometric Ergodicity Markov Chains of the Two-Inertia Systems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Testing and Visualization of Associations in Three-Way Contingency Tables: A Study of the Gender Gap in Patients with Type 1 Diabetes and Cardiovascular Complications

1
Department of Economics, University of Campania “Luigi Vanvitelli”, 81043 Capua (CE), Italy
2
National Institute for Applied Statistics Research Australia (NIASRA), University of Wollongong, Wollongong, NSW 2522, Australia
3
Centre for Multi-Dimensional Data Visualisation (MuViSU), Stellenbosch University, Stellenbosch 7602, South Africa
4
IRCCS MultiMedica, 20138 Milan, Italy
5
CORESEARCH, Center for Outcomes Research and Clinical Epidemiology, 65122 Pescara, Italy
6
Department of Medicine, Sahlgrenska University Hospital, 413 45 Gothenburg, Sweden
7
Center of Registers in Region Västra Götaland, 413 45 Gothenburg, Sweden
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(14), 2186; https://doi.org/10.3390/math12142186
Submission received: 30 May 2024 / Revised: 26 June 2024 / Accepted: 1 July 2024 / Published: 12 July 2024
(This article belongs to the Special Issue Advances of Applied Probability and Statistics)

Abstract

:
Using data from the Swedish National Diabetes Register, this study examines the gender disparity among patients with type 1 diabetes who have experienced a specific cardiovascular complication, while exploring the association between their weight variability, age group, and gender. Fourteen cardiovascular complications have been considered. This analysis is conducted using three-way correspondence analysis (CA), which allows for the partitioning and decomposition of Pearson’s three-way chi-squared statistic. The dataset comprises information organized in a data cube, detailing how weight variability among these patients correlates with a cardiovascular complication, age group, and gender. The three-way CA method presented in this paper allows one to assess the statistical significance of the association between these variables and to visualize this association, highlighting the gender gap among these patients. From this analysis, we find that the association between weight variability, age group, and gender varies among different types of cardiovascular complications.

1. Introduction

Patients diagnosed with type 1 diabetes (T1D) face a heightened risk of developing a range of cardiovascular complications (CVCs) compared to the general population [1,2]. It is known that women with T1D face a greater risk of all-cause mortality and experience more vascular events compared to men [3]. This study assesses gender differences in patients with T1D [4,5,6], focusing on weight variability and age in males and females who experienced a CVC, and examining the associations between these variables in different cohorts.
By utilizing data from the Swedish National Diabetes Register (NDR), this paper explores the links between weight variability, age group, and gender of patients with T1D and a CVC. The CVCs that we focus our attention on are: (1) stroke, (2) myocardial infarction (myo), (3) heart failure hospitalization (HFH), (4) coronary artery bypass graft (CABG), (5) percutaneous coronary intervention (PCI), (6) peripheral artery disease (PAD), (7) renal failure (RF), (8) acute renal disease (ARF), (9) chronic renal failure (CRF), (10) dialysis, (11) glomerular filtration rate of 50 mL/min/1.73 m2 (GFR50), (12) macroalbuminuria (Macr), (13) retinopathy (Ret), and (14) all-cause mortality (Mort).
This population-based cohort study involved 6255 participants (3485 males and 2770 females) with T1D, ranging in age from 21 to 85 years, identified within the NDR. The inclusion period began on 1 July 2003 and concluded on 30 December 2013, during which time participants underwent at least three distinct weight measurements. Weight variability was assessed by computing the coefficient of variability (CV) for weight measurements of each patient and dividing it into quartiles. Subsequently, data concerning each of the 14 CVCs were summarized into a three-way contingency table, stratified by weight variability quartiles, age quartiles, and gender. To study the dependence relationship between the patients’ weight variability, age group and gender, we consider a three-way correspondence analysis (CA) [7,8] based on the partition and decomposition of a three-way index of association, that is, Pearson’s chi-squared statistic [9,10]. Pearson’s three-way chi-squared statistic is orthogonally partitioned into two-way and three-way association terms to determine the dependence relationships between variables. Note that we use the term “association” when analyzing the dependence between variables and the term “interaction” when analyzing the dependence between the categories of the variables. Indeed, three-way CA allows us to decompose the association between the variables and also to visualize the interactions between the diverse categories.
The paper is organized as follows. After introducing the notation to be used in this paper in Section 2, Section 3 and Section 4 provide a brief description of the partition of Pearson’s three-way chi-squared statistic and of the classical three-way CA technique, respectively. Section 5 gives a brief overview of the use and construction of biplots for three-way correspondence analysis. Section 6 describes several illustrative examples demonstrating how the association between weight variability, age group, and gender varies with respect to a given CVC. Section 7 provides some final remarks.

2. Materials and Methods

Before analyzing the data, it is necessary to explain some of the general principles behind three-way CA.

Notation and Terminology

Define an I × J × K (three-way) contingency table by N ̲ , which contains information on the three cross-classified categorical variables observed on n individuals. This contingency table consists of I rows, J columns, and K tubes, and its ( i , j , k )th element is denoted by n i j k . Let p i j k = n i j k / n be the proportion of observations in cell ( i , j , k ) , and  p i , p j , and p k be the i th row, j th column, and  k th tube marginal proportions, respectively, also called expected proportions. The deviations from the three-way independence model are defined as follows:
π i j k = p i j k p i p j p k 1 .
The value π i j k will primarily be referred to as Pearson’s residual for the cell ( i , j , k ) , and its square is a cell’s contribution to the chi-squared statistic X I J K 2 . We define D I to be the I × I diagonal matrix with the row marginal p i as its i , i th element. Similarly, D J and D K are the J × J and K × K diagonal matrices of the column and tube marginal proportions, respectively. Finally, we consider the total inertia or phi-squared statistic, Φ 2 = X I J K 2 / n , which is also equal to the squared norm of the table Π ̲ = ( π i j k ) containing the Pearson residuals so that
Φ 2 = X I J K 2 n = | | Π ̲ | | 2 = i = 1 I j = 1 J k = 1 K p i p j p k p i j k p i p j p k 1 2 .
Using Φ 2 , Carlier and Kroonenberg [7] proposed a three-way CA method employing the Tucker3 model [11], which is the most common generalization of the singular value decomposition of a matrix; see also Kroonenberg (p. 54, [12]). This model defines components for each of the three categorical variables. These components are a linear combination of the original variables, synthesizing the information from the categories of variables. It also consists of a three-way core table containing the three-way equivalent of singular values; see also Lombardo, Beh, and Kroonenberg [8]; Lombardo, van de Velden, and Beh [13]; and Beh and Lombardo (Chapter 11, [14]). The elements of the core table reflect the strengths of the links between the components of the three variables. The decomposition does not impose any restrictions on the components other than the identification constraints of orthogonality.

3. Partitioning the Chi-Squared Statistic

Lombardo, Takane, and Beh [15] discuss a common method for studying the association among three categorical variables based on the orthogonal decomposition of Pearson’s chi-squared statistic, X I J K 2 , and under a variety of hypotheses about the expected proportions (hypothesized probabilities). These partitions are based on the ANOVA-like family-wise partitions of Pearson’s three-way chi-squared statistic. In this paper, we focus on the most common case where these probabilities are estimated using the marginal proportions of the empirical distribution underlying the data. In some cases, however, the probabilities are prescribed by the analyst, because a priori knowledge of the phenomena suggests otherwise [16].
Under the independence assumption, Pearson’s three-way chi-squared statistic can be partitioned into four orthogonal parts, each corresponding to an association term, as detailed, for example, by Lancaster [17]. This partition of X I J K 2 is given by
X I J K 2 = X I J 2 + X I K 2 + X J K 2 + X i n t 2
and can be written in more detail as
X I J K 2 = n i = 1 I j = 1 J p i p j p i j p i p j p i p j 2 + n i = 1 I k = 1 K p i p k p i k p i p k p i p k 2 + n j = 1 J k = 1 K p j p k p j k p j p k p j p k 2 + n i = 1 I j = 1 J k = 1 K p i p j p k p i j k α p i j k p i p j p k 2 ,
where
α p i j k = p i j p k + p i k p j + p j k p i 2 p i p j p k .
The first three terms of Equations (1) and (2) are the two-way association terms obtained by a weighted aggregation of the total association measure across each of the variables. For example, X I J 2 is Pearson’s chi-squared statistic for the row and column variables formed by aggregating across each of the tube categories. The last term, X i n t 2 , represents the trivariate association between all three categorical variables. For further details on this partition, see Carlier and Kroonenberg (p. 358, [7]); Beh and Lombardo (p. 454, [14]); and Lombardo, Takane, and Beh [15].
The X I J K 2 , as well as the four association terms on the right-hand side of Equations (1) and (2), can be tested for (asymptotic) statistical significance since all these terms are asymptotic chi-squared random variables. Specifically, under the null hypotheses that all three variables are independent, the statistical significance of X I J K 2 can be assessed using I 1 J 1 + I 1 K 1 + J 1 K 1 + I 1 J 1 K 1 degrees of freedom. The statistical significance of the first term, X I J 2 , can be evaluated using I 1 J 1 degrees of freedom (df) and the second term, X I K 2 , by using I 1 K 1 df. Similarly, the statistical significance of the third term, X J K 2 , can be assessed using J 1 K 1 df and the fourth term, X I J K 2 , by using I 1 J 1 K 1 df.

4. Modeling Association

When the analyst not only wishes to test the statistical significance of the three-way association but also wants to provide a graphical representation of it, a decomposition of the three-way contingency table is necessary for exploring in more detail the interactions between the categories.

The Tucker3 Decomposition

As indicated in Section 2, the most common three-way generalization of the singular value decomposition within the context of three-way CA is the Tucker3 model proposed by Tucker [11]. The general form of the decomposition is
π i j k = π ^ i j k + e i j k = p = 1 P q = 1 Q r = 1 R g p q r a i p b j q c k r + e i j k ,
where P, Q, and R (with P I , Q J , and R K ) represent the number of columns in the component matrices A , B , and C that are associated with the rows, columns, and tube categories, respectively. The sets of component vectors a p { a i p } , b q { b j q } , and c r { c k r } are orthonormal with respect to D I , D J , and D K , respectively. The quantities g p q r are the elements of the core table and can be interpreted as the generalized or three-way analogs, of the two-way singular values, where its magnitude indicates the strength of the relationship among the components. The magnitude of the squared core element, g p q r 2 , also indicates the contribution of the product of the P, Q, and R components to the fit of the model to the data, allowing the analyst to compute the inertia explained in a two-dimensional plot. Note that, given the orthogonality of the components, the phi-squared statistic or total inertia can be expressed as the sum of squares of the elements of the core table, so that
X I J K 2 n = p = 1 I q = 1 J r = 1 K g p q r 2 .
For more details on Equation (4), see Kroonenberg (pp. 226–227, [12]).
Finally, the element e i j k in Equation (3) is the error of approximation between the observed π i j k and their predicted values, π ^ i j k ; see, for example, Kroonenberg (Chapter 4, [12]. The choice of the number of row, column, and tube components, P, Q, and R, respectively, relies on various methodologies developed by researchers for the Tucker3 model. Studies such as Kiers [18,19], Ten Berg and Kiers [20], Timmerman and Kiers [21], Rocci [22], Kroonenberg [23], and Lombardo, van de Velden, and Beh [13] illustrate this point. Here, we will adopt the strategy of Lombardo, van de Velden, and Beh [13] to choose P, Q, and R. This strategy uses the tunelocal() function in the R package CA3variants (v. 3.3.1). The three-way CA performed in Section 6 uses the CA3variants() function of this package.

5. Biplots in Three-Way Correspondence Analysis

To visualize the association structure among two or more categorical variables, we limit ourselves to the biplot, a technique first introduced by Gabriel [24]. Comprehensive discussions of the variety of biplots that can be constructed, particularly in the context of CA, are available in the works of Carlier and Kroonenberg [7]; Lombardo, Carlier, and D’Ambra [25]; Greenacre [26]; and Beh and Lombardo [27].

Biplots for Three-Way Correspondence Analysis

To illustrate the association structure among the categorical variables of a three-way contingency table, Carlier and Kroonenberg [7,28] explored different biplot techniques. The technique of interest here employs a single component for one variable and the interactively-coded components for the other variables; this biplot is also known as a nested-mode biplot (Sect. 11.5.4, [12]) or interactive biplot (p. 472, [14]). Lombardo, van de Velden, and Beh [13] discuss six variants of the interactive biplot. These variants depend on the variables chosen for the interactive representation and on the computation of the type of coordinates, i.e., standard or principal coordinates [29,30]. Firstly, we can represent the coordinates of the row categories using their A components, while the coordinates of the column and tube categories are represented interactively by the Kronecker product of their B and C components. Secondly, the coordinates of the column categories can be represented by their B components, while the coordinates of the rows and tubes are interactively represented by the Kronecker product of their A and C components. Thirdly, the coordinates of the tube categories can be represented by their C components, with the coordinates of the rows and columns being interactively represented by the Kronecker product of their A and B components.
When the coordinates are multiplied by the core elements, they are referred to as principal coordinates; otherwise, they are standard coordinates. Their features are the same as those derived for biplots in the classical approach to CA [30]; for more information, refer to Greenacre [26] and Beh and Lombardo (Chapter 11, [14]). In particular, the distances between the categories from the same variable are correctly portrayed only when using the principal coordinates.
The interactive coordinates can be expressed in either their standard or principal coordinate form. Therefore, the three previous types of variable representation lead to six types of interactive biplots; for further details, see Lombardo, van de Velden, and Beh (pp. 243–244, [13]).

6. Practical Application: Patients with T1D and CVCs

6.1. Preliminary Discussion of the NDR Data

We utilize data from the NDR to assess and visualize the association between the variables weight variability, age group, and gender among patients with T1D who experienced one of the 14 CVCs outlined in Section 1. We classify their weight variability using four quartiles: the first quartile includes patients with a CV of weight between 0 and 0.018 (Q1: low); the second quartile includes those with a CV of weight between 0.018 and 0.028 (Q2: moderately low); the third quartile includes those with a CV of weight between 0.028 and 0.043 (Q3: middle-high); and the fourth quartile includes those with a CV of weight at least 0.043 (Q4: high). The CV of the weight variable is then cross-classified to create a three-way contingency table with respect to the four age group quartiles (age1 = 21–27 years; age2 = 27–40 years; age3 = 40–52 years; age4 = 52+ years) and to gender (males and females). Consequently, the data are summarized in 14 three-way contingency tables, with each table corresponding to a specific CVC.
Table 1 presents Pearson’s three-way chi-squared statistic for each of the 14 CVCs given in Section 1, as well as their p-value with respect to 24 df. Interestingly, Pearson’s three-way chi-squared statistic does not yield statistically significant results for all 14 CVCs. Rather, it only shows that there is a statistically significant association (at the 1% level of significance) for two CVCs: macroalbuminuria and retinopathy. Therefore, among the 14 CVCs, there is no statistically significant association between weight variability, age group, and gender for any of the six cardiovascular diseases (stroke, myo, HFH, CABG, PCI, and PAD). The interested reader can find the related three-way tables in the Supplementary Materials for this paper.
In the following sections, we discuss the partition and decomposition of Pearson’s chi-squared statistic for the CVCs macroalbuminuria and retinopathy. We focus exclusively on these two three-way contingency tables, since it is only for these CVCs where a statistically significant association exists between weight variability, age group, and gender.

6.2. Patients with TD1 and Macroalbuminuria

The main question we shall address here is:
To what extent is there an association between weight variability, age group, and gender for patients with T1D and macroalbuminuria?
Table 2 presents a cross-classification of weight variability (row variable), age group (column variable), and gender (tube variable) among patients with T1D experiencing macroalbuminuria.
To assess the nature of the association between weight variability, age group, and gender in detail, we partition Pearson’s three-way chi-squared statistic, X I J K 2 , first in accordance with Equation (1) so that three bivariate terms and a trivariate term are produced. Table 3 summarizes the partition of this statistic for Table 2. This includes the partition of phi-squared into four terms: their percentage contribution to phi-squared, the corresponding degrees of freedom (df), the p-value, and the relative size ( T e r m /df) of each partition term (which facilitates a comparison of the chi-squared values derived from different asymptotic chi-squared distributions). In Table 3, looking at Pearson’s three-way statistic X I J K 2 = 73.806 and its p-value, which is less than 0.001 (df = 24), we can conclude that there is clear evidence of a statistically significant association among the three variables of Table 2. Table 3 also shows that the most dominant source of association among the three variables is the association between weight variability and age group, denoted as T e r m I J ; it contributes to over 75% of X I J K 2 and has a p-value that is less than 0.001. While the p-value of X I J K 2 for Table 1 shows there is a statistically significant association between the variables, Table 3 reveals that not all three bivariate terms of the partition yield a statistically significant association between the variables. Specifically, only T e r m I J is statistically significant. Conversely, the associations between weight variability and gender ( T e r m I K ) and between age group and gender ( T e r m J K ) are not statistically significant, nor is the trivariate term ( T e r m I J K ). This means that, while not all sources of associations between the variables are statistically significant, at least one is. Also, while T e r m I J K is not statistically significant, we can still identify categories and joint categories that do provide a statistically significant contribution to the overall association among the three variables. We now turn our attention to addressing this issue for Table 2.
To identify which categories of Table 2 provide a statistically significant contribution to the total association, we calculate the p-value for each quartile of the weight variability variable. We also calculate the p-value of the interaction between the age group and gender variables. These p-values are determined using the approach described in Beh and Lombardo [31]. Their methodology constructs 100 ( 1 α ) % confidence ellipses for points in a low-dimensional plot generated from applying a CA to the data. The methodology also includes ways to calculate excellent p-value approximations for each point when assessing their contribution to the association (or lack thereof) between categorical variables. These p-values are designed to reflect the statistical significance of the distance of a category’s point from the origin, thus determining the statistical significance of the category to the association structure between the categorical variables. They are calculated taking into account the chi-squared statistic, the principal coordinates, the inertia explained by each axis, and the marginal proportions of N ̲ .
Table 4 shows that all the categories of weight variability are statistically significant at the α = 0.01 level, except for Q2. However, only two categories of the age group for males and females show statistical significance at the 1% level; they are “age4” for the males and “age1” for the females.
To visualize the association reflected in the p-values of Table 4, we consider a column–tube biplot with interactive coding of age group (column variable) and gender (tube variable). However, since the T e r m I J is statistically significant (see Table 3), we also portray the row–column biplot that depicts the statistically significant association that exists between weight variability and age group ( T e r m I J ). Figure 1 shows these two biplots: on the left side is the column–tube interactive biplot, and on the right side is the row–column interactive biplot. The column–tube interactive biplot on the left of Figure 1 depicts the three-way association where the age groupgender interaction is depicted using principal coordinates, and weight variability is depicted using standard coordinates. The row–column interactive biplot on the right of Figure 1 portrays the three-way association where the weight variabilityage group interaction is depicted using principal coordinates, and gender is depicted using standard coordinates. The origin of both plots represents where there is complete independence between the variables. Therefore, the farther the interactive categories are from the origin, the stronger the association. The interactive principal coordinates provide an accurate depiction of their distance from the plot’s origin and from each other. Note that the distance of points depicted by their standard coordinates cannot be appropriately evaluated. Only their angle can be considered: when this angle is acute, there is a strong positive interaction with the interactive pair of categories. When the angle is at 90°, there is independence, and when the angle is obtuse, there is a strong negative interaction.
The quality of the interactive biplots in Figure 1 is very good, since they visually describe about 84% (for the column–tube interactive biplot) and 83% (for the row–column interactive biplot) of the total association. Note that the farthest interactive coordinates contribute most prominently to the association. In the column–tube biplot of Figure 1, they are “age4male”, which is positively related to Q1, and “age1female”, which is positively related to Q4. We can also see from Figure 1 that “age1male” is negatively related to Q1, while “age4female” is negatively related to Q4. In the row–column biplot of Figure 1, we can see that “Q1age4” is positively related to males, while “Q4age1” is positively related to females. Furthermore, “Q1age1” is negatively related to males, and “Q4age4” is negatively related to females.
In summary, the biplots in Figure 1 show that males with T1D and macroalbuminuria are significantly characterized by having “low” weight variability and being in the “old” age group (aged 52 years or older). Figure 1 also shows that the females with T1D and macroalbuminuria are characterized as generally having “high” weight variability and being in the “young” age group (aged 21–27 years old).

6.3. Patients with TD1 and Retinopathy

The primary question we address here is:
What is the strength of the association among weight variability, age group, and gender in patients diagnosed with T1D and retinopathy?
Table 5 presents a cross-classification of weight variability (row variable), age group (column variable), and gender (tube variable) among patients experiencing retinopathy, which is the most prevalent microvascular complication experienced by T1D patients, often present during adolescence [32].
Table 6 shows that Pearson’s three-way chi-squared statistic of Table 5  X I J K 2 = 168.271 (df = 24), so that there is a statistically significant association among the three variables (p-value < 0.001 ). To assess the nature of the association among weight variability, age group, and gender in detail for patients with T1D and retinopathy, we examine the partition of Pearson’s three-way chi-squared statistic X I J K 2 in accordance with Equation (1). This partition is summarized in Table 6, which indicates that the three indices of the bivariate association are statistically significant, albeit at different α levels. The T e r m I J indicates a significant association between weight variability and age group, since the p-value is less than 0.001.
The T e r m I K , which represents the association between weight variability and gender, shows that there is a gender difference among patients with T1D and retinopathy at the 5% level of significance, and the T e r m J K also shows a significant association at this level of significance between age group and gender. However, the trivariate term is not statistically significant.
To determine which categories significantly contribute to the association, we compute the approximate p-value [31] for each category within the weight variability and age group variables of the males and females; see Table 7. Table 7 shows that all the categories of weight variability have a p-value that is less than 0.001, showing that their contribution to the association that exists within Table 5 is statistically significant. Additionally, three categories among the age group for males show statistical significance: “age1Male”, “age3Male”, and “age4Male”. However, for the females in the study, only “age1Female” is statistically significant.
Figure 2 illustrates the three-way association. Since the association between age group and gender is statistically significant (see T e r m J K in Table 6), this figure only shows the column–tube interactive biplot. Figure 2 shows that the quality of the column–tube interactive biplot is excellent, since 88% of the total association is depicted. In Figure 2, our attention is directed towards the farthest age groupgender points, which are most relevant for evaluating the association. Unlike what was observed for patients with T1D and macroalbuminuria, the biplot shows that males experiencing T1D and retinopathy are mainly characterized by having “high” weight variability and being in the “young” age group (“age1male”). We can also see in Figure 2 that females with T1D and retinopathy are characterized as having a “low” level of weight variability and being in the “old” age group (“age4female”). This result is consistent with what Nyström et al. [32] discussed regarding patients with T1D and retinopathy, observing that overweight/obesity was more common in males than in females.

7. Discussion

The gender gap in patients with T1D exhibits variability across different cardiovascular complications. The analysis of the NDR data reveals that, among the 14 CVCs considered, the association among weight variability, age group, and gender is statistically significant at the α = 0.01 level of significance for only two complications: macroalbuminuria and retinopathy.
Patients with T1D experiencing macroalbuminuria show distinct patterns. In particular, males are primarily characterized by a “high” level of weight variability (Q4) and being in the “young” age group (“age1male”), while females are characterized by having a “low” weight variability (Q1) and being in the “old” age group (“age4female”).
For those experiencing retinopathy, males are primarily characterized by having a “high” weight variability (Q4) and being in a “young” age group (“age1male”), whereas females are characterized by having “low” weight variability (Q1) and being in the “old” age group (“age4female”), confirming the findings given by Nyström et al. [32].
It is worth noting that, while Pearson’s three-way chi-squared statistic is statistically significant for many of the association terms in Table 3 and Table 6, Table 4 and Table 7 show that the interaction between the categories are not all statistically significant. The computation of p-values in Table 4 and Table 7 helps to assess the significance of the categories, discerning the most relevant to the least.
To visualize the significant associations, we performed three-way CA, an exploratory tool that does not require any distributional assumptions for the variables. This study focused on the gender gap among patients with T1D who experienced a specific CVC, highlighting that a gender gap exists when analyzing the interactions between the diverse categories of weight variability and age class. However, the analysis might be carried out considering other diseases of medical interest to permit personalized treatment, specifically considering gender [33].
Here, the findings also depend on how the weight variability has been classified; we used the CV, which accounts for the differences in averages of the male and female populations. However, other measures of variability might be considered, such as the median absolute difference between subsequent BMI measurements (divided by the interval length starting at the first weight measurement registration), which was used by Nyström et al. [32].

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math12142186/s1, Supplementary Materials: Three-way contingency tables.

Author Contributions

Conceptualization, R.L.; Methodology, R.L., E.J.B.; Resources, G.L.; Data curation, F.P., A.N., B.E., H.K.C., V.P. and R.L.G.; Writing—review & editing, R.L., E.J.B.; Supervision, A.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been funded by the following research projects of the first author: PRIN-2022 SciK-Health: Mapping Scientific Knowledge about Health for Decision-making (Project code: 2022825Y5E_02; CUP: B53D23009750006 M. Aria). PRIN-2022 PNRR “The value of scientific production for patient care in A. H. S. C.” (Project Code: P2022RF38Y; CUP: B53D23026630001 C. Cuccurullo).

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Cai, X.; Li, J.; Cai, W.; Chen, C.; Ma, J.; Xie, Z.; Dong, Y.; Liu, C.; Xue, X.; Zhao, J. Meta-analysis of type 1 diabetes mellitus and risk of cardiovascular disease. J. Diabetes Complicat. 2021, 35, 107833. [Google Scholar] [CrossRef] [PubMed]
  2. Julián, M.T.; Pérez-Montes de Oca, A.; Julve, J.; Alonso, N. The double burden: Type 1 diabetes and heart failure—A comprehensive review. Cardiovasc. Diabetol. 2024, 23, 65. [Google Scholar] [CrossRef] [PubMed]
  3. Bak, J.C.G.; Serné, E.H.; de Valk, H.W.; Valk, N.K.; Kramer, M.H.H.; Nieuwdorp, M.; Verheugt, C.L. Gender gaps in type 1 diabetes care. Acta Diabetol. 2023, 60, 425–434. [Google Scholar] [CrossRef] [PubMed]
  4. Krishnan, S.; Fields, D.A.; Copel, K.C.; Blackett, P.R.; Anderson, M.P.; Gardner, A.W. Sex differences in cardiovascular disease risk in adolescents with type 1 diabetes. Gend. Med. 2012, 9, 251–258. [Google Scholar] [CrossRef]
  5. Svensson, J.; Carstensen, B.; Mortensen, H.B.; Borch-Johnsen, K. Gender-associated differences in Type 1 diabetes risk factors? Diabetologia 2003, 46, 442–443. [Google Scholar] [CrossRef] [PubMed]
  6. Tatti, P.; Pavandeep, S. Gender difference in type 1 diabetes: An underevaluated dimension of the disease. Diabetology 2022, 3, 364–368. [Google Scholar] [CrossRef]
  7. Carlier, A.; Kroonenberg, P.M. Biplots and decompositions in two-way and three-way correspondence analysis. Psychometrika 1996, 61, 355–373. [Google Scholar] [CrossRef]
  8. Lombardo, R.; Beh, E.J.; Kroonenberg, P.M. Symmetrical and non-symmetrical variants of three-way correspondence analysis for ordered variables. Stat. Sci. 2021, 36, 542–561. [Google Scholar] [CrossRef]
  9. Pearson, K. Mathematical contributions to the theory of evolution. XIII: On the theory of contingency and its relation to association and normal correlation. In Draper’s Co. Research Memoirs; Biometrie Series, No. 1; Reprinted 1948 in Karl Pearson’s Early Papers; Pearson, E.S., Ed.; Cambridge University Press: Cambridge, UK, 1904. [Google Scholar]
  10. Pearson, K. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Philos. Mag. 1900, 50, 157–175. [Google Scholar] [CrossRef]
  11. Tucker, L.R. Some mathematical notes on three-mode factor analysis. Psychometrika 1966, 31, 279–311. [Google Scholar] [CrossRef]
  12. Kroonenberg, P.M. Applied Multiway Data Analysis; Wiley: Hoboken, NJ, USA, 2008. [Google Scholar]
  13. Lombardo, R.; van de Velden, M.; Beh, E.J. Three-way correspondence analysis in R. R J. 2023, 15, 237–262. [Google Scholar] [CrossRef]
  14. Beh, E.J.; Lombardo, R. Correspondence Analysis: Theory, Methods and New Strategies; Wiley: Chichester, UK, 2014. [Google Scholar]
  15. Lombardo, R.; Takane, Y.; Beh, E.J. Familywise decompositions of Pearson’s chi-square statistic in the analysis of contingency tables. Adv. Data Anal. Classif. 2020, 14, 629–649. [Google Scholar] [CrossRef]
  16. Loisel, S.; Takane, Y. Partitions of Pearson’s chi-square statistic for frequency tables: A comprehensive account. Comput. Stat. 2016, 31, 1429–1452. [Google Scholar] [CrossRef]
  17. Lancaster, H.O. Complex contingency tables treated by the partition of the chi-square. J. R. Stat. Soc. Ser. B 1951, 13, 242–249. [Google Scholar] [CrossRef]
  18. Kiers, H.A.L. Three-mode orthomax rotation. Psychometrika 1997, 62, 579–598. [Google Scholar] [CrossRef]
  19. Kiers, H.A.L. Three-way SIMPLIMAX for oblique rotation of the three-mode factor analysis core to simple structure. Comput. Stat. Data Anal. 1998, 28, 207–324. [Google Scholar] [CrossRef]
  20. Ten Berge, J.M.F.; Kiers, H.A.L. Simplicity of core arrays in three-way principal component analysis and the typical rank of p × q × 2 arrays. Linear Algebra Its Appl. 1999, 294, 169–179. [Google Scholar] [CrossRef]
  21. Timmerman, M.E.; Kiers, H.A.L. Three-mode principal component analysis: Choosing the numbers of components and sensitivity to local optima. Br. J. Math. Stat. Psychol. 2000, 53, 1–16. [Google Scholar] [CrossRef] [PubMed]
  22. Rocci, R. Core matrix rotation to natural zeros in three-mode factor analysis. In Advances in Classification and Data Analysis: Studies in Classification, Data Analysis, and Knowledge Organization; Borra, S., Rocci, R., Vichi, M., Schader, M., Eds.; Springer: Berlin/Heidelberg, Germany, 2001; pp. 161–168. [Google Scholar]
  23. Kroonenberg, P.M. Model selection procedures in three-mode component models. In New Developments in Classification and Data Analysis; Vichi, M., Molinari, P., Mignani, S., Montanari, A., Eds.; Elsevier: Amsterdam, The Netherlands, 2005; pp. 167–172. [Google Scholar]
  24. Gabriel, K.R. The biplot graphic display of matrices with application to principal component analysis. Biometrika 1971, 58, 453–467. [Google Scholar] [CrossRef]
  25. Lombardo, R.; Carlier, A.; D’Ambra, L. Nonsymmetric correspondence analysis for three-way contingency tables. Methodologica 1996, 4, 59–80. [Google Scholar]
  26. Greenacre, M.J. Biplots in Practice; Fundación BBVA: Barcelona, Spain, 2010. [Google Scholar]
  27. Beh, E.J.; Lombardo, R. Features of the polynomial biplot of ordered contingency tables. J. Comput. Graph. Stat. 2021, 31, 403–412. [Google Scholar] [CrossRef]
  28. Carlier, A.; Kroonenberg, P.M. The case of the French cantons: An application of three-way correspondence analysis. In Visualization of Categorical Data; Blasius, J., Greenacre, M., Eds.; Academic Press: San Diego, CA, USA, 1998; pp. 253–275. [Google Scholar]
  29. Gower, J.C.; Lubbe, S.; le Roux, N. Understanding Biplots; Wiley: Chichester, UK, 2011. [Google Scholar]
  30. Greenacre, M.J. Theory and Applications of Correspondence Analysis; Academic Press: London, UK, 1984. [Google Scholar]
  31. Beh, E.J.; Lombardo, R. Confidence regions and approximate p-values for classical and non symmetric correspondence analysis. Commun. Stat. Theory Methods 2015, 44, 95–114. [Google Scholar] [CrossRef]
  32. Nyström, T.; Andersson Franko, M.; Ludvigsson, J.; Lind, M.; Persson, M. Overweight or obesity, weight variability and the risk of retinopathy in type 1 diabetes. Diabetes Obes. Metab. 2024, 26, 2509–2512. [Google Scholar] [CrossRef] [PubMed]
  33. Östman, J.; Lönnberg, G.; Arnqvist, H.J.; Blohme, G.; Bolinder, J.; Schnell, A.E.; Eriksson, J.W.; Gudbjörnsdottir, S.; Sundkvist, G.; Nyström, L. Gender differences and temporal variability in the incidence of type 1 diabetes: Results of 8012 cases in the nationwide Diabetes Incidence Study in Sweden 1983–2002. J. Intern. Med. 2008, 263, 386–394. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Biplots for patients with T1D and macroalbuminuria. On the left side is the column–tube interactive biplot, while the right side is the row–column interactive biplot.
Figure 1. Biplots for patients with T1D and macroalbuminuria. On the left side is the column–tube interactive biplot, while the right side is the row–column interactive biplot.
Mathematics 12 02186 g001
Figure 2. The column–tube interactive biplot for patients with T1D and retinopathy from the three-way analysis of Table 5.
Figure 2. The column–tube interactive biplot for patients with T1D and retinopathy from the three-way analysis of Table 5.
Mathematics 12 02186 g002
Table 1. Test of independence between weight variability, age group, and gender for each of the 14 CVCs using Pearson’s three-way chi-squared statistic with 24 df.
Table 1. Test of independence between weight variability, age group, and gender for each of the 14 CVCs using Pearson’s three-way chi-squared statistic with 24 df.
CVC X IJK 2 p-Value
Stroke26.0510.351
Myocardial infarction39.9320.022
HFC33.7880.089
CABG26.6950.319
PCI23.0670.516
PAD29.4290.204
Renal failure29.5360.201
Acute renal failure30.6850.163
Chronic renal failure28.3120.247
Dialysis14.2120.942
GFR5032.7230.110
Macroalbuminuria73.806<0.001
Retinopathy168.271<0.001
All-cause mortality38.2800.032
Those p-values in bold are less than the nominal significance level of 0.01.
Table 2. Patients with T1D and macroalbuminuria, cross-classified by weight variability, age group, and gender.
Table 2. Patients with T1D and macroalbuminuria, cross-classified by weight variability, age group, and gender.
Gender: Male
Age Group
Weight VariabilityAge1Age2Age3Age4
Q17142438
Q212231824
Q317192312
Q41815116
Gender: Female
Age Group
Weight VariabilityAge1Age2Age3Age4
Q17101120
Q21412710
Q31115188
Q42110135
Table 3. Partition of Pearson’s three-way association statistic for patients with T1D and macroalbuminuria in Table 2.
Table 3. Partition of Pearson’s three-way association statistic for patients with T1D and macroalbuminuria in Table 2.
Term IJ Term IK Term JK Term IJK Term Total
chi-squared57.9595.3735.2945.18073.806
phi-squared0.1230.0110.0110.0110.156
% of Inertia78.5287.2817.1737.018100.000
df933924
p-value<0.0010.1460.1510.818<0.001
Term/df6.4401.7911.7650.5763.075
Those p-values in bold are less than the nominal significance level of 0.01.
Table 4. The p-values of the categories of weight variability and age group for males and females with T1D and macroalbuminuria.
Table 4. The p-values of the categories of weight variability and age group for males and females with T1D and macroalbuminuria.
p-values of Row Principal Coordinates
Categoryp-value
Q10.003
Q20.027
Q30.001
Q40.001
p-values of Column–Tube Principal Coordinates
Categoryp-value
age1Male0.957
age2Male0.998
age3Male0.934
age4Male0.047
age1Female0.005
age2Female1.000
age3Female0.158
age4Female0.959
Those p-values in bold are less than the nominal significance level of 0.01.
Table 5. Patients with type 1 diabetes experiencing retinopathy, cross-classified by weight variability by age group and gender.
Table 5. Patients with type 1 diabetes experiencing retinopathy, cross-classified by weight variability by age group and gender.
Gender: Male
Age Group
Weight VariabilityAge1Age2Age3Age4
Q14456100118
Q248678867
Q353665836
Q467582524
Gender: Female
Age Group
Weight VariabilityAge1Age2Age3Age4
Q136326097
Q238495066
Q362495840
Q458443435
Table 6. Partition(s) of Pearson’s three-way chi-squared statistic for patients with T1D and retinopathy of Table 5.
Table 6. Partition(s) of Pearson’s three-way chi-squared statistic for patients with T1D and retinopathy of Table 5.
Term IJ Term IK Term JK Term IJK Term Total
chi-squared143.0809.9288.0527.211168.271
phi-squared0.0800.0060.0050.0040.094
% of Inertia85.0295.9004.7854.286100.000
df9.0003.0003.0009.00024.000
p-value<0.0010.0190.0450.615<0.001
Term/df15.8983.3092.6840.8017.011
Those p-values in bold are less than the nominal significance level of 0.01.
Table 7. The p-values of categories of the weight variability and age group for males and females with T1D and retinopathy.
Table 7. The p-values of categories of the weight variability and age group for males and females with T1D and retinopathy.
p-values of Row Principal Coordinates
Categoryp-value
Q1<0.001
Q2<0.001
Q3<0.001
Q4<0.001
p-values of Column–Tube Principal Coordinates
Categoryp-value
age1Male<0.001
age2Male0.885
age3Male<0.001
age4Male<0.001
age1Female0.002
age2Female0.160
age3Female0.192
age4Female0.036
Those p-values in bold are less than the nominal significance level of 0.01.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lombardo, R.; Beh, E.J.; Prattichizzo, F.; Lucisano, G.; Nicolucci, A.; Eliasson, B.; Carlsen, H.K.; Grotta, R.L.; Pellegrini, V.; Ceriello, A. Testing and Visualization of Associations in Three-Way Contingency Tables: A Study of the Gender Gap in Patients with Type 1 Diabetes and Cardiovascular Complications. Mathematics 2024, 12, 2186. https://doi.org/10.3390/math12142186

AMA Style

Lombardo R, Beh EJ, Prattichizzo F, Lucisano G, Nicolucci A, Eliasson B, Carlsen HK, Grotta RL, Pellegrini V, Ceriello A. Testing and Visualization of Associations in Three-Way Contingency Tables: A Study of the Gender Gap in Patients with Type 1 Diabetes and Cardiovascular Complications. Mathematics. 2024; 12(14):2186. https://doi.org/10.3390/math12142186

Chicago/Turabian Style

Lombardo, Rosaria, Eric J. Beh, Francesco Prattichizzo, Giuseppe Lucisano, Antonio Nicolucci, Björn Eliasson, Hanne Krage Carlsen, Rosalba La Grotta, Valeria Pellegrini, and Antonio Ceriello. 2024. "Testing and Visualization of Associations in Three-Way Contingency Tables: A Study of the Gender Gap in Patients with Type 1 Diabetes and Cardiovascular Complications" Mathematics 12, no. 14: 2186. https://doi.org/10.3390/math12142186

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop