1. Introduction
Over the past decades, advancements in neuroimaging have played a crucial role in understanding brain development [
1,
2] contributing significantly to biomedical research [
3,
4]. Numerous studies have utilized traditional machine learning [
5,
6,
7] and deep learning [
8,
9,
10,
11,
12] to investigate various aspects of brain organization, including its anatomy, functional dynamics, and network connectivity. Various MRI techniques, including T1-weighted imaging, diffusion-weighted imaging, and task-based and resting-state functional MRI, along with genetic data, are increasingly employed to study brain structure and function more effectively [
13,
14,
15,
16,
17]. Multimodal neuroimaging analyses, which combine data from two or more neuroimaging modalities, leverage complementary information and overcome the limitations of each [
18]. This approach of multimodal fusion has yielded more reliable results [
3,
4,
19], providing comprehensive insight into the understanding of brain and cognition, as well as abnormal brain function and mental disorders [
4,
20]. Recent studies also suggest that neural interfaces, which establish a direct link between the brain and external devices, can present new opportunities in the future for integrating multi-modalities to advance brain–computer communication [
21].
Various approaches, such as parallel independent component analysis, canonical correlation analysis (CCA), and deep learning-based fusion, have been explored for integrating multimodal data, as has been extensively reviewed in [
20,
22,
23,
24]. This study specifically focuses on CCA-based methodologies due to their straightforward implementation and widespread use. CCA facilitates a linear transformation of data, and it reveals maximally linearly correlated hidden patterns [
25]. An extension of CCA known as deep canonical correlation analysis (DCCA) leverages deep neural networks to capture the non-linear hidden features of each data modality and then maximizes the relationship between two sets of hidden features, resulting in highly correlated representations across data modalities [
26,
27].
This study examined Adolescent Brain Cognitive Development (ABCD) data. The ABCD Study includes a longitudinal cohort consisting of youth aged 9-10 at the baseline from 21 different sites across the US, and it is the largest long-term study that followed youths for 10 years with annual lab-based assessments and bi-annual imaging acquisitions [
28]. It is designed to examine the interplay between biological, behavioral, and environmental factors on brain development and health outcomes in children and adolescents [
29]. ABCD studies leverage the measurement of brain structure and function relevant to adolescent development and addiction, providing evidence for the feasibility and age-appropriateness of the procedures and the generalizability of findings [
30]. Also, the study of neurocognitive development is crucial for distinguishing premorbid vulnerabilities from the consequences of behaviors like substance use [
31].
We investigated the structural changes within two years of brain development between 9–10-year-olds and 11–12-year-olds using images of gray matter and white matter from the ABCD Study. Our hypothesis posits that gray matter and white matter grow coherently during this period, implying a significant relationship between changes in gray matter density and white matter integrity. Such brain-structural changes underline the observed cognitive maturation and behavioral changes outwardly. While CCA captures the linear interactions of brain regions, we recognize the potential contribution of non-linear interactions across brain regions. To address this, we implemented two approaches: CCA and an extension of DCCA, DCCA with an autoencoder (DCCAE), which incorporates both the reconstruction objective from an autoencoder and a standard CCA correlation objective [
32]. Gray matter and white matter growth patterns extracted from both approaches were compared and contrasted. Furthermore, we explored their associations with cognitive and behavioral changes. The contribution of this study is twofold. This study aimed to understand the growth of GM and FA during the two years of critical brain development and their significant changes underlying cognitive and behavior changes, as well as to determine whether the brain growth pattern across regions is linear or non-linear.
5. Discussion
In this study, we implemented both CCA and DCCAE models to extract correlated components from GM and FA changes within two years during adolescence, and such components represented brain growth patterns in GM density and FA integrity coherently in two years. Data from over three thousand children from the general population were used, and similar numbers of GM-FA component pairs were extracted and verified from the CCA and DCCAE models. A comparison of correlation strength suggests that the DCCAE yielded highly correlated GM–FA pairs in both training and testing data when compared to the CCA results. In both CCA and DCCAE, the correlation scores in the testing data were lower than those in the training data; this attenuation was expected when applying the model to independent testing data. Importantly, the persistence of statistically significant correlations in the testing data demonstrates the robustness of the model and suggests that the observed relationships are not a result of overfitting but, rather, reflect genuine associations between GM and FA features. CCA, being a linear model, only captures linear interactions among brain regions and thus a linear relationship between the GM and FA of white matter. On the other hand, DCCAE incorporates a deep learning architecture that is able to extract both linear and non-linear interactions among brain regions and thus a more intricate relation between the GM and FA of white matter. We speculate that this may be the main reason for stronger relations between GM and FA matter components in the DCCAE results.
When examining the direct similarity between components of CCA and DCCAE, we found that most of the CCA components (for both GM and FA) have high and significant correlations (shown in
Table 3 and
Table 4) with some of the DCCAE components. For example, CCA GM component first is significantly and negatively correlated to DCCAE GM components first and second, and it is positively correlated to DCCAE GM component third. Similar results were observed for FA components; CCA FA component first was linked to DCCAE FA components first and second negatively and component third positively. The negative correlation observed in
Table 3 and
Table 4 does not indicate a fundamental contradiction in the relationship between GM and FA features. Instead, it suggests that, for the same brain region, CCA encodes a change in one direction (an increase or a decrease), while DCCAE encodes it in the opposite direction. However, the underlying GM-FA association remains consistent across both models. This difference in sign can be further illustrated in
Figure 1 and
Figure 3, where the first components from CCA and DCCAE show similar spatial patterns, particularly in the posterior occipital region, but the color representation differs, indicating an increase in one model and a decrease in the other.
We also examined the contributing brain regions of components that shared the same brain regions. The top PCs of the CCA and DCCAE components highlighted common regions for GM and FA. The first CCA component correlates with the first, second, and third DCCAE components. The brain regions of the first CCA components include the middle temporal gyrus, precentral gyrus, middle frontal gyrus, superior temporal gyrus, and sub-gyral while DCCAE’s first component identified the middle temporal gyrus, middle frontal gyrus, and sub-gyrus, DCCAE’s second component identified middle temporal gyrus, and middle frontal gyrus, and DCCAE’s third component identified the superior temporal gyrus and precentral gyrus. Similarly, common FA regions were also observed for CCA FA components and DCCAE FA components. The brain regions of the first CCA components include the corticospinal tract, anterior thalamic radiation, and the forceps minor, and DCCAE’s first, second, and third components identified brain regions pointed out by the first component of CCA.
The significant correlations between CCA and DCCAE components and the shared brain regions of components from the two approaches suggest that both linear and non-linear approaches extracted similar brain growth patterns in the two years. Maybe two years is a short enough time for non-linear growth patterns in the brain structure to be estimated in a linear fashion with reasonable accuracy.
The GM brain regions identified using CCA and DCCAE show a negative association with cognition, including the superior frontal gyrus, middle temporal gyrus, and precuneus, while a positive association is observed with cognition in regions such as the inferior frontal gyrus and medial frontal gyrus identified through CCA. However, for the FA, only CCA components are related to fluid intelligence, and the brain regions are negatively related, which includes anterior thalamic radiation and the forceps minor.
Regarding the CBCL behavioral syndromes, five CCA GM components showed significant associations with aggressive behavior, somatic complaints, withdrawn/depressed, and thought problems. These components include brain regions such as the superior temporal gyrus, middle temporal gyrus, inferior frontal gyrus, and middle frontal gyrus and cuneus, which are negatively related to behavior. For white matter, CCA showed associations for the anxious/depressed, attention problem, and withdrawn/depressed conditions, which include negatively related brain regions such as the superior longitudinal fasciculus, anterior thalamic radiation, and the corticospinal tract, while thought problems had positive associations with the brain region of the inferior longitudinal fasciculus and forceps minor.
Similarly, the CBCL behavioral syndromes analysis using DCCAE with gray matter revealed negative associations for the aggressive behavior, anxious/depressed, and somatic complaints conditions, including the brain regions of the superior temporal gyrus, inferior frontal gyrus, superior frontal gyrus, middle temporal gyrus, middle frontal gyrus, cingulate gyrus, sub-gyrus, and precentral gyrus, along with positive associations for attention problems and withdrawn/depressed behavior with the brain regions of the inferior frontal gyrus and superior temporal gyrus. In white matter, DCCAE showed that the brain region has negative associations for attention problems, rule-breaking behavior, and somatic complaints, indicating the following regions: anterior thalamic radiation, the corticospinal tract, the forceps minor, and the inferior fronto-occipital fasciculus. Meanwhile, thought problems had positive associations with the brain regions of the forceps minor and the inferior longitudinal fasciculus.
Many studies have documented diverse growth patterns for different brain regions [
36,
37,
38,
39]. Particularly, the GM densities of brain regions presenting inverted U-shaped growth trajectories peaked at different times [
40]. Our findings indicate that, between the ages of 10 and 12, children’s cognitive ability improves, accompanied by their behavioral changes, and such changes might be underscored by a GM density reduction in the negatively associated brain regions, a pruning phase of brain maturation, and also a GM density increase in the positively associated brain regions that are still in a growth phase of brain maturation.
The examination of the total variance in cognitive and behavioral changes explained by the components extracted from CCA or DCCAE indicates that brain growth patterns could explain cognitive maturation much better than behavioral changes, and the CCA and DCCAE components overall demonstrate a comparable ability in terms of explaining cognitive and behavioral changes.