Next Article in Journal
SwinDenoising: A Local and Global Feature Fusion Algorithm for Infrared Image Denoising
Previous Article in Journal
The Dynamic Behavior of a Stochastic SEIRM Model of COVID-19 with Standard Incidence Rate
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Principled Framework to Assess the Information-Theoretic Fitness of Brain Functional Sub-Circuits

by
Duy Duong-Tran
1,2,*,†,
Nghi Nguyen
3,†,
Shizhuo Mu
1,
Jiong Chen
1,
Jingxuan Bao
1,
Frederick H. Xu
1,
Sumita Garai
1,
Jose Cadena-Pico
4,
Alan David Kaplan
5,
Tianlong Chen
6,
Yize Zhao
7,
Li Shen
1,‡ and
Joaquín Goñi
8,9,10,‡
1
Department of Biostatistics, Epidemiology, and Informatics (DBEI), Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
2
Department of Mathematics, United States Naval Academy, Annapolis, MD 21402, USA
3
Gonda Multidisciplinary Brain Research Center, Bar-Ilan University, Ramat Gan 5290002, Israel
4
Machine Learning Group, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
5
Computational Engineering Division, Lawrence Livermore National Laboratory, Livermore, CA 94550, USA
6
Department of Computer Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
7
School of Public Health, Yale University, New Heaven, CT 06520-8034, USA
8
School of Industrial Engineering, Purdue University, West Lafayette, IN 47907, USA
9
Purdue Institute for Integrative Neuroscience, Purdue University, West Lafayette, IN 47907, USA
10
Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Co-supervising Authors.
Mathematics 2024, 12(19), 2967; https://doi.org/10.3390/math12192967
Submission received: 26 June 2024 / Revised: 10 September 2024 / Accepted: 18 September 2024 / Published: 24 September 2024
(This article belongs to the Section Computational and Applied Mathematics)

Abstract

:
In systems and network neuroscience, many common practices in brain connectomic analysis are often not properly scrutinized. One such practice is mapping a predetermined set of sub-circuits, like functional networks (FNs), onto subjects’ functional connectomes (FCs) without adequately assessing the information-theoretic appropriateness of the partition. Another practice that goes unchallenged is thresholding weighted FCs to remove spurious connections without justifying the chosen threshold. This paper leverages recent theoretical advances in Stochastic Block Models (SBMs) to formally define and quantify the information-theoretic fitness (e.g., prominence) of a predetermined set of FNs when mapped to individual FCs under different fMRI task conditions. Our framework allows for evaluating any combination of FC granularity, FN partition, and thresholding strategy, thereby optimizing these choices to preserve the important topological features of the human brain connectomes. By applying to the Human Connectome Project with Schaefer parcellations at multiple levels of granularity, the framework showed that the common thresholding value of 0.25 was indeed information-theoretically valid for group-average FCs, despite its previous lack of justification. Our results pave the way for the proper use of FNs and thresholding methods, and provide insights for future research in individualized parcellations.

1. Introduction

The success of large-scale brain connectomics—which subserves a myriad of neuroimaging research endeavors based on fMRI [1,2,3], MEG [4], and EEG [5]—hinges on choosing representations of functional connectivity that are as well-defined as possible. Functional connectomes (FCs) are often constructed by computing a statistical dependency measure, such as the Pearson correlation coefficient, across all specified pairs of the brain’s regions of interest (ROIs) using the aggregated voxel-level blood-oxygen-level-dependent (BOLD) signals. However, constructing FCs from BOLD signals with activation delays (due to inhibitory–excitatory dynamics possibly causing negative ROI correlations) can significantly impact estimates of population-level FCs [1] and the associated functional brain network topological features, such as nodes’ centrality [6], global network measures [2], and geometry–topology relation [7]. Recent efforts have focused on improving FC construction by taking into account neuronal signal activation delays [2] and negative correlations [3]. Nonetheless, much effort is still needed to quantify the efficacy of each FC construction framework, especially in terms of preserving the “true” features of the population FCs that shed light on fundamental principles of the brain.
Functional sub-circuits, e.g., functional networks (FNs) [8,9], and their modularity characteristics [10,11,12,13] are crucial to understanding such fundamental neural principles, including brain complexity [14], differential configurational properties [13], modular structures [10,15], and information processing [16,17]. Studies on the modular organizations of the human brain have also informed applied research on aging [18,19] and disorders including schizophrenia [20]. Moreover, research consistently shows that executive subsystems in the brain are reproducible across many individuals at rest, e.g., [8,21], indicating a widespread application of these FNs in various studies—from control groups [22] to pathological investigations [23,24] and predicting individual differences [25]. Even so, there have been few (if any) systematic studies addressing the validity of a common and rarely challenged practice in brain connectomics, which is applying one specific set of a priori FNs to multiple FCs. In other words, FC processing usually involves mapping an a priori fixed set of FNs onto the constructed FCs, across different subjects and fMRI task conditions, without examining whether those mappings are relevant information-theoretically fit to the constructed FCs, given the existence of human brain fingerprint [13,17,26,27,28,29,30,31,32,33].
Among the many decisions influencing whole-brain functional connectivity estimates, like FCs, and circuit-level representations, like FNs, the choice of brain parcellations (i.e., how nodes in functional brain networks are defined) is, undoubtedly, one of the most critical steps [34,35,36]. In fact, this choice determines the network topology used in downstream analyses. Recent studies have shown that different levels of parcellation granularity can affect the identification of subject-level FC fingerprints [30,37]. In an effort to register the raw neuroimaging data into a sequence of increasing granularity, Schaefer and colleagues have recently published a scheme of atlases that increase in network sizes. These parcellations refine the robust set of resting state networks initially identified by Yeo et al. [8], offering various granularity levels for in-depth analysis. Thanks to these advancements, the brain connectivity research community can now explore characteristics of sequential functional brain networks, especially those coupled with the corresponding a priori set of FNs.
Regardless of which parcellation scheme is employed during large-scale FC and FN analyses, another common practice in network neuroscience is thresholding (or, more generally, eliminating statistically spurious functional edges) based on some arbitrary rules or research hypotheses. Careful design of the thresholding process is central to ensuring scientific rigor not only in healthy control studies, but also in those studying disorders such as schizophrenia [38], unipolar depression, and bipolar disorder [39]. Unrigorous application of thresholding can therefore undermine the validity of such important studies by affecting downstream analyses, including parametric statistical tests [40] and network characterization [41]. To mitigate such issues, various thresholding strategies have been proposed to retain particular desired attributes of the original weighted networks. These strategies include proportional thresholding aimed at keeping the absolute number of edges across different subjects and tasks [41], modular similarity [39], and percolation aimed at preserving the topological features of the original weighted graph [42]. Spurious edge elimination also involves methods based on wavelets [38], mixture modeling [43], topological data analysis through persistent homology [44,45], branch-and-bound-based algorithms (to study cognitive activity [5]), and orthogonal minimal spanning trees for dynamical functional brain networks [46]. Furthermore, alternatives to thresholding treatment for FCs have also been proposed using hierarchical Bayesian mixture models [47]. However, this multitude of strategies further complicates the already complex decision-making process of brain data preprocessing and analysis. After all, how can one determine which combination of FC parcellation, FN partitioning, and edge pruning techniques is optimal for their dataset? To the best of our knowledge, no studies have offered a mathematically justified and robust process for choosing such combinations.
This work tackles the complexity posed by that abundance of choices and lack of theoretical justification. Our objectives are two-fold: (i) formalizing and quantify the level of information prominence of a given fixed set of FNs across different subjects and tasks, and (ii) using the level of prominence as guidance to eliminate spurious functional edges in whole-brain FCs. To do so, we utilize Schaefer parcellations [34] with nine distinct granularity levels, ranging from 100 to 900 nodes in 100-node increments. We first present an overview of Schaefer parcellations, as well as a formalization of stochastic block models (SBMs) and its relevance to our quest in Section 2. We then propose an SBM reconstruction pipeline in Section 3. We wrap up with the results in Section 4 and a discussion in Section 5. Our framework can be generalized to any given pair of an FN partition and a parcellation (e.g., [48,49]).

2. A Principled Framework to Assess Information Theoretical Fitness of Brain Functional Sub-Circuits/Networks

2.1. Schaefer’s Atlas of the Cerebral Cortex

The brain parcellations used in this work are sequential, in the sense that their granularity increases, ranging from 100 nodes to 900 nodes with increments of 100 nodes, and are registered on the cortical surface of the brain. These sequential atlases are made possible thanks to the work of Schaefer and colleagues [34]. To ensure our framework is consistent with prior works in human brain connectomics [31,50], we added 14 sub-cortical regions, resulting in network sizes of 114, 214, …, 914 nodes for all fMRI conditions in the Human Connectome Project (HCP) repository, Released Q3 [51,52]. Schaefer parcellations are subdivisions of Yeo’s functional networks [8].

2.2. Stochastic Block Models (SBMs)

Stochastic Block Models (SBMs) have recently gained traction due to exciting developments in both theoretical and practical domains (see Preliminaries—Stochastic Block Models in the Supplementary Materials for further details on notations and a brief introduction). Theory-wise, phase transitions in the fundamental limits of community detection (or, more generally, mesoscopic structures) were discovered through the measure Signal-to-Noise Ratio (SNR) [53]. In the domain of brain connectivity, SBM has demonstrated its advantages in exploring and uncovering diverse types of brain functional sub-circuits (e.g., dis-assortative or core-periphery) beyond the traditional assortative mesoscopic structures [54,55]. Specifically, Sandon and Abbe, in [53], laid out a comprehensive treatment of criteria for mesoscopic structure recovery for any pair of a networked system and an a priori set of communities (or functional networks in brain connectomic domain). Specifically, the recovery requirements were classified under:
  • Weak Recovery (also known as community detection);
  • Almost Exact Recovery;
  • Exact Recovery.
The recoverability of the ground-truth partition depends on the degree regime (indicated by the degree scaling factor s t ) in which the network resides. For instance, weak recovery only requires the necessary condition for a limiting graph ( n ) to be in the constant degree regime, i.e., O ( 1 n ) . On the other hand, exact recovery requires the necessary condition (for the limiting graph) that the graph is asymptotically connected, i.e., in the degree regime of logarithmic O ( l o g ( n ) n ) . The sufficient condition for all the recovery criteria is stated in the respective theorems with different proposed measures with sharp phase transitions, as seen in [53]. If a measure (say, for weak or exact recovery) is below a certain algebraic threshold (stated in the respective theorems), recovery is not possible, although the necessary condition is satisfied. Further details on recovery theorems are located in the Supplementary Materials.
Here, we chose the weak-recovery requirements as guidance for whole-brain functional connectivity estimation for four reasons:
  • Although Schaefer parcellations with an increasing number of nodes allow us to project some empirical insights onto their degree regime, a rigorous theoretical argument on the degree regime is not possible for any empirical graph sequence. Hence, the exact recovery of an a priori unique ground-truth partition is not relevant in the case of brain functional connectomes;
  • Even in the empirical domain, we observe that both group-average and individual FCs become disconnected (i.e., the number of connected components is more than 1) after a relatively small threshold value in the interval τ [ 0.2 , 0.3 ] . Theoretically, a graph sequence is required to be connected, asymptotically, to fulfill the requirements for exact recovery. On the other hand, weak-recovery (detection of mesoscopic structures) offers a more realistic and relaxed set of criteria for this particular application. This facilitates estimating a whole-brain FC that is most suitable for an a priori set of FNs without evaluating the number of connected components of the thresholded FC.
  • Most (if not all) mesoscopic studies of brain functional sub-circuits, such as [54,55], are based on pre-defined hypotheses, e.g., that the brain functional sub-circuits involve a more diverse class of community than just assortative ones [54]. Such an assumption leads to the appropriate usage of different community detection algorithms, such as Weighted Stochastic Block Models in the case of [54,55]. As mentioned above, weak-recovery is equivalent to community detection in the theoretical SBM literature;
  • No set of functional sub-circuits is universally agreed upon and uniquely identified as the ground-truth communities. Hence, all proposed brain functional sub-circuit parcellations, e.g., [8], are relative.

2.3. SBM Description, Inference and Extended Usage

2.3.1. Model Description

In this subsection, we define some of the key components of SBM. Other fundamental mathematical notations are referred to in Section titled Stochastic Block Model Preliminaries in the Supplementary Materials.
  • G = [ a u v ] = F C , weighted-graphs , M , binarized-graphs : network/graph (e.g., FCs in the context of this work);
  • V ( G ) = u , and E ( G ) = u v u , v V ( G ) are sets of vertices and edges, respectively;
  • The size and order of a network are denoted by | V ( G ) | = n and | E ( G ) | , respectively;
  • G t , t N is the graph sequence. In the empirical domain, the number of graphs in the sequence is defined as | G t | = T ;
  • k is the number of communities/clusters;
  • σ = [ σ u ] [ k ] n is the pre-defined, well-understood community assignment in vector form of length n. It is the mathematical map σ = u i , u [ n ] , i [ k ] . In general, σ is also referred to as a graph partition;
  • Ω = [ | Ω i | ] is the vector containing the cardinality of community, where
    | Ω i | = | u σ u = i | , i [ k ] , u [ n ] .
  • C is the statistical summary of edge properties within and between communities in matrix form. Mathematically,
    C = C b i n N + k × k , C w e i R k × k ,
    where C b i n N + k × k denoted the simple edge count matrix within or between communities and C w e i denoted the weighted edge sum (also within or between communities);
  • C m a x N + k × k is the maximum number of edges within or between communities;
  • p = [ p i ] : the probability that a node u belongs to community i [ k ] ; P = diag ( p ) is a k × k matrix filled with p i in the diagonal;
  • Q = [ Q i j ] R k × k is the expected node degree matrix, i.e., the expected number of connections a node in community i has with community j;
  • s t : scalable factor of degree regime in a graph sequence G t where t T ;
  • W s t = [ w i j ] s t is the edge probability between two nodes in community i and j in terms of the scaling factor s t (it is worth noting that if w i j is the same for all i , j [ k ] , then SBM collapses to classical ER random graph model). We use W to denote the edge probability matrix with s t = 1 ;
  • P Q = n P W s t = n P W is the community profile matrix where i column is the expected number of edges that community i has with all communities. Note that for weak-recovery (detection), scaling factor s t = 1 .

2.3.2. Inference and Extended Usage

The basis of SBM parameter inference is reverse engineering by the maximum likelihood principle. Specifically, since both G and σ (subsequently, k = max u [ n ] σ u ) are priors, in expectation, we can infer S B M ( P , W ) using the Bayesian approach as follows:
  • P = Ω n = [ p i ] = | Ω i | n .
  • Infer W b i n = C b i n C m a x .
  • Compute W w e i = C w e i C m a x .
  • Q = n W as s t = 1 for weak-recovery.
  • Compute P Q (Matrix Multiplication).
C b i n is a simple edge count of M τ G A between or within blocks of communities, whereas C w e i is the sum of weighted edges of F C τ (also between or within communities). Specifically,
C b i n = u , v [ n ] 1 σ u = σ v ,
C w e i = u , v [ n ] | | w u v | | , σ u = σ v ,
and
C m a x = Ω Ω T .
The inference of matrix P is based on the law of large numbers [53]. For W b i n , we perform entry-wise divisions of matrix C b i n by matrix C m a x , which infers the Bernoulli random variable parameter p representing the probability of successful edge formation between each pair of stochastically equivalent nodes within or between communities. In the case of C w e i , note that we use the term computing instead of inferring because we have extended the usage of S N R to mesoscopic prominence measure. We use the absolute values w u v only to consider the overall magnitude (and not the sign) of functional couplings within/between FNs.
Technically, this inference is less challenging than traditional inference problems where σ is also a latent variable in the model and graph ensemble G is the only observable ensemble available. Specifically,
( G , n , σ , k ) S B M ( P , W )
where G and σ are priors.

2.4. Weak Recovery of Ground-Truth Partition

Definition 1.
Weak recovery of a ground-truth partition can be rigorously equivalent to the existence of an algorithm that infers a partition that agrees with the ground-truth one up to max i p i + ϵ , i [ k ] . This level of accuracy is the minimal requirement for most community detection methods.
Theorem 1
(Sandon and Abbe [53]). Let ( G , σ ) S B M n , p , s t Q n for p , Q arbitrary and s t = 1 . If S N R > 1 , then weak recovery is efficiently solvable, where
S N R = λ 2 2 λ 1
and λ i is the i t h eigen value of the community profile matrix P Q .
Weak recovery of given ground-truth communities means that, through that algorithm, the recovered partition outperforms a random guess, i.e., max i p i , by a small factor ϵ . The criteria for weak recovery are driven by a hard threshold approach presented in the below theorem. Importantly, achieving weak recovery does not necessitate the graph being connected under an asymptotic regime. Loosely speaking, we only need every graph in the graph sequence to have a large connected component. In other words, we only need G t T to be in the constant degree regime, i.e., s t = 1 . Consider a network of n nodes divided into two equal-sized ground-truth communities (i.e., n 2 nodes for each community). In a weak recovery scenario, it is feasible to accurately identify the community membership of each node with a probability marginally above 50 % , say by an additional 5 % . This implies that if an ensemble is generated under a constant degree regime, one can arbitrarily assign any community membership to isolated nodes, i.e., leaves; hence, exact recovery is impossible in this regime. On the other hand, for exact recovery, since W scales with n through the factor s t , the community profile matrix M consequently grows with the factor s t as well.

2.5. Problem Formulation and the Fitness Assessment Framework

2.5.1. Problem Formulation

The current problem of interest is stated as follows: given an a priori set of functional sub-circuits, we would like to investigate and subsequently quantify the “fitness” of such pre-determined partition for a given brain network topology.
We formulate this problem as the “dual” of the primal community detection problems [56,57,58,59]. Specifically, in the primal problem, a partition (i.e., set of mesoscopic structures) is constructed for a given network topology. In the dual, we look to evaluate and subsequently quantify the fitness of a pre-determined (e.g., a priori) set of mesoscopic structures for the given network.

2.5.2. Fitness Assessment Framework

For a given pair of a complex network (e.g., functional connectome) and an a priori set of ground-truth communities (e.g., Yeo’s functional sub-circuits), we propose the following steps to access the information-theoretic fitness of ground-truth communities (Figure 1) as follows:
  • Step 1: obtain an average representation (e.g., a group-average FC) from the collection of individual networks, binarize the group-average representation, apply Theorem 1 across the thresholding parameter space S , and yield the masked, binarized group-average FC;
  • Step 2 (vetting step): Compute the SNR for the masked group-average FC and investigate the SNR across all finite combinations in S . Compute the the weak-recoverability sub-interval I S ;
  • Step 3: compute the a priori community prominence for each individual FC; note that this prominence can also be computed for the group-average FC.
  • Step 4: For each individual FC, obtain the τ maximizing the prominence computed in Step 3. Check if τ o p t belongs to the weak-recoverability sub-interval I. Note that, similarly to Step 3, this step can also be performed on the group-average FC.

3. Application: A Pipeline for Thresholding Functional Connectomes

The below pipeline (Figure 2) describes the process to compute the optimal threshold for a given fMRI condition, a Schaefer granularity, and a cohort in two particular cases:
  • Individually driven threshold τ o p t γ ;
  • Constant (cohort-driven) threshold τ o p t G A where G A stands for group-average.
Here, we see that the parameter space S reduces to the line search of threshold value τ = [ 0 , 1 ] S .
The pipeline contains four distinct steps:
  • Step 1: for each Schaefer granularity level and task, compute the binarized (masked) group-average FC (denoted as M τ G A ) using the entry-average of individual FCs (the number of individual FC is denoted as Γ ).
  • Step 2 (vetting step):
    (a)
    For each threshold value τ = τ [ 0 , 1 ] , infer the Stochastic Block Model (SBM) parameters to compute the Signal-to-noise ratio (SNR) of M τ G A :
    S N R [ M τ G A ] = λ 2 2 λ 1 [ P Q ] b i n G A .
    (b)
    Repeat this computation for all threshold values, apply Theorem 1 to determine the weak-recoverability sub-interval ( a w , b w ) τ = [ 0 , 1 ] for the group-average FC, i.e., M τ G A .
  • Step 3: For a given individual FC and threshold value τ , compute the associated thresholded FC, i.e., F C τ γ , and then compute the Stochastic Block Model (SBM) parameters for F C τ γ . Extend the usage of S N R as a mesoscopic prominence measure:
    S N R [ F C τ γ ] = λ 2 2 λ 1 [ P Q ] w e i γ .
    Analogously, we can also compute the S N R for the group-average FC ( F C G A ) as follows:
    S N R [ F C τ G A ] = λ 2 2 λ 1 [ P Q ] w e i G A .
    Repeat Step 3 for all threshold values τ [ 0 , 1 ] and all individual FCs for a given fixed Schaefer parcellation and fMRI task pair.
  • Step 4:
    (a)
    Obtain the threshold value that maximizes SNR of the thresholded FC and the corresponding optimally reconstructed whole-brain FC:
    τ o p t γ = argmax τ ( S N R [ F C τ γ ] ) .
    Note that if the group-average FC ( F C G A ) is used in Step 3, then:
    τ o p t G A = argmax τ ( S N R [ F C τ G A ] ) .
    (b)
    Check if τ o p t is in the weak-recoverability sub-interval computed in Step 2:
    τ o p t ( a w , b w ) ,
    Note that one needs to check the optimal threshold against the weak-recovery sub-interval, regardless of whether it is an individualized threshold ( τ o p t γ ) or a group-average one ( τ o p t G A ).

4. Results

In this section, using weak recovery criteria, we investigate the level of information-theoretic prominence of an a priori set of FNs with respect to different FCs (both group-average and individual subject levels) across a range of threshold values. Additionally, we offer deeper insights into the use of SNR as a measure of the information-theoretic prominence of this predetermined set of FNs.
The dataset used in this paper contains 410 unrelated subjects from the Human Connectome Project repository, released Q3 [51,52]. This includes (test and retest) sessions for resting state and seven fMRI tasks: gambling (GAM), relational (REL), social (SOC), working memory (WM), language processing (LANG), emotion (EMOT), and motor (MOT). Whole-brain FCs estimated from this fMRI dataset include nine distinct Schaefer granularity levels that parcellate the cortical regions into n = 100 to n = 900 nodes, with a 100 nodes increment for each parcellation. The functional communities evaluated in this framework include seven cortical resting state FNs from [8]: visual (VIS), somatomotor (SM), dorsal attention (DA), ventral attention (VA), frontoparietal (FP), limbic (LIM), and default mode (DMN). Each Schaefer granularity has a corresponding Yeo’s FN parcellation. Additional details about the dataset are available in the Supplementary Materials.

4.1. Weak-Recoverability Sub-Interval ( a w , b w )

Based on Panel (A) of Figure 3, we see that for most Schaefer granularity levels (except for n = 100 ), the lower and upper bound of theoretically guaranteed sub-interval of weak-recovery stay fairly stable: τ [ 0.05 , 0.8 ] . The lower bound a w stabilizes faster than the upper bound b w across Schaefer parcellations. Except for the low-resolution parcellation n = 100 , the weak-recovery valid range is relatively stable and parcellation-independent. This implies that the information-theoretic relevance of an a priori set of FNs is, to some extent, parcellation-free. In other words, for all investigated granularity levels, the thresholded graphs are in the weak-recoverability regime, except for the complete ( τ [ 0 , 0.05 ) ) or empty ( τ ( 0.8 , 1 ] ) graph extremes. Panel D of Figure 3 shows further details on the FC density. This is rather interesting because, at those two extremes, networks will contain either too much noise (complete graphs) or too little signal (empty graph) for any highly putative partitions to be information-theoretically relevant.

4.2. Resting State: Group-Average versus Individuals

Based on Panels (B) and (E) of Figure 3, it is evident that all SNR profiles (including the group average and individual levels) behave non-monotonically across the threshold range. There exists a threshold value such that SNR is maximized in the investigated range τ [ 0 , 1 ] . In addition, all optimal threshold values, for both group-average and individual FCs, are within the weak-recoverability sub-interval ( a w , b w ) for all investigated Schaefer granularity levels.
Secondly, we see that both group-average and individual SNR profiles scale with n. This is because the scaling factor s t for the Schaefer FC sequence is not constant. In other words, as the graph size gets larger, one can expect the community profile matrix P Q with entries [ P Q ] i j , i , j [ k ] to represent the number of expected “friends” between FN i and j (e.g., between DMN and LIM) to become larger numerically. Further evidence on empirical exploration of the Schaefer graph sequence degree regime is located in the Supplementary Materials.
Thirdly, for a fixed Schaefer granularity level, the group-average SNR peaks higher and earlier across the investigated threshold range than that of an individual subject. Interestingly, the topological property of connected components for both individual and group-average FCs, across all Schaefer parcellations, also exhibit a similar trend. Specifically, according to Figure S1 (Supplementary Materials), individual FCs become fragmented earlier, i.e., the number of connected components surpasses 1 faster, compared to the corresponding group-average FCs for a fixed granularity level. Topologically and numerically speaking, averaging FC entries across the subject domain damps down the individual fingerprints presented as high-magnitude Pearson correlation values in FCs. This results in magnitude-wise smaller functional connectivity entries, which are annihilated by smaller threshold value τ . On the other hand, using the same analogy, one can see that it takes a higher threshold value for individual FC entries to be annihilated.

4.3. Individualized Optimal Thresholds

As one can observe from Figure 4, the individualized optimal threshold varies across different individuals, which demonstrates strong evidence of the existence of FN functional fingerprint across subjects. In addition, the average of these individualized thresholds, for a given parcellation granularity, is roughly equal to the group-average optimal threshold.

4.4. Group-Average: Resting State vs. fMRI Task Analysis

Next, we investigate the prominence of Yeo’s resting state networks with respect to different fMRI conditions, including seven tasks and the resting state, through SNR measures using group-average FCs across all Schaefer granularity levels and the entire threshold range. Using the resting state SNR profile as a baseline, we compare all task responses in these two scenarios:
  • Constructing FCs with the maximum number of time points available for each fMRI condition;
  • For all fMRI conditions, constructing FCs using 166 time points, which correspond to the number of time points associated with the EMOT task that is also the minimum across all conditions.
Firstly, in both scenarios, the maximum SNR values for all examined tasks surpass the hard threshold S N R = 1 for weak recoverability (Figure 5). Moreover, the optimal threshold τ o p t consistently falls within the range ( a w , b w ) . Trivially, resting state SNR dominates all available tasks across all parcellation levels. This is expected because the selected set of FNs are Yeo’s resting state networks. Secondly, working memory (WM) fMRI responds fairly consistently across all granularity levels in both scenarios. From an information-theoretic perspective, EMOT is the most similar task to the resting state, with respect to Yeo’s resting state networks.
Thirdly, in the maximum-timepoint case, with the exception of n = 100 parcellation, the SNR profiles for most tasks are roughly half the magnitude of the resting-state SNR. Furthermore, for all examined Schaefer parcellations, group-average task FCs appear to reach their SNR peak earlier than the resting-state counterpart. Further details are indicated in Figure 6, Panel A.
In the second scenario, when the minimum number of time points is used across all fMRI conditions, the gap in SNR magnitude between the resting state and each task condition is significantly narrowed, yet the SNR during rest still exceeds those during tasks. Further details are indicated in Figure 6, Panel B.

4.5. The SNR-Driven Inequality

It is important to check if SNRs are robust against randomness, i.e., whether they are a valid factor in deciding the threshold. To do so, we randomly shuffle Yeo’s resting state networks and recompute the SNR response. We repeat the random shuffling procedure 100 times, and record the results for all nine group-average FC induced by the nine Schaefer parcellations, each of which is under the R E S T 1 condition with scanning pattern L R . Results for R L pattern are available in the Supplementary Materials.
For every fixed Schaefer parcellation granularity, the null model SNR profiles are uniformly lower than those of all subjects across the entire thresholding range. Furthermore, the null model values do not exceed the hard threshold imposed by weak recovery criteria, i.e., S N R = 1 . This observation holds true across all investigated Schaefer parcellations, as seen in Panel F of Figure 3. Interestingly, the SNR gets uniformly smaller as Schaefer parcellation granularity increases, as seen also in Panel F.
Collectively, given the SNR results obtained at rest, under task conditions, and null models, we empirically form an inequality relation between resting state and task fMRI-induced FCs in terms of SNR response and the corresponding level of prominence of Yeo’s resting state networks across different fMRI conditions:
0 < S N R n u l l < S N R t a s k < S N R r e s t
This general order of SNR response is observed at the threshold τ that maximizes the objective function S N R by the weak-recovery criteria. At such optimal threshold values, all SNR profiles for task fMRI are in the weak recoverability region while still smaller in magnitude than that at the resting state. Together, these inequalities constitute an empirical lower-bound and upper-bound for S N R t a s k , at least for all the tasks investigated in our study.

4.6. Maximum SNR and Threshold Relationship

As the granularity of Schaefer parcellation gets finer, the corresponding group-average SNR profiles become larger due to the natural scaling of the community profile matrix P Q . This observation applies to the majority of the threshold range. Moreover, per Figure 3, Panel F, we see that optimal thresholds, e.g., τ o p t , tend to decrease as the granularity level increases, which suggests that larger Schaefer FCs do not need to be thresholded as much. Another interesting observation is that, with the exceptions of n = 100 , 200 , 900 , all other investigated granularity levels accept a very stable optimal threshold τ o p t G A = 0.25 . Being a computation pipeline that relies on discretized line search on threshold τ (of increments 0.05 for τ = [ 0 , 1 ] ), yielding this level of consistence of optimal value is unexpected.

4.7. Validation: Highly Putative Partition Back-Test

The theory of weak recovery and its extended usage proposed here allowed us to argue for the relevance of using SNR as a measure that, via thresholding as a specific use case, guides the estimation of well-defined functional connectivity given the mapping of an a priori set of FNs. In this section, our goals are two-fold:
  • Validate our framework by solving the “forward” problem (e.g., community detection) and compare the detected partition with the ground truth;
  • Benchmark different community detection algorithms to solidify the above validation.
Specifically, we juxtapose SNR as a guiding measure against objective-function community detection methods. One such method is Newman’s Q score Maximization [60,61,62]. Here, instead of using the Q score (modularity score) as an objective function for community detection, we use it as a guiding measure to investigate its behavior across the tested threshold range [ 0 , 1 ] . In broad strokes, the Q score measures the statistical difference between a network and its corresponding null model with similar topological properties such as the degree sequence. It can be computed as follows:
Q = u , v ( A u v α P u v ) δ ( σ u , σ v ) .
where δ ( · , · ) . and α are the Kronecker delta and tuning parameter (defaulted at α = 1 ), respectively. In network neuroscience, the majority of studies examining mesoscopic structures of brain functions heavily leverage the maximization of Q score, which unravels predominantly assortative communities, i.e., mesoscopic structures with denser internal edge density than the external one [15,63]. SBM inference methods like Weighted SBM Inference, in principle, uncover a more diverse type of communities beyond assortative ones, such as dis-assortative and core-periphery communities [54]. Because of such distinct differences in principle between the two types of approaches, the Q score would provide a good benchmark for comparing the robustness of SNR against various community detection approaches. Note that for Weighted SBM inference, we assume a Poisson distribution for the weighted graph [64]. Although other model assumptions are possible, our goal in this paper is not to select the most fitting model assumption, but rather to investigate the differences in the communities detected using two theoretically different approaches. In other words, we are not looking to see if the Q score or S N R picks up the exact threshold where the inferred partition is information-theoretically aligned with Yeo’s FNs; rather, we are interested in seeing whether each of those two measures captures the threshold interval where the two partitions agree to a relatively high degree. To measure the information-theoretic agreement between the inferred and ground-truth partitions, we use adjusted mutual information (AMI), which is a measure adjusted to chance. Further details on the inference method and AMI are described in the Supplementary Materials.
Here, our evaluation criteria are as follows: there would only be a threshold sub-interval that are better aligned with the highly putative (ground-truth) community assignment. In the forward direction, we apply three different community detection algorithms across the investigated threshold range [ 0 , 1 ) and measure the corresponding AMI between the inferred and ground-truth partitions. A robust guiding measure, since “blinded” from the ground-truth partition, should have a similar behavior, compared to the AMI curve.
Firstly, per the right panels in Figure 7, we see that both community detection methods, namely Newman’s Q score Maximization and Weighted SBM Inference, yield very similar trends. Specifically, both AMI profiles go up and down crossing the threshold range. Further, AMI gets smaller as n gets larger, which is expected for graphs with increasing numbers of nodes. Interestingly, the threshold values that maximize the AMI for Newman’s Q score Maximization tend to shift left as n increases. We see this particular behavior with S N R in the earlier result section (Panel F of Figure 3). Secondly, the Q score keeps a fairly steady rise in magnitude across the threshold range. Further, it does not appear that the Q score is parcellation dependent; this is expected, because the measure is normalized by 2   m . Moreover, the Q score peaks and plateaus at a very high threshold range τ [ 0.6 , 0.8 ] . In that range, the thresholded FC is highly fragmented (Figure S1) with extremely low edge density Figure 7, and ceases to retain interesting topological insights for further analysis. Lastly, we see that S N R driven curves, with an a priori set of FNs, behave very similar to AMI profiles of both objective-function approaches. On the other hand, the Q score keeps rising across the threshold range and starts plateauing towards the end, which indicates failure at picking a threshold useful for an a priori partition such as Yeo’s FNs. Collectively, our results show, once again, that S N R computation on weighted, thresholded FCs provides excellent guidance for reconstructing a graph with the most information-theoretic relevance to a particular fixed set of FNs.

5. Discussion

In recent years, the network neuroscience field has been striving forward with many exciting discoveries that are becoming more and more relevant to clinical applications and personal medicine. In network neuroscience, this urges the need to improve a popular proxy of brain function, namely functional connectivity. Having the most proper, state-of-the-art mathematical representation of functional brain circuits allows for more accurate and confident positioning of research endeavors. In this work, we put forth a simple framework that allows improving the mathematical representation of brain functions given the use of an a priori set of functional networks. This framework also doubles as a clear evaluation tool for any specific combination of FC parcellation, FN partition, and edge pruning techniques applied to a large-scale brain dataset, thereby streamlining the complex yet crucial studies in network neuroscience.
Thresholding, which is an edge pruning technique used in post-FC processing, is seldom challenged as a standard practice that eliminates, albeit arbitrarily, statistically spurious edges. Since an increasing body of clinical research now involves FC thresholding in the data construction pipeline, careful scrutiny of thresholding is therefore imperative. To the best of our knowledge, our framework represents the first attempt to address the impact of thresholding on the fitness of functional networks. Other thresholding methods aim to optimize topological properties (such as network density [41,65,66,67]), thereby stabilizing graph metrics across datasets. On the other hand, our approach tackles a fundamentally different challenge, namely stabilizing the information-theoretic metrics of functional subcircuit assignment across parcellations. We believe that this challenge is at least as relevant, if not more so, to the current state of fMRI-based functional brain research, as most studies continue to rely on Yeo’s canonical functional subcircuit assignments without re-examining their fitness to the specific parcellation granularity.
By addressing this gap, we conclude that there is no single constant threshold value that is optimal across different parcellation granularity levels, such as the Schaefer ones. In particular, from coarser to finer Schaefer granularities, the optimal threshold value decreases. This result is partially observable in the behavior of matrix W across all studied Schaefer parcellations. According to Figure 7, we see that for a fixed threshold value, as Schaefer granularity increases, Yeo’s functional networks behave more in an assortative manner, i.e., denser internal edge density and sparser external one. We see that through a brighter diagonal and a darker off-diagonal regime of matrix W, across Schaefer parcellations with fixed threshold value τ . Information-theoretically, it means that a larger graph (in size) tends to contain more relevant information to unravel the ground-truth partition (in our study, seven Yeo’s resting state networks); hence, we do not need to threshold the FCs as deeply as the lower granularity parcellations, such as n = 100 . This result also suggests that FC size is proportional to the level of prominence, or fitness, of the a priori set of FNs. Nonetheless, the exact limit of this behavior when granularity tends toward infinity is unknown, e.g., whether the optimal thresholding value will reach a plateau even if the granularity increases. Despite this uncertainty, we showed that for the majority of granularity levels between 100 and 900, the commonly used threshold of τ = 0.25 is a valid choice for group-average topology.
Moreover, when using SNR as a goodness-of-fit measure while fitting an a priori set of FNs onto the FC, while no significant differences are observed between resting state and task conditions for the low-resolution parcellation (n = 100), distinct differences emerge at higher resolutions. There are two ways of interpreting this result: (i) a priori FNs exhibit a poorer fit during rest compared to task states; (ii) there is an intrinsic shift of functional network dynamics at the individual level between the resting state and the task condition. Furthermore, there is also strong evidence suggesting a wide variance in the individualized thresholds across all Schaefer parcellation granularity levels. In the same vein, our results also support the concept of individualized parcellation suggested by the work of Salehi and colleagues [36]. While intuitive and insightful, individualized parcellation across subjects and tasks remains computationally expensive. To that end, our work offers a well-defined tool to examine the level of relevance a particular set of functional networks exhibits when mapped onto individual FCs under different conditions. In simpler terms, it allows us to, for the first time, quantify the individual differences (through information-theoretic gap) when the same atlas is mapped across cohort and/or task domains. This paves the way for alternative frameworks that build upon our work, potentially leading to task-dependent or subject-dependent parcellation methods beyond that proposed in [36].
Our work also extends the usage of the weak-recovery theorem by leveraging SNR as a goodness-of-fit measure. Specifically, our results suggest that for the majority of threshold values, the masked binarized FCs are in the regime of week recovery. However, an open question remains: when parcellated by the Schaefer atlas for a fixed individual and an fMRI condition, is the sequence of FCs in the exact recovery regime? Future studies are needed to address this information-theoretic gap between weak and exact recoverability requirements that is reflected by two measures: SNR for weak recovery and Chernoff–Hellinger distance for exact recovery. Although exact recovery is a stronger requirement, if the Schaefer graph sequence falls within the exact recovery degree regime, the mutual information between the inferred partition (through network inference and objective-based community detection methods) and the ground-truth one (e.g., Yeo’s parcellations) will be theoretically higher.
While our current work addresses the validity of multiple levels of analysis, such as parcellations and subcircuits, future extensions of this study should focus on examining the external validity and generalizability of the framework and its results. In particular, the study would benefit from replication on datasets drawn from healthy populations across different age groups, extreme populations such as those with neuropathological conditions, and data recorded under varying experimental conditions and task designs. Such extensions would introduce a vast range of network complexity and heterogeneous distributions of topological properties, thereby providing the ultimate tests for the power of our framework. Additionally, several technical aspects requiring further investigation include the limitations of fMRI voxel resolution, both spatial and temporal, as well as the corresponding Schaefer parcellations, particularly their impact on SNR when fitting Yeo’s functional networks. Specifically, further analysis is needed to explore the effects of voxel sizes (e.g., 2 mm isotropic for the HCP dataset [51]) and repetition times (e.g., 720 ms for the HCP dataset [51]).
Nevertheless, our findings highlight two crucial points that have substantial implications for brain connectomics research. First, because of the existence of individual brain fingerprints [30,31], we need to pay extra attention when applying a common, fixed atlas to individual FCs. Secondly, we show that thresholding FC matrices is not only an intuitive step during FC post-processing (e.g., to eliminate statistically spurious edges), but also a necessary one if we would like to use such FCs, coupled with an a priori set of FNs, to support any research endeavor in brain connectomics. These results suggest a promising new direction: individualized and task-dependent parcellation methods as an alternative to fixed atlases like that of Yeo. This opens up new directions for precision medicine, particularly targeting individual-specific neurological and neuropsychiatric disorders.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math12192967/s1, Supplementary Materials: A principled framework to assess the information-theoretic fitness of brain functional sub-circuits. References [68,69,70,71,72,73] are cited in the Supplementary Materials.

Author Contributions

D.D.-T.: conceptualization, formal analysis, investigation, methodology, writing original draft; J.C.-P. and A.D.K.: investigation, methodology; N.N.: investigation, formal analysis, visualization, writing original draft; S.M., J.C., J.B., F.H.X., S.G., T.C. and Y.Z.: writing—review and editing; J.G. and L.S.: conceptualization, formal analysis, writing original draft, project supervision, funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

J.G. acknowledges financial support from NIH R01EB022574 and NIH R01MH108467 and the Indiana Clinical and Translational Sciences Institute (Grant Number UL1TR001108) from the National Institutes of Health, National Center for Advancing Translational Sciences, Clinical and Translational Sciences Award. L.S. acknowledges financial support from the National Institutes of Health grants RF1 AG068191, R01 AG071470, U01 AG068057, and T32 AG076411, the National Science Foundation grant IIS 1837964. N.N. acknowledges financial support from the Erasmus Mundus Joint Master’s Programme in Brain and Data Science, European Commission. D.D.-T. acknowledges financial support from Office of Naval Research N0001423WX00749 and Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344.

Data Availability Statement

Data were provided [in part] by the Human Connectome Project, WU-Minn Consortium (Principal Investigators: David Van Essen and Kamil Ugurbil; 1U54MH091657) funded by the 16 NIH Institutes and Centers that support the NIH Blueprint for Neuroscience Research; and by the McDonnell Center for Systems Neuroscience at Washington University. The data used in this study are freely available on the HCP website (https://www.humanconnectome.org, accessed on 1 September 2021). The release Q3 from the HCP data with resting state and seven fMRI tasks and Glasser parcellation was used, and users must apply for permission to access the data.

Acknowledgments

We thank Emmanuel Abbe for his valuable comments.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

FNFunctional network
FCFunctional connectome
SBMStochastic block model
ROIRegion of interest
BOLDblood-oxygen-level-dependent
HCPHuman Connectome Project
SNRSignal-to-noise ratio
GAMGambling
RELRelational
SOCSocial
WMWorking memory
LANGLanguage
EMOTEmotion
MOTMotor
VISVisual
SMSomatomotor
DADorsal attention
VAVentral attention
FPFrontal parietal
LIMLimbic
DMNDefault mode
AMIAdjusted mutual information

References

  1. Tang, R.; Ketcha, M.; Badea, A.; Calabrese, E.D.; Margulies, D.S.; Vogelstein, J.T.; Priebe, C.E.; Sussman, D.L. Connectome smoothing via low-rank approximations. IEEE Trans. Med. Imaging 2018, 38, 1446–1456. [Google Scholar] [CrossRef] [PubMed]
  2. Mijalkov, M.; Pereira, J.B.; Volpe, G. Delayed correlations improve the reconstruction of the brain connectome. PLoS ONE 2020, 15, e0228334. [Google Scholar] [CrossRef] [PubMed]
  3. Zhan, L.; Jenkins, L.M.; Wolfson, O.E.; GadElkarim, J.J.; Nocito, K.; Thompson, P.M.; Ajilore, O.A.; Chung, M.K.; Leow, A.D. The significance of negative correlations in brain connectivity. J. Comp. Neurol. 2017, 525, 3251–3265. [Google Scholar] [CrossRef] [PubMed]
  4. Colclough, G.L.; Brookes, M.J.; Smith, S.M.; Woolrich, M.W. A symmetric multivariate leakage correction for MEG connectomes. Neuroimage 2015, 117, 439–448. [Google Scholar] [CrossRef]
  5. Thilaga, M.; Vijayalakshmi, R.; Nadarajan, R.; Nandagopal, D.; Cocks, B.; Archana, C.; Dahal, N. A heuristic branch-and-bound based thresholding algorithm for unveiling cognitive activity from EEG data. Neurocomputing 2015, 170, 32–46. [Google Scholar] [CrossRef]
  6. Alakörkkö, T.; Saarimäki, H.; Glerean, E.; Saramäki, J.; Korhonen, O. Effects of spatial smoothing on functional brain networks. Eur. J. Neurosci. 2017, 46, 2471–2480. [Google Scholar] [CrossRef]
  7. Roberts, J.A.; Perry, A.; Lord, A.R.; Roberts, G.; Mitchell, P.B.; Smith, R.E.; Calamante, F.; Breakspear, M. The contribution of geometry to the human connectome. NeuroImage 2016, 124, 379–393. [Google Scholar] [CrossRef]
  8. Yeo, B.T.; Krienen, F.M.; Sepulcre, J.; Sabuncu, M.R.; Lashkari, D.; Hollinshead, M.; Roffman, J.L.; Smoller, J.W.; Zöllei, L.; Polimeni, J.R.; et al. The organization of the human cerebral cortex estimated by intrinsic functional connectivity. J. Neurophysiol. 2011, 106, 1125–1165. [Google Scholar]
  9. Duong-Tran, D.; Kaufmann, R.; Chen, J.; Wang, X.; Garai, S.; Xu, F.H.; Bao, J.; Amico, E.; Kaplan, A.D.; Petri, G. Homological landscape of human brain functional sub-circuits. Mathematics 2024, 12, 455. [Google Scholar] [CrossRef]
  10. Meunier, D.; Lambiotte, R.; Bullmore, E.T. Modular and hierarchically modular organization of brain networks. Front. Neurosci. 2010, 4, 200. [Google Scholar] [CrossRef]
  11. Stam, C.J.; Hillebrand, A.; Wang, H.; Van Mieghem, P. Emergence of modular structure in a large-scale brain network with interactions between dynamics and connectivity. Front. Comput. Neurosci. 2010, 4, 133. [Google Scholar] [CrossRef] [PubMed]
  12. Bertolero, M.A.; Yeo, B.T.; D’Esposito, M. The modular and integrative functional architecture of the human brain. Proc. Natl. Acad. Sci. USA 2015, 112, E6798–E6807. [Google Scholar] [CrossRef] [PubMed]
  13. Duong-Tran, D.; Abbas, K.; Amico, E.; Corominas-Murtra, B.; Dzemidzic, M.; Kareken, D.; Ventresca, M.; Goñi, J. A morphospace of functional configuration to assess configural breadth based on brain functional networks. Netw. Neurosci. 2021, 5, 666–688. [Google Scholar] [CrossRef] [PubMed]
  14. Bassett, D.S.; Gazzaniga, M.S. Understanding complexity in the human brain. Trends Cogn. Sci. 2011, 15, 200–209. [Google Scholar] [CrossRef] [PubMed]
  15. Sporns, O.; Betzel, R.F. Modular brain networks. Annu. Rev. Psychol. 2016, 67, 613–640. [Google Scholar] [CrossRef] [PubMed]
  16. Amico, E.; Arenas, A.; Goñi, J. Centralized and distributed cognitive task processing in the human connectome. Netw. Neurosci. 2019, 3, 455–474. [Google Scholar] [CrossRef]
  17. Amico, E.; Abbas, K.; Duong-Tran, D.A.; Tipnis, U.; Rajapandian, M.; Chumin, E.; Ventresca, M.; Harezlak, J.; Goñi, J. Toward an information theoretical description of communication in brain networks. Netw. Neurosci. 2021, 5, 646–665. [Google Scholar] [CrossRef]
  18. Meunier, D.; Achard, S.; Morcom, A.; Bullmore, E. Age-related changes in modular organization of human brain functional networks. NeuroImage 2009, 44, 715–723. [Google Scholar] [CrossRef]
  19. Betzel, R.F.; Byrge, L.; He, Y.; Goñi, J.; Zuo, X.N.; Sporns, O. Changes in structural and functional connectivity among resting-state networks across the human lifespan. NeuroImage 2014, 102, 345–357. [Google Scholar] [CrossRef]
  20. Alexander-Bloch, A.; Lambiotte, R.; Roberts, B.; Giedd, J.; Gogtay, N.; Bullmore, E. The discovery of population differences in network community structure: New methods and applications to brain functional networks in schizophrenia. NeuroImage 2012, 59, 3889–3900. [Google Scholar] [CrossRef]
  21. Power, J.D.; Cohen, A.L.; Nelson, S.M.; Wig, G.S.; Barnes, K.A.; Church, J.A.; Vogel, A.C.; Laumann, T.O.; Miezin, F.M.; Schlaggar, B.L.; et al. Functional network organization of the human brain. Neuron 2011, 72, 665–678. [Google Scholar] [CrossRef] [PubMed]
  22. Lee, M.H.; Hacker, C.D.; Snyder, A.Z.; Corbetta, M.; Zhang, D.; Leuthardt, E.C.; Shimony, J.S. Clustering of resting state networks. PLoS ONE 2012, 7, e40370. [Google Scholar] [CrossRef] [PubMed]
  23. Chan, M.Y.; Alhazmi, F.H.; Park, D.C.; Savalia, N.K.; Wig, G.S. Resting-state network topology differentiates task signals across the adult life span. J. Neurosci. 2017, 37, 2734–2745. [Google Scholar] [CrossRef] [PubMed]
  24. Xu, F.H.; Gao, M.; Chen, J.; Garai, S.; Duong-Tran, D.A.; Zhao, Y.; Shen, L. Topology-based clustering of functional brain networks in an Alzheimer’s disease cohort. AMIA Summits Transl. Sci. Proc. 2024, 2024, 449–458. [Google Scholar] [PubMed]
  25. Reineberg, A.E.; Andrews-Hanna, J.R.; Depue, B.E.; Friedman, N.P.; Banich, M.T. Resting-state networks predict individual differences in common and specific aspects of executive function. NeuroImage 2015, 104, 69–78. [Google Scholar] [CrossRef] [PubMed]
  26. Duong-Tran, D.; Magsino, M.; Goñi, J.; Shen, L. Preserving human large-scale brain connectivity fingerprint identifiability with random projections. In Proceedings of the 2024 IEEE International Symposium on Biomedical Imaging (ISBI), Athens, Greece, 27–30 May 2024; pp. 1–5. [Google Scholar]
  27. Abbas, K.; Amico, E.; Svaldi, D.O.; Tipnis, U.; Duong-Tran, D.A.; Liu, M.; Rajapandian, M.; Harezlak, J.; Ances, B.M.; Goñi, J. GEFF: Graph embedding for functional fingerprinting. NeuroImage 2020, 221, 117181. [Google Scholar] [CrossRef]
  28. Abbas, K.; Liu, M.; Wang, M.; Duong-Tran, D.; Tipnis, U.; Amico, E.; Kaplan, A.D.; Dzemidzic, M.; Kareken, D.; Ances, B.M.; et al. Tangent functional connectomes uncover more unique phenotypic traits. iScience 2023, 26, 107624. [Google Scholar] [CrossRef]
  29. Chiêm, B.; Abbas, K.; Amico, E.; Duong-Tran, D.A.; Crevecoeur, F.; Goñi, J. Improving functional connectome fingerprinting with degree-normalization. Brain Connect. 2022, 12, 180–192. [Google Scholar] [CrossRef]
  30. Finn, E.S.; Shen, X.; Scheinost, D.; Rosenberg, M.D.; Huang, J.; Chun, M.M.; Papademetris, X.; Constable, R.T. Functional connectome fingerprinting: Identifying individuals using patterns of brain connectivity. Nat. Neurosci. 2015, 18, 1664. [Google Scholar] [CrossRef]
  31. Amico, E.; Goñi, J. The quest for identifiability in human functional connectomes. Sci. Rep. 2018, 8, 8254. [Google Scholar] [CrossRef]
  32. Garai, S.; Xu, F.; Duong-Tran, D.A.; Zhao, Y.; Shen, L. Mining correlation between fluid intelligence and whole-brain large scale structural connectivity. AMIA Summits Transl. Sci. Proc. 2023, 2023, 225. [Google Scholar] [PubMed]
  33. Garai, S.; Vo, S.; Blank, L.; Xu, F.; Chen, J.; Duong-Tran, D.; Zhao, Y.; Shen, L. Quantifying Contributions from Topological Cycles in the Brain Network towards Cognition. bioRxiv 2024. [Google Scholar] [CrossRef]
  34. Schaefer, A.; Kong, R.; Gordon, E.M.; Laumann, T.O.; Zuo, X.N.; Holmes, A.J.; Eickhoff, S.B.; Yeo, B.T. Local-global parcellation of the human cerebral cortex from intrinsic functional connectivity MRI. Cereb. Cortex 2018, 28, 3095–3114. [Google Scholar] [CrossRef] [PubMed]
  35. Glasser, M.F.; Coalson, T.S.; Robinson, E.C.; Hacker, C.D.; Harwell, J.; Yacoub, E.; Ugurbil, K.; Andersson, J.; Beckmann, C.F.; Jenkinson, M.; et al. A multi-modal parcellation of human cerebral cortex. Nature 2016, 536, 171–178. [Google Scholar] [CrossRef]
  36. Salehi, M.; Greene, A.S.; Karbasi, A.; Shen, X.; Scheinost, D.; Constable, R.T. There is no single functional atlas even for a single individual: Parcellation of the human brain is state dependent. bioRxiv 2018. [Google Scholar] [CrossRef]
  37. Abbas, K.; Liu, M.; Venkatesh, M.; Amico, E.; Kaplan, A.D.; Ventresca, M.; Pessoa, L.; Harezlak, J.; Goñi, J. Geodesic distance on optimally regularized functional connectomes uncovers individual fingerprints. Brain Connect. 2021, 11, 333–348. [Google Scholar] [CrossRef]
  38. Váša, F.; Bullmore, E.T.; Patel, A.X. Probabilistic thresholding of functional connectomes: Application to schizophrenia. NeuroImage 2018, 172, 326–340. [Google Scholar] [CrossRef]
  39. Yu, Z.; Qin, J.; Xiong, X.; Xu, F.; Wang, J.; Hou, F.; Yang, A. Abnormal topology of brain functional networks in unipolar depression and bipolar disorder using optimal graph thresholding. Prog. Neuro-Psychopharmacol. Biol. Psychiatry 2020, 96, 109758. [Google Scholar] [CrossRef]
  40. Langer, N.; Pedroni, A.; Jäncke, L. The problem of thresholding in small-world network analysis. PLoS ONE 2013, 8, e53199. [Google Scholar] [CrossRef]
  41. van den Heuvel, M.P.; de Lange, S.C.; Zalesky, A.; Seguin, C.; Yeo, B.T.; Schmidt, R. Proportional thresholding in resting-state fMRI functional connectivity networks and consequences for patient-control connectome studies: Issues and recommendations. NeuroImage 2017, 152, 437–449. [Google Scholar] [CrossRef]
  42. Esfahlani, F.Z.; Sayama, H. A percolation-based thresholding method with applications in functional connectivity analysis. In Proceedings of the International Workshop on Complex Networks; Springer: Berlin/Heidelberg, Germany, 2018; pp. 221–231. [Google Scholar]
  43. Bielczyk, N.Z.; Walocha, F.; Ebel, P.W.; Haak, K.V.; Llera, A.; Buitelaar, J.K.; Glennon, J.C.; Beckmann, C.F. Thresholding functional connectomes by means of mixture modeling. NeuroImage 2018, 171, 402–414. [Google Scholar] [CrossRef] [PubMed]
  44. Lee, H.; Chung, M.K.; Kang, H.; Kim, B.N.; Lee, D.S. Discriminative persistent homology of brain networks. In Proceedings of the 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, Chicago, IL, USA, 30 March–2 April 2011; pp. 841–844. [Google Scholar]
  45. Lee, H.; Kang, H.; Chung, M.K.; Kim, B.N.; Lee, D.S. Persistent brain network homology from the perspective of dendrogram. IEEE Trans. Med. Imaging 2012, 31, 2267–2277. [Google Scholar] [PubMed]
  46. Dimitriadis, S.I.; Salis, C.; Tarnanas, I.; Linden, D.E. Topological filtering of dynamic functional brain networks unfolds informative chronnectomics: A novel data-driven thresholding scheme based on orthogonal minimal spanning trees (OMSTs). Front. Neuroinform. 2017, 11, 28. [Google Scholar] [CrossRef]
  47. Gorbach, T.; Lundquist, A.; de Luna, X.; Nyberg, L.; Salami, A. A Hierarchical Bayesian Mixture Modeling Approach for Analysis of Resting-State Functional Brain Connectivity: An Alternative to Thresholding. Brain Connect. 2020, 10, 202–211. [Google Scholar] [CrossRef]
  48. Glasser, M.F.; Sotiropoulos, S.N.; Wilson, J.A.; Coalson, T.S.; Fischl, B.; Andersson, J.L.; Xu, J.; Jbabdi, S.; Webster, M.; Polimeni, J.R.; et al. The minimal preprocessing pipelines for the Human Connectome Project. NeuroImage 2013, 80, 105–124. [Google Scholar] [CrossRef] [PubMed]
  49. Tian, Y.; Margulies, D.S.; Breakspear, M.; Zalesky, A. Hierarchical organization of the human subcortex unveiled with functional connectivity gradients. bioRxiv 2020. [Google Scholar] [CrossRef]
  50. Amico, E.; Goñi, J. Mapping hybrid functional-structural connectivity traits in the human connectome. Netw. Neurosci. 2018, 2, 306–322. [Google Scholar] [CrossRef] [PubMed]
  51. Van Essen, D.; Ugurbil, K.; Auerbach, E.; Barch, D.; Behrens, T.; Bucholz, R.; Chang, A.; Chen, L.; Corbetta, M.; Curtiss, S.; et al. The Human Connectome Project: A data acquisition perspective. NeuroImage 2012, 62, 2222–2231. [Google Scholar] [CrossRef]
  52. Van Essen, D.C.; Smith, S.M.; Barch, D.M.; Behrens, T.E.; Yacoub, E.; Ugurbil, K.; Wu-Minn HCP Consortium. The WU-Minn human connectome project: An overview. Neuroimage 2013, 80, 62–79. [Google Scholar] [CrossRef]
  53. Abbe, E. Community detection and stochastic block models: Recent developments. J. Mach. Learn. Res. 2017, 18, 6446–6531. [Google Scholar]
  54. Betzel, R.F.; Medaglia, J.D.; Bassett, D.S. Diversity of meso-scale architecture in human and non-human connectomes. Nat. Commun. 2018, 9, 346. [Google Scholar] [CrossRef] [PubMed]
  55. Faskowitz, J.; Yan, X.; Zuo, X.N.; Sporns, O. Weighted stochastic block models of the human connectome across the life span. Sci. Rep. 2018, 8, 12997. [Google Scholar] [CrossRef] [PubMed]
  56. Fornito, A.; Zalesky, A.; Breakspear, M. The connectomics of brain disorders. Nat. Rev. Neurosci. 2015, 16, 159. [Google Scholar] [CrossRef] [PubMed]
  57. Fortunato, S. Community detection in graphs. Phys. Rep. 2010, 486, 75–174. [Google Scholar] [CrossRef]
  58. Fortunato, S.; Hric, D. Community detection in networks: A user guide. Phys. Rep. 2016, 659, 1–44. [Google Scholar] [CrossRef]
  59. Fornito, A.; Zalesky, A.; Bullmore, E. Fundamentals of Brain Network Analysis; Academic Press: Cambridge, MA, USA, 2016. [Google Scholar]
  60. Newman, M.E.; Girvan, M. Finding and evaluating community structure in networks. Phys. Rev. E 2004, 69, 026113. [Google Scholar] [CrossRef]
  61. Newman, M.E. Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 2006, 74, 036104. [Google Scholar] [CrossRef]
  62. Newman, M.E. Modularity and community structure in networks. Proc. Natl. Acad. Sci. USA 2006, 103, 8577–8582. [Google Scholar] [CrossRef]
  63. Sporns, O. Network attributes for segregation and integration in the human brain. Curr. Opin. Neurobiol. 2013, 23, 162–171. [Google Scholar] [CrossRef]
  64. Karrer, B.; Newman, M.E. Stochastic blockmodels and community structure in networks. Phys. Rev. E 2011, 83, 016107. [Google Scholar] [CrossRef]
  65. van den Heuvel, M.P.; Stam, C.J.; Boersma, M.; Pol, H.H. Small-world and scale-free organization of voxel-based resting-state functional connectivity in the human brain. NeuroImage 2008, 43, 528–539. [Google Scholar] [CrossRef] [PubMed]
  66. Bassett, D.S.; Bullmore, E.T.; Meyer-Lindenberg, A.; Apud, J.A.; Weinberger, D.R.; Coppola, R. Cognitive fitness of cost-efficient brain functional networks. Proc. Natl. Acad. Sci. USA 2009, 106, 11747–11752. [Google Scholar] [CrossRef] [PubMed]
  67. Achard, S.; Bullmore, E. Efficiency and cost of economical brain functional networks. PLoS Comput. Biol. 2007, 3, e17. [Google Scholar] [CrossRef] [PubMed]
  68. Barch, D.M.; Burgess, G.C.; Harms, M.P.; Petersen, S.E.; Schlaggar, B.L.; Corbetta, M.; Glasser, M.F.; Curtiss, S.; Dixit, S.; Feldt, C.; et al. Function in the human connectome: Task-fMRI and individual differences in behavior. NeuroImage 2013, 80, 169–189. [Google Scholar] [CrossRef]
  69. Marcus, D.; Harwell, J.; Olsen, T.; Hodge, M.; Glasser, M.; Prior, F.; Jenkinson, M.; Laumann, T.; Curtiss, S.; Van Essen, D. Informatics and data mining tools and strategies for the human connectome project. Front. Neuroinform. 2011, 5, 4. [Google Scholar] [CrossRef]
  70. Peixoto, T.P. Nonparametric weighted stochastic block models. Phys. Rev. E 2018, 97, 012306. [Google Scholar] [CrossRef] [PubMed]
  71. Power, J.D.; Mitra, A.; Laumann, T.O.; Snyder, A.Z.; Schlaggar, B.L.; Petersen, S.E. Methods to detect, characterize, and remove motion artifact in resting state fMRI. NeuroImage 2014, 84, 320–341. [Google Scholar] [CrossRef]
  72. Smith, S.M.; Beckmann, C.F.; Andersson, J.; Auerbach, E.J.; Bijsterbosch, J.; Douaud, G.; Duff, E.; Feinberg, D.A.; Griffanti, L.; Harms, M.P.; et al. Resting-state fMRI in the human connectome project. NeuroImage 2013, 80, 144–168. [Google Scholar] [CrossRef]
  73. Vinh, N.X.; Epps, J.; Bailey, J. Information theoretic measures for clusterings comparison: Variants, properties, normalization and correction for chance. J. Mach. Learn. Res. 2010, 11, 2837–2854. [Google Scholar]
Figure 1. Pseudo-code for reconFC routine using the number of individual FCs Γ , Schaefer granularity n, number of functional networks k, a priori partition σ , parameter space S , community assignment likelihood P, and connectivity pattern matrix W. Note that S could contain d i m ( S ) parameters in general. In addition, in line 14, the notation G τ ( ) represents a particular configuration of matrix G satisfies the parameter space S represented by the value of τ ( ) .
Figure 1. Pseudo-code for reconFC routine using the number of individual FCs Γ , Schaefer granularity n, number of functional networks k, a priori partition σ , parameter space S , community assignment likelihood P, and connectivity pattern matrix W. Note that S could contain d i m ( S ) parameters in general. In addition, in line 14, the notation G τ ( ) represents a particular configuration of matrix G satisfies the parameter space S represented by the value of τ ( ) .
Mathematics 12 02967 g001
Figure 2. Example of the FC reconstruction routine based on the Schaefer granularity level of 100 nodes and resting-state fMRI with scanning pattern L R . Note that the for-loop indicated by ( ) is used to find the individualized optimal threshold for each subject, τ o p t γ . One can substitute this for-loop by finding the unique cohort optimal threshold, τ o p t G A , using the group-average FC.
Figure 2. Example of the FC reconstruction routine based on the Schaefer granularity level of 100 nodes and resting-state fMRI with scanning pattern L R . Note that the for-loop indicated by ( ) is used to find the individualized optimal threshold for each subject, τ o p t γ . One can substitute this for-loop by finding the unique cohort optimal threshold, τ o p t G A , using the group-average FC.
Mathematics 12 02967 g002
Figure 3. Panel (A) illustrates the weak-recoverability sub-interval of τ ( a w , b w ) [ 0 , 1 ] as a function of Schaefer granularity (Step 2), in which the blue line represents the interval upper bounds and the red line represents the lower ones. Panel (B) is the 5th and 95th percentile of individual subjects’ SNR for four distinct Schaefer parcellations n = 100 , 300 , 600 , 900 . Panel (C) illustrates the SNR null models. Panel (D) is the FC density, on a logarithmic scale, across the same 4 granularity levels. Panel (E) shows SNR profiles computed on group-average FCs (again, over the same granularity levels). Finally, Panel (F) reports the optimal threshold τ o p t computed based on the maximum SNR of group-average FCs. Note that in Panels (D,E), the weak-recoverability sub-intervals use the maximum and minimum values for the upper and lower bound, respectively, across Schaefer parcellations.
Figure 3. Panel (A) illustrates the weak-recoverability sub-interval of τ ( a w , b w ) [ 0 , 1 ] as a function of Schaefer granularity (Step 2), in which the blue line represents the interval upper bounds and the red line represents the lower ones. Panel (B) is the 5th and 95th percentile of individual subjects’ SNR for four distinct Schaefer parcellations n = 100 , 300 , 600 , 900 . Panel (C) illustrates the SNR null models. Panel (D) is the FC density, on a logarithmic scale, across the same 4 granularity levels. Panel (E) shows SNR profiles computed on group-average FCs (again, over the same granularity levels). Finally, Panel (F) reports the optimal threshold τ o p t computed based on the maximum SNR of group-average FCs. Note that in Panels (D,E), the weak-recoverability sub-intervals use the maximum and minimum values for the upper and lower bound, respectively, across Schaefer parcellations.
Mathematics 12 02967 g003
Figure 4. Individualized optimal threshold is derived using the SNR behavior of each individual for 4 distinct Schaefer’s parcellations n = { 100 , 300 , 600 , 900 } .
Figure 4. Individualized optimal threshold is derived using the SNR behavior of each individual for 4 distinct Schaefer’s parcellations n = { 100 , 300 , 600 , 900 } .
Mathematics 12 02967 g004
Figure 5. Maximum S N R computed for Resting state of Schaefer Group-average FC with n = 300 for increasing scanning lengths, starting at 100 to 1000 time points, increments of 100 each.
Figure 5. Maximum S N R computed for Resting state of Schaefer Group-average FC with n = 300 for increasing scanning lengths, starting at 100 to 1000 time points, increments of 100 each.
Mathematics 12 02967 g005
Figure 6. fMRI task and rest SNR profiles. For each row of panels, the left one represents SNR behavior across threshold range τ [ 0 , 1 ] using the maximum scanning length for all fMRI tasks and rest. The gray shade represents the 5th to 95th percentile of SNR task regime across all fMRI tasks and the entire threshold range. In the right panel, SNR profiles for task- and resting-state fMRI are computed using 166 time points corresponding to the scanning length of the shortest scanned task, i.e., EMOT. Results are for 4 Schaefer parcellation levels: n = [ 100 , 300 , 600 , 900 ] .
Figure 6. fMRI task and rest SNR profiles. For each row of panels, the left one represents SNR behavior across threshold range τ [ 0 , 1 ] using the maximum scanning length for all fMRI tasks and rest. The gray shade represents the 5th to 95th percentile of SNR task regime across all fMRI tasks and the entire threshold range. In the right panel, SNR profiles for task- and resting-state fMRI are computed using 166 time points corresponding to the scanning length of the shortest scanned task, i.e., EMOT. Results are for 4 Schaefer parcellation levels: n = [ 100 , 300 , 600 , 900 ] .
Mathematics 12 02967 g006
Figure 7. Panel (A)—left figure represents the modularity score of a thresholded group-average FC across threshold range τ [ 0 , 0.85 ] . Panel (A)—the right figure reports the normalized mutual information between the inferred partition (using Q score Maximization heuristics) and Yeo’s FN partition. The same order goes to Panel (B). Panel (B) represents the results of the SNR approach. Note that the full threshold range is not necessary because, in the sub-interval τ [ 0.9 , 1.0 ] , the thresholded graph is almost (if not) empty. The displayed result is for the group-average FCs, over four Schaefer granularity levels n = [ 100 , 300 , 600 , 900 ] .
Figure 7. Panel (A)—left figure represents the modularity score of a thresholded group-average FC across threshold range τ [ 0 , 0.85 ] . Panel (A)—the right figure reports the normalized mutual information between the inferred partition (using Q score Maximization heuristics) and Yeo’s FN partition. The same order goes to Panel (B). Panel (B) represents the results of the SNR approach. Note that the full threshold range is not necessary because, in the sub-interval τ [ 0.9 , 1.0 ] , the thresholded graph is almost (if not) empty. The displayed result is for the group-average FCs, over four Schaefer granularity levels n = [ 100 , 300 , 600 , 900 ] .
Mathematics 12 02967 g007
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Duong-Tran, D.; Nguyen, N.; Mu, S.; Chen, J.; Bao, J.; Xu, F.H.; Garai, S.; Cadena-Pico, J.; Kaplan, A.D.; Chen, T.; et al. A Principled Framework to Assess the Information-Theoretic Fitness of Brain Functional Sub-Circuits. Mathematics 2024, 12, 2967. https://doi.org/10.3390/math12192967

AMA Style

Duong-Tran D, Nguyen N, Mu S, Chen J, Bao J, Xu FH, Garai S, Cadena-Pico J, Kaplan AD, Chen T, et al. A Principled Framework to Assess the Information-Theoretic Fitness of Brain Functional Sub-Circuits. Mathematics. 2024; 12(19):2967. https://doi.org/10.3390/math12192967

Chicago/Turabian Style

Duong-Tran, Duy, Nghi Nguyen, Shizhuo Mu, Jiong Chen, Jingxuan Bao, Frederick H. Xu, Sumita Garai, Jose Cadena-Pico, Alan David Kaplan, Tianlong Chen, and et al. 2024. "A Principled Framework to Assess the Information-Theoretic Fitness of Brain Functional Sub-Circuits" Mathematics 12, no. 19: 2967. https://doi.org/10.3390/math12192967

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop