Next Article in Journal
Cardiac Function and Structure before and after Mild SARS-CoV-2 Infection in Elite Athletes Using Biventricular and Left Atrial Strain
Previous Article in Journal
The Use of Anaplastic Lymphoma Kinase Inhibitors in Non-Small-Cell Lung Cancer Treatment—Literature Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning Reveals Microbial Taxa Associated with a Swim across the Pacific Ocean

by
Garry Lewis
1,
Sebastian Reczek
1,
Osayenmwen Omozusi
2,
Taylor Hogue
3,
Marc D. Cook
3,4 and
Jarrad Hampton-Marcell
1,*
1
Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607, USA
2
Department of Biomedical Engineering, University of Illinois at Chicago, Chicago, IL 60607, USA
3
Department of Kinesiology, North Carolina Agriculture and Technical State University, Greensboro, NC 27411, USA
4
Center of Integrative Health Disparities and Equity Research, North Carolina Agriculture and Technical State University, Greensboro, NC 27411, USA
*
Author to whom correspondence should be addressed.
Biomedicines 2024, 12(10), 2309; https://doi.org/10.3390/biomedicines12102309
Submission received: 29 August 2024 / Revised: 19 September 2024 / Accepted: 20 September 2024 / Published: 11 October 2024
(This article belongs to the Section Endocrinology and Metabolism Research)

Abstract

:
Purpose: This study aimed to characterize the association between microbial dynamics and excessive exercise. Methods: Swabbed fecal samples, body composition (percent body fat), and swimming logs were collected (n = 94) from a single individual over 107 days as he swam across the Pacific Ocean. The V4 region of the 16S rRNA gene was sequenced, generating 6.2 million amplicon sequence variants. Multivariate analysis was used to analyze the microbial community structure, and machine learning (random forest) was used to model the microbial dynamics over time using R statistical programming. Results: Our findings show a significant reduction in percent fat mass (Pearson; p < 0.01, R = −0.89) and daily swim distance (Spearman; p < 0.01, R = −0.30). Furthermore, the microbial community structure became increasingly similar over time (PERMANOVA; p < 0.01, R = −0.27). Decision-based modeling (random forest) revealed the genera Alistipes, Anaerostipes, Bifidobacterium, Butyricimonas, Lachnospira, Lachnobacterium, and Ruminococcus as important microbial biomarkers of excessive exercise for explaining variations observed throughout the swim (OOB; R = 0.893). Conclusions: We show that microbial community structure and composition accurately classify outcomes of excessive exercise in relation to body composition, blood pressure, and daily swim distance. More importantly, microbial dynamics reveal the microbial taxa significantly associated with increased exercise volume, highlighting specific microbes responsive to excessive swimming.

1. Introduction

Exercise is renowned for its positive impact on physical fitness, physiological well-being, and athletic performance, evidenced by improved body composition, reduced blood pressure, and enhanced performance metrics [1,2,3]. It has also been shown to enhance glucose uptake by upregulating glucose transporters in skeletal muscle and promoting glycogen storage, thus improving energy utilization and exercise performance [4]. While these benefits are well-documented, exercise also introduces physiological stress that may contribute to overtraining syndrome and cardiovascular risks, emphasizing the need for a holistic understanding of exercise [5,6]. The gut microbiome (GM), hosting a vast array of nearly 30 trillion microbial cells, emerges as a critical factor in this context [7]. Its functional capacity to regulate glucose homeostasis and influence metabolic pathways suggests a profound connection with the body’s response to exercise [8,9]. The relationship between exercise—its frequency and volume—and the GM is associated with body composition and cardiorespiratory fitness, suggesting a similar or shared pathway for muscular endurance and metabolic health [10,11].
Concurrently, exercise profoundly influences the GM, as evidenced by changes in alpha and beta diversity in human and murine models [12]. Adaptive GM changes feature increased diversity, with abundant short-chain fatty acid (SCFA)-producing bacterial orders like Clostridiales and taxa such as Roseburia [13,14]. These microbial populations contribute to metabolic homeostasis by producing SCFAs and enhancing immune regulation [15,16,17] and endurance exercise performance [18]. Improved endurance and strength may result from this synergy, modulating oxidative stress and inflammatory responses, enhancing metabolism, and optimizing energy expenditure during intense exercise [19]. Further underlining the GM’s importance, research in murine models reveals that a depleted GM can lead to decreased muscular endurance and glycogen availability, which are reversed with GM restoration [20]. Moreover, increased cardiorespiratory fitness (i.e., VO2max) has been linked to increased GM diversity and elevated butyrate levels [21]. Despite these positive associations, there is a critical need to explore how these effects might vary or reverse under conditions of excessive exercise (EE). According to O’Keefe [22], EE can be induced with as little as four to five hours of vigorous exercise within a single week.
While exercise is widely recognized for its positive health benefits, there is a threshold beyond which exercise can become detrimental. Individuals who engage in EE have a higher incidence of illness and overuse injuries, with the most common injury being stress (fatigue) bone fractures. Furthermore, EE is associated with poor nutritional and psychological health, decreased energy availability, and worse exercise performance [23]. At minimum, sufficient recovery from EE involves at least one full rest day per week [22]. Assessing EE in relation to the GM, Yuan et al. [24] observed that EE is associated with decreased microbial diversity when studied within a murine model. Importantly, EE affects the GM by increasing inflammation and gut permeability, thereby increasing the risk of pathogen entry into the body [25].
To further understand the impact of EE and the GM, we analyzed a case study of a 51-year-old man who attempted to traverse the Pacific Ocean, logging 1212.59 nautical miles over the course of 107 days. Our study focuses on a continuous time-series analysis, providing concise change points for composition, structure, performance, and physiology. Based on the assumption that certain microbial profiles are optimized for endurance performance, we hypothesize that structural and compositional changes in the GM, driven by the high frequency and volume of swimming, can predict the overall time course of exercise. To our knowledge, this is the first study to monitor microbial dynamics and EE at this magnitude. Our research provides an interpretation of the GM when exercise transitions from beneficial to adverse, which is often overlooked when studying host–microbe interactions and exercise.

2. Methods

2.1. Study Design

A 51-year-old Caucasian man embarked on a 107-day swim across the Pacific Ocean, encompassing 1212.59 nautical miles. The participant received an explanation of the study and signed a written informed consent form that had been approved by the central institutional review board within the Department of Energy (protocol DOE000277). The study participant swam 8 h on average, rested on a boat, and returned to the same area the following morning to resume swimming. A physician on board monitored several health markers daily, including body fat composition (waist-to-height ratio), arterial oxygen saturation via a pulse oximeter (SaO2), heart rate (HR), and systolic and diastolic blood pressure, to assess changes in both physiological and physical measures. Over the 107-day swim, swabbed fecal samples were collected at 94 timepoints (n = 94). GPS technology was used to record daily swimming distances and latitudinal and longitudinal coordinates; wind speed and direction, wave height and direction, and weather conditions were also measured.

2.2. Sample Processing and Sequencing

Daily swabbed fecal samples were collected from the participant. Fecal samples were stored in a −20 °C freezer during the swim. Following the conclusion, samples were shipped on dry ice to Argonne National Laboratory and stored in a −80 °C freezer until downstream processing. Genomic DNA was extracted following Qiagen’s QIAamp Fast DNA Stool Mini Kit (Germantown, MD, USA) suggested protocol. Libraries were generated using forwarded barcoded primers according to the American Gut Project, and the Greengenes database version 13.8 was used for 16S rRNA sequencing. Utilizing the V4 region of the 16S rRNA gene, microbial sequence data were quality-filtered and assigned to corresponding samples based on their 12-bp error-correcting Golay barcodes using QIIME2. Amplicon sequence variants were generated using deblur, and the amplicon sequence variants (ASVs) were analyzed using Quantitative Insight into Microbial Ecology 2 (QIIME2) and R Statistical Software (v4.2.1). Microbial sequences appearing only once were discarded and not included in downstream analyses.

2.3. Statistical Analysis

R statistical language (v4.2.1) and RStudio (v4.2.1) were used for all analyses. The participant’s swim was mapped with coordinate data using the R packages ‘sf’, ‘rnaturalearth’, ‘googleway’, ‘cowplot’, ‘ggrepel’, and ‘ggspatial’. Pearson correlations of swim (daily distance), blood pressure, and body fat composition were plotted over time with a statistical significance cutoff of p < 0.05. Alpha diversity measures (inverse Simpson and Shannon indices) were generated using the R package ‘microbiome’, and linear regression models (LRMs) were applied to test whether changes over time were significant (p < 0.05). Wilcoxon tests were performed to test if differences between means of groups were statistically significant (p < 0.05).
Bray–Curtis distances were generated to assess dissimilarities in microbial community structure using the R packages ‘microbiome’, ‘phyloseq’, and ‘vegan’. K-means clustering, a type of unsupervised learning model, was applied to microbial composition to identify microbial subcommunity prevalence over time. This method groups similar observations by placing data points into groups (k’s) that are most similar based on average values. K-means clustering was applied to generate 1–10 microbial subcommunities using Bray–Curtis distances. To determine the best number of groups (k), we used a silhouette analysis, which compares how closely an observation matches its group versus others. This analysis, conducted with the ‘fpc’ and ‘cluster’ R packages, assigns scores from −1 to 1 to each observation, with a high score indicating a good fit within its group. Clusters were visualized with a dimensional reduction using principal coordinate analysis (PCoA), and a permutational analysis of variance (PERMANOVA) test assessed if community-level differences were significant. Additionally, a density plot over time determined temporal and dominance interactions between subcommunities.
Decision trees (supervised learning) were applied to identify microbial taxa important in shaping microbial community composition and structure over time using the R package ‘randomForest’. Random forest (RF) is a machine learning classifier that implements a series of interconnected and hierarchical decisions to analyze data. Multiple decision trees (i.e., RFs) are generated by building upon a consensus model that subsequently quantifies a variable’s importance in the model’s accuracy. The RF models used out-of-bag (OOB) estimates for the validation and accuracy assessment. Taxa were ranked based on their feature scores, calculating the mean decrease in the model’s accuracy when a given taxon was removed from training and the subsequent effect on the model’s performance. Furthermore, Spearman rank correlations were generated to observe whether taxa identified by RF models changed significantly (p < 0.05) over time. All figures were generated using the R package ‘ggplot2’ and related packages ‘ggsci’, ‘ggpubr’, ‘ggpmisc’, and ‘patchwork’. General data restructuring was performed in tandem with the ‘dplyr’ and ‘broom’ packages.

3. Results

3.1. Excessive Exercise Significantly Alters Microbial Community Structure

Over the course of 107 days, the participant swam a cumulative distance of 1212.59 nautical miles (N), averaging 17.32 N/day (±8.44) (Figure 1A). Of the 107 days at sea, the participant spent 70 days actively swimming and 37 days resting, engaging in continuous exercise for an average of 6.36 days and ranging from a minimum of 2 days to a maximum of 14 days. It was suspected that the participant would be unable to keep up with the demands of the exercise over such a prolonged duration, so a linear regression was used to analyze changes in the participant’s swim over time. The participant showed a significant reduction (Pearson, p < 0.05, R = −0.48) in daily swim distance over time, dropping below his overall average after day 57 (Figure 1B). Furthermore, when split into quartiles to assess the participant’s change in performance from start to finish, each quartile representing approximately 27 days, we observed an average swim distance of 22.39 N/day (±11.78) and 12.09 N/day (±5.25) within the first and last quartiles, respectively (Table 1, Supplementary Scheme S1), demonstrating the participant swam nearly twice as far at the beginning of the swim than at the end.
To measure exercise activity, heart rate and blood pressure were measured and assessed using a linear regression to observe changes over time. On average, the participant’s heart rate was 65.67 bpm (±6.64) (Supplementary Scheme S1); resting blood pressure was 128.28 mmHg (±11.50) (Figure 1C), and body fat composition was 19.31% (±1.41) (Figure 1D). While the participant’s average resting heart rate and body fat composition were within normal ranges [26,27], the average blood pressure was elevated [28]. There was a significant (Pearson, p < 0.05, R = 0.44) change in the participant’s blood pressure over time (Figure 1C). Comparing the participant’s systolic blood pressure between the start and end, we observed an increase from a healthy 115.40 mmHg (±5.27) in the first quartile to a stage 1 hypertensive 135.86 mmHg (±15.70) in the fourth. A linear regression of body fat composition also showed a significant (Pearson, p < 0.05, R = −0.85) decrease over time (Figure 1D). Interestingly, the heart rate (Supplementary Scheme S2) did not change significantly (Pearson, p > 0.05). The participant’s arterial oxygen saturation (SaO2) primarily remained at an expected level of 97–98% when measured, exhibiting no significant (Pearson, p > 0.05) change over time. However, notable exceptions occurred on day 68 and day 90, with SaO2 dropping to 93%.
The GM community composition was assessed following the filtering of single ASVs, with 6,237,032 sequences identified for 94 samples with a mean ASV count of 7286.25, a minimum of 10, and a maximum of 442,567. Among the identified ASVs, the phyla Firmicutes (45.44%) and Bacteroidetes (40.95%) comprised approximately 86.39% of the microbial composition on average. Bacterial diversity—both alpha and beta diversity—was analyzed to characterize the broad microbial diversity and structure changes. Shannon (emphasizing microbial richness) and inverse Simpson (emphasizing microbial evenness) indices were analyzed to measure alpha diversity. Both alpha diversity measures were fitted against a linear regression; however, neither measure of alpha diversity showed a significant change over time (LRM, p > 0.05; Supplementary Scheme S3A), suggesting microbial diversity was relatively stable during the participant’s swim.
The microbial community structure was assessed in association with time by fitting changes in Bray–Curtis dissimilarities across consecutive timepoints against a generalized linear regression. This approach allowed flexibility in dealing with the non-normal distribution of ecological data by accommodating the unique variance structures often present in microbiome studies. Over the swim, the microbial community became increasingly similar, marked by a 44.24% reduction in beta diversity from start to end (GLM, p < 0.01, R = −0.27; Supplementary Scheme S3B). While alpha diversity remained stable, beta diversity altered significantly, indicating microbial restructuring and a different microbial subcommunity by the end of the swim.

3.2. Machine Learning Reveals Microbial Taxa That Shape Swim Time Course

To further understand the association between the participant’s swim and the GM, an RF model was employed. For the participant, an RF model was used to train and predict the swim time course based on microbial composition at each timepoint. Using an out-of-bag model, the microbial composition readily predicted the swim time course (OOB; R = 0.89), demonstrating that changes in the GM highly reflected the participant’s swim. Assessing importance features, the top 10 ASVs were identified and binned at the genus level. For the identified ASVs, the genera Alistipes (11.01%), Ruminococcus (9.10%), and Bifidobacterium (9.37%) accounted for 29.48% of the model’s accuracy (Figure 2A). Applying a linear regression to their abundances across time, Ruminococcus and Bifidobacterium exhibited significant changes in relative abundance, decreasing by 40.52% and 86.97%, respectively, while Alistipes significantly increased by 73.47% (Figure 2B). The bacterial order contributing the most to the model’s accuracy was Clostridiales (58.31%). These results imply that this order and the previously identified genera may significantly influence the physiological mechanisms underlying the participant’s swim performance.
As beta diversity suggested a shift in the microbial community structure over time, microbial co-occurrences were further assessed using unsupervised machine learning. K-means clustering was employed to identify microbial subcommunities (i.e., enterotypes) as it can group data into distinct clusters based on similarities in the microbial community structure. A silhouette analysis identified the optimal number of K-means clusters based on Bray–Curtis distances and predicted the overall microbial variation within the clusters. When grouped into two clusters (k = 2), more than 70% of the microbial variation was explained (Supplementary Scheme S4). Patterns in the microbial abundance and taxa over time were also examined. A principal coordinate analysis (PCoA) demonstrated that the two enterotypes were significantly (PERMANOVA, p < 0.05, R = 0.74; Figure 3A) distinct from one another, suggesting two distinct microbial structures over the duration of the swim. Visualizing the distributions of the enterotypes over time revealed that the earlier half of the swim was dominated by enterotype 1, while the latter was dominated by enterotype 2 (Figure 3B). Examination of the subcommunities’ relative abundances revealed the dominance of Firmicutes and Bacteroidetes, which collectively comprised 77.71% of enterotype 1 (38.58% and 39.13%, respectively) and 81.11% of enterotype 2 (36.80% and 44.31%, respectively). Notably, the taxa Prevotella, Roseburia, and Sutterella were the primary discriminators between the two subcommunities (RF OOB, R = 0.90, Figure 4A), with a significant percent increase in the relative abundance of Prevotella (78.95%), Roseburia (44.08%), and Sutterella (44.68%) from enterotype 1 to enterotype 2 (Wilcoxon, p < 0.05, Figure 4B).

4. Discussion

This case study demonstrated that the GM is dynamic and responds to exercise of high volume and frequency. Notably, a shift between distinct microbial subcommunities was initially indicated by beta diversity and subsequently identified via K-means clustering. This microbial shift corresponded with a decline in key performance metrics, including swim distance, and was linked to changes in blood pressure and body fat composition. Additionally, GM composition accurately predicted the time course of exercise, potentially serving as a non-invasive biomarker for tracking endurance exercise performance. These findings highlight the influence of EE on GM composition. Our study emphasizes the value of examining the longitudinal relationship between exercise and the GM, revealing insights into microbiome dynamics that may not be apparent in comparative analyses.

4.1. Physiological Measures Indicate Maladaptive Responses to EE

We observed that the participant’s body fat percentage significantly decreased by approximately 3% during the first 60 days of the swim, consistent with reported levels of fat loss from high-volume exercise [29,30]. This may indicate that the participant was unable to maintain his energy balance, burning more calories than he was able to replace [31]. Notably, despite high exercise volume, the participant’s body fat percentage appeared to stabilize after day 60, potentially causing the body to burn fewer calories to preserve energy while increasing fatigue and risk of permanent organ damage, especially to the heart [32]. This raises important considerations regarding the participant’s cardiovascular health. The participant’s average heart rate remained at 65.67 bpm (±6.64) throughout the study. Ordinarily, one might anticipate a short-term increase in heart rate during active periods as an adaptation to supply the muscles with more blood [33] or a long-term decrease in resting heart rate as a marker of enhanced cardiovascular efficiency to exercise [34]. However, an unchanging heart rate over time suggests that the expected adaptations may not have materialized in response to EE.
Despite SaO2 also not significantly changing, the participant experienced clinically significant changes (≥4%), indicating hypoxemia where the blood was deprived of adequate oxygen supply [35]. Simultaneously, our analysis revealed that the participant’s blood pressure increased significantly from normotensive levels (~120 mmHg) to stage 2 hypertension (≥140 mmHg), peaking at 156 mmHg on multiple occasions and averaging 128.28 mmHg overall, which is indicative of EE (Figure 1C). This increase contrasts with the typical post-exercise blood pressure reduction of 2–4 mmHg in normotensive individuals and 5–8 mmHg in hypertensive individuals [28]. An increase in blood pressure may indicate a heightened effort to supply oxygen and nutrients to muscles during the participant’s intense, prolonged swimming sessions [36]. The blood pressure increase may be related to a lack of energy availability coinciding with a significant reduction in body fat, which indicates that caloric intake was insufficient to maintain fat mass [37]. Overall, this suggests that the participant’s heart was under considerable strain while under the demands of the exercise [28], with a potentially increased risk of developing cardiovascular disease.
Concurrently, a decline in swim performance over time was observed (Figure 1B), suggesting fatigue and insufficient recovery, characteristics often associated with EE [23]. While we did not directly measure the participant’s exercise intensity, swimming typically has a METs (metabolic equivalents) score in the range of 6 to 11, considered to be vigorous intensity [38,39]. Based on this, it is likely that our participant engaged in intense exercise. However, without specific METs measurements for our participant, this inference should be made cautiously. The participant’s blood pressure readings, which frequently exceeded 150 mmHg, might reflect such physiological strain, but this cannot be conclusively established from our data. Together, the observed rise in blood pressure, alongside a significant decrease in body fat composition and diminished swimming performance, raises concerns about the sustainability of the participant’s exercise, hinting at an imbalance where the physical demands of exercise may surpass the body’s capacity to adapt and recover.

4.2. Microbial Diversity and EE

Previous studies have indicated that professional athletes, such as rugby players, exhibit a higher GM alpha diversity than non-athletes [13]. This enhanced diversity is often attributed to their distinct physical fitness levels and dietary habits [40]. Often, greater alpha diversity is associated with beneficial health outcomes such as improved bowel health [41] and reduced incidence of various acute and chronic diseases [42]. However, the impact of exercise on GM diversity is not uniform across all studies, and exercise studies have yielded mixed results. Some report increased alpha diversity following exercise, while more than half observe no significant changes [43]. This inconsistency underscores a notable gap in our understanding of how prolonged, high-volume exercise influences GM diversity, especially in professional and endurance athletes.
In the case of our 51-year-old participant, no change in alpha diversity was observed during the swim. This result coincides with a previous study of triathlon athletes, where no significant change was reported in fecal microbiota richness and evenness pre- and post-race [44]. This similarity suggests that certain types of high-volume exercise may not notably affect alpha diversity. However, an unchanging alpha diversity may align with the participant’s health status, which included a rise in blood pressure to hypertensive levels and potential signs of starvation [28,45]. However, this pattern is inconsistent with findings from Sun et al. [46], who reported a significant, inverse relationship between alpha diversity (Shannon index) and systolic blood pressure. Finally, considering the participant’s age, the literature suggests diminished responsiveness of the GM, particularly alpha diversity, to exercise in older individuals. Soltys et al. [47] noted that older adult athletes engaging in endurance exercise did not experience significant changes in their GM, indicating a potential age-related decline in GM responsiveness to exercise. This suggests that our participant’s age may have contributed to the observed stability in alpha diversity despite his high volume of physical activity.
Additionally, a marked reduction in beta diversity was observed in our participant, indicative of a shift toward a more similar and stable microbial community over the study’s time course. This aligns with Kern et al. [48], who report a significant reduction in beta diversity in obese participants following vigorous exercise over 6 months. A similar reduction in beta diversity was also observed in triathlon athletes [44]. Increased GM homogeneity may indicate dysbiosis, because a healthy microbiome, capable of responding to its environment, is characterized by both dynamic microbial turnover and compositional stability [49].
Our results implicate a dichotomous state of the GM post-exercise: resilient in maintaining species richness yet potentially trending toward a less diverse (more stable) composition. This reflects the complex interactions between the maintenance of microbial homeostasis and the onset of dysbiosis, as the GM has been observed to stabilize with beneficial or harmful bacteria [50]. Consequently, our findings invite further exploration into the functional consequences of exercise-induced shifts in the gut microbiota and their longitudinal impact on host well-being.

4.3. GM Alterations: Microbial Adaptations and Maladaptations in Response to EE

A significant increase was observed in Alistipes over time. While Alistipes has been reported to exert some beneficial effects, it is more commonly linked with unfavorable health conditions such as hypertension [51,52]. Importantly, we report a significant increase in the prevalence of Alistipes in our participant’s GM over time. This observed relationship prompts additional research on the possible contributory role of Alistipes in blood pressure regulation, especially under the stress of high-volume, prolonged exercise.
We also observed a significant decrease in beneficial SCFA-producing bacteria such as Bifidobacterium and Ruminococcus throughout the participant’s swim. The reduction in Bifidobacterium, known for its health-promoting effects including immunomodulation and maintaining gut-barrier integrity, may be related to our participant’s declining performance, as these bacteria contribute to overall health and energy metabolism, although they are typically enriched by exercise [53,54]. Similarly, Ruminococcus, involved in complex carbohydrate breakdown, is associated with many exercise models and often with beneficial health outcomes. Its response to exercise (i.e., change in relative abundance) may indicate alterations in carbohydrate metabolism and energy balance that impact individuals’ performance under physical stress [55,56]. Considering this, Ruminococcus is implicated as a microbial biomarker for physical performance in our study, just like Bifidobacterium, as these SCFA-producers significantly decreased alongside the participant’s daily swim distance.
Examination of the GM revealed two distinct subcommunities: the first subcommunity dominated the initial 57 days of the swim, with subcommunity two superseding it from day 58 onward. Further investigation into these microbial subcommunities revealed a significant increase in characteristic taxa such as Prevotella, Roseburia, and Sutterella. An increase in Prevotella, often implicated in health concerns like inflammation, hypertension, and dysbiosis [57], occurred alongside increased blood pressure in our participant, suggesting that Prevotella may act as a useful biomarker for hypertension. On the other hand, Roseburia is associated with gut health and lower disease rates [58,59], which may indicate a protective role in the GM of our participant under the physiological stress of EE. Prevotella and Roseburia are associated with high sodium intake, which raises the possibility that salt water could be a contributing factor to the GM [60]. The role of Sutterella, however, remains more elusive, being prevalent in the human gastrointestinal tract as a commensal organism with mild anti- and pro-inflammatory properties [61,62,63,64]. The prevalence of Sutterella significantly increased over time, which may indicate an adaptive immune or inflammatory response of the GM to the conditions faced by our participant.
Our case study reveals dynamic shifts in the participant’s GM, particularly in the latter phase of the swim, where there was a notable shift toward microbial profiles commonly linked to physiological stress [51,52,57]. Accompanying this microbial shift were significant changes in performance and physiological markers, suggesting a link between EE and microbial composition. This likely reflects a selection pressure or environmental influence that caused certain microbial species to become more prevalent, indicating potential microbial adaptation to the swimming conditions.

4.4. Limitations and Future Directions

This study is subject to several limitations. Primarily, the one-of-a-kind design of this study, involving swimming across the Pacific Ocean, is difficult to replicate, although ultramarathon studies, examining exercise in relation to the GM, may closely replicate our data. Despite this challenge, our study provides a clear direction for future research of exercise-associated GM dynamics by identifying specific taxa and physiological changes. Another concern is the potential impact of swallowing sea water on GM composition and structure [60,65]. Despite this limitation, the observed changes in GM diversity are more likely to be diet-related.
A major limitation of this study is the absence of dietary data, as diet is known to influence the GM in ways akin to exercise, altering the microbial community structure and composition [66]. Undigested food particles act as variable substrates for metabolism by microbiota located in the colon, affecting GM diversity and function as they use these metabolites [67]. Consequently, it is unclear whether the observed GM changes are solely attributable to exercise or if dietary alterations coexisting with the participant’s exercise regimen played a role. It is possible that the initial decrease in body fat composition could have been from a lack of calorie consumption, while the stabilization may have resulted from increasing caloric intake [37]. Nevertheless, we observe a significant exercise-associated relationship with the GM characterized by changes in the relative abundance of many different microbial taxa and a dramatic shift in GM community structure.
Additionally, the absence of early blood pressure readings and VO2max data—key indicators of cardiovascular fitness and exertion [36]—complicates full understanding of the participant’s physiology. Standard exercise intensity measurements like VO2max could not be measured due to the unique study environment. However, despite this limitation, we still observed significant changes in the participant’s physiology, fitness, and performance coinciding with EE. The participant’s blood pressure escalated from healthy levels to stage 2 hypertension, which suggests a significant and maladaptive physiological change. Moreover, a significant decrease in the participant’s swim distance over time implies heightened fatigue and performance decline [23]. Together, these observations, despite the absence of VO2max data, support the findings of adverse effects of EE on performance, reinforcing the link between physical activity, environmental factors, and dietary influences on the GM.
While we observed a decline in body fat composition akin to patterns seen in extreme endurance activities like arctic trekking and ultramarathons [68], which suggests metabolic adaptation [69], the study’s design precluded direct measurement of the participant’s BMR or energy expenditure, further limiting our analysis. Furthermore, our findings on alpha diversity following EE contrast with Yuan et al. [24], although this discrepancy may arise from interspecies differences or differing study parameters. Other limitations include the lack of generalizability of our results considering the substantial inter- and intra-variability of the GM [50]. Gender-specific responses to exercise, particularly in the context of EE, could not be explored with a single male participant.
This study highlights the significant effects of long-term, high-volume exercise on the GM, emphasizing the need for further exploration into how athletes’ training and lifestyle influence the GM, particularly in endurance sports with in-season and out-of-season periods. It advocates for future research to adopt controlled designs, engage diverse participants, and employ thorough data collection. To build directly on the findings presented here, subsequent studies should include dietary analysis, assessment of environmental effects, and better detailed physiological evaluations, all aimed at eventually refining health and performance optimization strategies for athletes.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/biomedicines12102309/s1.

Author Contributions

J.H.-M. designed the study. G.L., S.R., O.O. and J.H.-M. were involved with data processing. G.L. and S.R. interpreted the data, created figures, and wrote the manuscript. G.L., S.R., O.O., T.H., M.D.C. and J.H.-M. participated in interpreting data and revising the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Approved by the central institutional review board within the Department of Energy (protocol DOE000277).

Informed Consent Statement

The participant received an explanation of the study and signed a written informed consent form approved by the central institutional review board within the Department of Energy (protocol DOE000277).

Data Availability Statement

Acknowledgments

We would like to acknowledge members of the Cook and Hampton-Marcell labs for their valuable insights. We would also like to acknowledge the DuSable Scholars, GPIP programs, and Honors College at UIC. We acknowledge the use of several R libraries that greatly facilitated our analysis and research. Detailed information about the data, code, and sources of libraries used can be found in the data availability statement.

Conflicts of Interest

The authors declare no conflict of interest. The results of this study are presented clearly, honestly, and without fabrication, falsification, or inappropriate data manipulation. The results of the present study do not constitute endorsement by the American College of Sports Medicine.

Abbreviations

ASV, amplicon sequence variant; BMR, basal metabolic rate; BP, blood pressure; EE, excessive exercise; GM, gut microbiome; LRM, linear regression model; MDA, mean decrease accuracy; MET, metabolic equivalent; OOB, out of bag; PCoA, principal coordinate analysis; PERMANOVA, permutational analysis of variance; RF, random forest; rRNA, ribosomal ribonucleic acid; SaO2, arterial oxygen saturation; SCFA, short-chain fatty acid; VO2max, maximal rate of oxygen consumption.

References

  1. Bellicha, A.; Van Baak, M.A.; Battista, F.; Beaulieu, K.; Blundell, J.E.; Busetto, L.; Carraça, E.V.; Dicker, D.; Encantado, J.; Ermolao, A.; et al. Effect of Exercise Training on Weight Loss, Body Composition Changes, and Weight Maintenance in Adults with Overweight or Obesity: An Overview of 12 Systematic Reviews and 149 Studies. Obes. Rev. 2021, 22 (Suppl. S4), e13256. [Google Scholar] [CrossRef]
  2. Pescatello, L.S.; Franklin, B.A.; Fagard, R.; Farquhar, W.B.; Kelley, G.A.; Ray, C.A. Exercise and Hypertension. Med. Sci. Sports Exerc. 2004, 36, 533–553. [Google Scholar] [CrossRef]
  3. Kramer, A. An Overview of the Beneficial Effects of Exercise on Health and Performance. In Physical Exercise for Human Health [Internet]; Xiao, J., Ed.; Advances in Experimental Medicine and Biology; Springer: Singapore, 2020; Volume 1228, pp. 3–22. Available online: http://link.springer.com/10.1007/978-981-15-1792-1_1 (accessed on 4 January 2024).
  4. Flores-Opazo, M.; McGee, S.L.; Hargreaves, M. Exercise and GLUT4. Exerc. Sport Sci. Rev. 2020, 48, 110–118. [Google Scholar] [CrossRef]
  5. O’Keefe, J.H.; Patil, H.R.; Lavie, C.J.; Magalski, A.; Vogel, R.A.; McCullough, P.A. Potential Adverse Cardiovascular Effects From Excessive Endurance Exercise. Mayo Clin. Proc. 2012, 87, 587–595. [Google Scholar] [CrossRef]
  6. Meeusen, R.; Duclos, M.; Foster, C.; Fry, A.; Gleeson, M.; Nieman, D.; Raglin, J.; Rietjens, G.; Steinacker, J.; Urhausen, A. Prevention, Diagnosis, and Treatment of the Overtraining Syndrome: Joint Consensus Statement of the European College of Sport Science and the American College of Sports Medicine. Med. Sci. Sports Exerc. 2013, 45, 186–205. [Google Scholar] [CrossRef]
  7. Sender, R.; Fuchs, S.; Milo, R. Revised Estimates for the Number of Human and Bacteria Cells in the Body. PLoS Biol. 2016, 14, e1002533. [Google Scholar] [CrossRef]
  8. Utzschneider, K.M.; Kratz, M.; Damman, C.J.; Hullarg, M. Mechanisms Linking the Gut Microbiome and Glucose Metabolism. J. Clin. Endocrinol. Metab. 2016, 101, 1445–1454. [Google Scholar] [CrossRef]
  9. Moghetti, P.; Bacchi, E.; Brangani, C.; Donà, S.; Negri, C. Metabolic Effects of Exercise. In Frontiers of Hormone Research [Internet]; Lanfranco, F., Strasburger, C.J., Eds.; S. Karger AG: Basel, Switzerland, 2016; pp. 44–57. Available online: https://www.karger.com/Article/FullText/445156 (accessed on 4 January 2024).
  10. Durk, R.P.; Castillo, E.; Márquez-Magaña, L.; Grosicki, G.J.; Bolter, N.D.; Lee, C.M.; Bagley, J.R. Gut Microbiota Composition Is Related to Cardiorespiratory Fitness in Healthy Young Adults. Int. J. Sport Nutr. Exerc. Metab. 2019, 29, 249–253. [Google Scholar] [CrossRef]
  11. John, G.K.; Mullin, G.E. The Gut Microbiome and Obesity. Curr. Oncol. Rep. 2016, 18, 45. [Google Scholar] [CrossRef]
  12. Cataldi, S.; Poli, L.; Şahin, F.N.; Patti, A.; Santacroce, L.; Bianco, A.; Greco, G.; Ghinassi, B.; Di Baldassarre, A.; Fischetti, F. The Effects of Physical Activity on the Gut Microbiota and the Gut–Brain Axis in Preclinical and Human Models: A Narrative Review. Nutrients 2022, 14, 3293. [Google Scholar] [CrossRef] [PubMed]
  13. Clarke, S.F.; Murphy, E.F.; O’Sullivan, O.; Lucey, A.J.; Humphreys, M.; Hogan, A.; Hayes, P.; O’Reilly, M.; Jeffery, I.B.; Wood-Martin, R.; et al. Exercise and Associated Dietary Extremes Impact on Gut Microbial Diversity. Gut 2014, 63, 1913–1920. [Google Scholar] [CrossRef] [PubMed]
  14. Mailing, L.J.; Allen, J.M.; Buford, T.W.; Fields, C.J.; Woods, J.A. Exercise and the Gut Microbiome: A Review of the Evidence, Potential Mechanisms, and Implications for Human Health. Exerc. Sport Sci. Rev. 2019, 47, 75–85. [Google Scholar] [CrossRef]
  15. De Vadder, F.; Kovatcheva-Datchary, P.; Goncalves, D.; Vinera, J.; Zitoun, C.; Duchampt, A.; Bäckhed, F.; Mithieux, G. Microbiota-Generated Metabolites Promote Metabolic Benefits via Gut-Brain Neural Circuits. Cell 2014, 156, 84–96. [Google Scholar] [CrossRef] [PubMed]
  16. Puddu, A.; Sanguineti, R.; Montecucco, F.; Viviani, G.L. Evidence for the Gut Microbiota Short-Chain Fatty Acids as Key Pathophysiological Molecules Improving Diabetes. Mediat. Inflamm. 2014, 2014, 162021. [Google Scholar] [CrossRef]
  17. Monda, V.; Villano, I.; Messina, A.; Valenzano, A.; Esposito, T.; Moscatelli, F.; Viggiano, A.; Cibelli, G.; Chieffi, S.; Monda, M.; et al. Exercise Modifies the Gut Microbiota with Positive Health Effects. Oxidative Med. Cell. Longev. 2017, 2017, 3831972. [Google Scholar] [CrossRef]
  18. Okamoto, T.; Morino, K.; Ugi, S.; Nakagawa, F.; Lemecha, M.; Ida, S.; Ohashi, N.; Sato, D.; Fujita, Y.; Maegawa, H. Microbiome Potentiates Endurance Exercise through Intestinal Acetate Production. Am. J. Physiol.-Endocrinol. Metab. 2019, 316, E956–E966. [Google Scholar] [CrossRef]
  19. Mach, N.; Fuster-Botella, D. Endurance exercise and gut microbiota: A review. J. Sport Health Sci. 2017, 6, 179–197. [Google Scholar] [CrossRef]
  20. Nay, K.; Jollet, M.; Goustard, B.; Baati, N.; Vernus, B.; Pontones, M.; Lefeuvre-Orfila, L.; Bendavid, C.; Rué, O.; Mariadassou, M.; et al. Gut Bacteria Are Critical for Optimal Muscle Function: A Potential Link with Glucose Homeostasis. Am. J. Physiol. -Endocrinol. Metab. 2019, 317, E158–E171. [Google Scholar] [CrossRef]
  21. Estaki, M.; Pither, J.; Baumeister, P.; Little, J.P.; Gill, S.K.; Ghosh, S.; Ahmadi-Vand, Z.; Marsden, K.R.; Gibson, D.L. Cardiorespiratory Fitness as a Predictor of Intestinal Microbial Diversity and Distinct Metagenomic Functions. Microbiome 2016, 4, 42. [Google Scholar] [CrossRef]
  22. O’Keefe, J.H.; O’Keefe, E.L.; Lavie, C.J. The Goldilocks Zone for Exercise: Not Too Little, Not Too Much. Mo. Med. 2018, 115, 98–105. [Google Scholar]
  23. Etxebarria, N.; Mujika, I.; Pyne, D. Training and Competition Readiness in Triathlon. Sports 2019, 7, 101. [Google Scholar] [CrossRef]
  24. Yuan, X.; Xu, S.; Huang, H.; Liang, J.; Wu, Y.; Li, C.; Yuan, H.; Zhao, X.; Lai, X.; Hou, S. Influence of Excessive Exercise on Immunity, Metabolism, and Gut Microbial Diversity in an Overtraining Mice Model. Scand. J. Med. Sci. Sports 2018, 28, 1541–1551. [Google Scholar] [CrossRef]
  25. Clauss, M.; Gérard, P.; Mosca, A.; Leclerc, M. Interplay Between Exercise and Gut Microbiome in the Context of Human Health and Performance. Front. Nutr. 2021, 8, 637010. [Google Scholar] [CrossRef]
  26. Olshansky, B.; Ricci, F.; Fedorowski, A. Importance of resting heart rate. Trends Cardiovasc. Med. 2022, 33, 502–515. [Google Scholar] [CrossRef]
  27. Kyle, U.G.; Schutz, Y.; Dupertuis, Y.M.; Pichard, C. Body composition interpretation. Nutrition 2003, 19, 597–604. [Google Scholar] [CrossRef]
  28. Whelton, P.K.; Carey, R.M.; Aronow, W.S.; Casey, D.E.; Collins, K.J.; Dennison Himmelfarb, C.; DePalma, S.M.; Gidding, S.; Jamerson, K.A.; Jones, D.W.; et al. 2017 ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ASH/ASPC/NMA/PCNA Guideline for the Prevention, Detection, Evaluation, and Management of High Blood Pressure in Adults: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines. Hypertension 2018, 71, 1269–1324. [Google Scholar]
  29. Lee, M.G.; Park, K.S.; Kim, D.U.; Choi, S.M.; Kim, H.J. Effects of high-intensity exercise training on body composition, abdominal fat loss, and cardiorespiratory fitness in middle-aged Korean females. Appl. Physiol. Nutr. Metab. 2012, 37, 1019–1027. [Google Scholar] [CrossRef]
  30. Langleite, T.M.; Jensen, J.; Norheim, F.; Gulseth, H.L.; Tangen, D.S.; Kolnes, K.J.; Heck, A.; Storås, T.; Grøthe, G.; Dahl, M.A.; et al. Insulin Sensitivity, Body Composition and Adipose Depots Following 12 w Combined Endurance and Strength Training in Dysglycemic and Normoglycemic Sedentary Men. Arch. Physiol. Biochem. 2016, 122, 167–179. [Google Scholar] [CrossRef]
  31. Strasser, B.; Spreitzer, A.; Haber, P. Fat Loss Depends on Energy Deficit Only, Independently of the Method for Weight Loss. Ann. Nutr. Metab. 2007, 51, 428–432. [Google Scholar] [CrossRef]
  32. Osilla, E.V.; Safadi, A.O.; Sharma, S. Calories. In StatPearls [Internet]; StatPearls Publishing: Treasure Island, FL, USA, 2023. Available online: http://www.ncbi.nlm.nih.gov/books/NBK499909/ (accessed on 4 January 2024).
  33. Shepherd, J.T. Circulatory response to exercise in health. Circulation 1987, 76 Pt 2, VI3–VI10. [Google Scholar]
  34. Reimers, A.; Knapp, G.; Reimers, C.D. Effects of Exercise on the Resting Heart Rate: A Systematic Review and Meta-Analysis of Interventional Studies. J. Clin. Med. 2018, 7, 503. [Google Scholar] [CrossRef] [PubMed]
  35. Hafen, B.; Sharma, S. Oxygen Saturation. In StatPearls [Internet]; StatPearls Publishing: Treasure Island, FL, USA, 2022. Available online: https://www.ncbi.nlm.nih.gov/books/NBK525974/ (accessed on 4 February 2024).
  36. Sharman, J.E.; LaGerche, A. Exercise blood pressure: Clinical relevance and correct measurement. J. Hum. Hypertens. 2015, 29, 351–358. [Google Scholar] [CrossRef]
  37. Purdom, T.; Cook, M.; Colleran, H.; Stewart, P.; San Diego, L. Low Energy Availability (LEA) and Hypertension in Black Division I Collegiate Athletes: A Novel Pilot Study. Sports 2023, 11, 81. [Google Scholar] [CrossRef] [PubMed]
  38. Ainsworth, B.E.; Haskell, W.L.; Whitt, M.C.; Irwin, M.L.; Swartz, A.M.; Strath, S.J.; O’Brien, W.L.; Bassett, D.R.; Schmitz, K.H.; Emplaincourt, P.O.; et al. Compendium of Physical Activities: An Update of Activity Codes and MET Intensities. Med. Sci. Sports Exerc. 2000, 32 (Suppl. S9), S498–S516. [Google Scholar] [CrossRef] [PubMed]
  39. American College of Sports Medicine; Riebe, D.; Ehrman, J.K.; Liguori, G.; Magal, M. (Eds.) ACSM’s Guidelines for Exercise Testing and Prescription, 10th ed.; Wolters Kluwer: Philadelphia, PA, USA, 2018; 472p. [Google Scholar]
  40. Mohr, A.E.; Jäger, R.; Carpenter, K.C.; Kerksick, C.M.; Purpura, M.; Townsend, J.R.; West, N.P.; Black, K.; Gleeson, M.; Pyne, D.B.; et al. The Athletic Gut Microbiota. J. Int. Soc. Sports Nutr. 2020, 17, 24. [Google Scholar] [CrossRef]
  41. Manor, O.; Dai, C.L.; Kornilov, S.A.; Smith, B.; Price, N.D.; Lovejoy, J.C.; Gibbons, S.M.; Magis, A.T. Health and Disease Markers Correlate with Gut Microbiome Composition across Thousands of People. Nat. Commun. 2020, 11, 5206. [Google Scholar] [CrossRef]
  42. Pickard, J.M.; Zeng, M.Y.; Caruso, R.; Núñez, G. Gut microbiota: Role in pathogen colonization, immune responses, and inflammatory disease. Immunol. Rev. 2017, 279, 70–89. [Google Scholar] [CrossRef]
  43. Cullen, J.M.A.; Shahzad, S.; Dhillon, J. A systematic review on the effects of exercise on gut microbial diversity, taxonomic composition, and microbial metabolites: Identifying research gaps and future directions. Front. Physiol. 2023, 14, 1292673. [Google Scholar] [CrossRef]
  44. Grosicki, G.J.; Pugh, J.; Wosinska, L.; Quilter, K.; Mattimoe, D.; Allen, J.; Joyce, S.A.; O’Sullivan, O.; Close, G.L. Ultra-Endurance Triathlon Competition Shifts Fecal Metabolome Independent of Changes to Microbiome Composition. J. Appl. Physiol. 2023, 135, 549–558. [Google Scholar] [CrossRef]
  45. Desai, M.S.; Seekatz, A.M.; Koropatkin, N.M.; Kamada, N.; Hickey, C.A.; Wolter, M.; Pudlo, N.A.; Kitamoto, S.; Terrapon, N.; Muller, A.; et al. A Dietary Fiber-Deprived Gut Microbiota Degrades the Colonic Mucus Barrier and Enhances Pathogen Susceptibility. Cell 2016, 167, 1339–1353.e21. [Google Scholar] [CrossRef]
  46. Sun, S.; Lulla, A.; Sioda, M.; Winglee, K.; Wu, M.C.; Jacobs, D.R., Jr.; Shikany, J.M.; Lloyd-Jones, D.M.; Launer, L.J.; Fodor, A.A.; et al. Gut Microbiota Composition and Blood Pressure: The CARDIA Study. Hypertension 2019, 73, 998–1006. [Google Scholar] [CrossRef]
  47. Šoltys, K.; Lendvorský, L.; Hric, I.; Baranovičová, E.; Penesová, A.; Mikula, I.; Bohmer, M.; Budiš, J.; Vávrová, S.; Grones, J.; et al. Strenuous Physical Training, Physical Fitness, Body Composition and Bacteroides to Prevotella Ratio in the Gut of Elderly Athletes. Front. Physiol. 2021, 12, 670989. [Google Scholar] [CrossRef] [PubMed]
  48. Kern, T.; Blond, M.B.; Hansen, T.H.; Rosenkilde, M.; Quist, J.S.; Gram, A.S.; Ekstrøm, C.T.; Hansen, T.; Stallknecht, B. Structured Exercise Alters the Gut Microbiota in Humans with Overweight and Obesity—A Randomized Controlled Trial. Int. J. Obes. 2020, 44, 125–135. [Google Scholar] [CrossRef] [PubMed]
  49. Priya, S.; Blekhman, R. Population dynamics of the human gut microbiome: Change is the only constant. Genome Biol. 2019, 20, 150. [Google Scholar] [CrossRef]
  50. Lozupone, C.A.; Stombaugh, J.I.; Gordon, J.I.; Jansson, J.K.; Knight, R. Diversity, stability and resilience of the human gut microbiota. Nature 2012, 489, 220–230. [Google Scholar] [CrossRef]
  51. Jie, Z.; Xia, H.; Zhong, S.-L.; Feng, Q.; Li, S.; Liang, S.; Zhong, H.; Liu, Z.; Gao, Y.; Zhao, H.; et al. The Gut Microbiome in Atherosclerotic Cardiovascular Disease. Nat. Commun. 2017, 8, 845. [Google Scholar] [CrossRef]
  52. Parker, B.J.; Wearsch, P.A.; Veloo, A.C.M.; Rodriguez-Palacios, A. The Genus Alistipes: Gut Bacteria With Emerging Implications to Inflammation, Cancer, and Mental Health. Front. Immunol. 2020, 11, 906. [Google Scholar] [CrossRef]
  53. Hidalgo-Cantabrana, C.; Delgado, S.; Ruiz, L.; Ruas-Madiedo, P.; Sánchez, B.; Margolles, A. Bifidobacteria and Their Health-Promoting Effects. In Bugs as Drugs: Therapeutic Microbes for the Prevention and Treatment of Disease; Britton, R.A., Cani, P.D., Eds.; John Wiley & Sons: Hoboken, NJ, USA, 2017; Microbiol. Spectr.; Volume 5, pp. 73–98. [Google Scholar]
  54. Petriz, B.A.; Castro, A.P.; Almeida, J.A.; Gomes, C.P.; Fernandes, G.R.; Kruger, R.H.; Pereira, R.W.; Franco, O.L. Exercise Induction of Gut Microbiota Modifications in Obese, Non-Obese and Hypertensive Rats. BMC Genom. 2014, 15, 511. [Google Scholar] [CrossRef] [PubMed]
  55. Gurung, M.; Li, Z.; You, H.; Rodrigues, R.; Jump, D.B.; Morgun, A.; Shulzhenko, N. Role of Gut Microbiota in Type 2 Diabetes Pathophysiology. eBioMedicine 2020, 51, 102590. [Google Scholar] [CrossRef]
  56. Petersen, L.M.; Bautista, E.J.; Nguyen, H.; Hanson, B.M.; Chen, L.; Lek, S.H.; Sodergren, E.; Weinstock, G.M. Community Characteristics of the Gut Microbiomes of Competitive Cyclists. Microbiome 2017, 5, 98. [Google Scholar] [CrossRef]
  57. Li, J.; Zhao, F.; Wang, Y.; Chen, J.; Tao, J.; Tian, G.; Wu, S.; Liu, W.; Cui, Q.; Geng, B.; et al. Gut Microbiota Dysbiosis Contributes to the Development of Hypertension. Microbiome 2017, 5, 14. [Google Scholar] [CrossRef]
  58. Nie, K.; Ma, K.; Luo, W.; Shen, Z.; Yang, Z.; Xiao, M.; Tong, T.; Yang, Y.; Wang, X. Roseburia Intestinalis: A Beneficial Gut Organism From the Discoveries in Genus and Species. Front. Cell. Infect. Microbiol. 2021, 11, 757718. [Google Scholar] [CrossRef] [PubMed]
  59. Mondot, S.; Lachkar, L.; Doré, J.; Blottière, H.M.; Hanachi, M. Roseburia, a decreased bacterial taxon in the gut microbiota of patients suffering from anorexia nervosa. Eur. J. Clin. Nutr. 2022, 76, 1486–1489. [Google Scholar] [CrossRef]
  60. Smiljanec, K.; Lennon, S.L. Sodium, hypertension, and the gut: Does the gut microbiota go salty? Am. J. Physiol.-Heart Circ. Physiol. 2019, 317, H1173–H1182. [Google Scholar] [CrossRef]
  61. Hiippala, K.; Kainulainen, V.; Kalliomäki, M.; Arkkila, P.; Satokari, R. Mucosal Prevalence and Interactions with the Epithelium Indicate Commensalism of Sutterella spp. Front. Microbiol. 2016, 7, 1706. Available online: http://journal.frontiersin.org/article/10.3389/fmicb.2016.01706/full (accessed on 19 January 2024). [CrossRef]
  62. Mukhopadhya, I.; Hansen, R.; Nicholl, C.E.; Alhaidan, Y.A.; Thomson, J.M.; Berry, S.H.; Pattinson, C.; Stead, D.A.; Russell, R.K.; El-Omar, E.M.; et al. A Comprehensive Evaluation of Colonic Mucosal Isolates of Sutterella Wadsworthensis from Inflammatory Bowel Disease. PLoS ONE 2011, 6, e27076. [Google Scholar] [CrossRef]
  63. Hansen, R.; Berry, S.H.; Mukhopadhya, I.; Thomson, J.M.; Saunders, K.A.; Nicholl, C.E.; Bisset, W.M.; Loganathan, S.; Mahdi, G.; Kastner-Cole, D.; et al. The Microaerophilic Microbiota of De-Novo Paediatric Inflammatory Bowel Disease: The BISCUIT Study. PLoS ONE 2013, 8, e58825. [Google Scholar] [CrossRef] [PubMed]
  64. Lavelle, A.; Lennon, G.; O’Sullivan, O.; Docherty, N.; Balfe, A.; Maguire, A.; Mulcahy, H.E.; Doherty, G.; O’Donoghue, D.; Hyland, J.; et al. Spatial Variation of the Colonic Microbiota in Patients with Ulcerative Colitis and Control Volunteers. Gut 2015, 64, 1553–1561. [Google Scholar] [CrossRef]
  65. Vanhaecke, T.; Bretin, O.; Poirel, M.; Tap, J. Drinking Water Source and Intake Are Associated with Distinct Gut Microbiota Signatures in US and UK Populations. J. Nutr. 2022, 152, 171–182. [Google Scholar] [CrossRef]
  66. David, L.A.; Maurice, C.F.; Carmody, R.N.; Gootenberg, D.B.; Button, J.E.; Wolfe, B.E.; Ling, A.V.; Devlin, A.S.; Varma, Y.; Fischbach, M.A.; et al. Diet Rapidly and Reproducibly Alters the Human Gut Microbiome. Nature 2014, 505, 559–563. [Google Scholar] [CrossRef]
  67. Zhang, P. Influence of Foods and Nutrition on the Gut Microbiome and Implications for Intestinal Health. Int. J. Mol. Sci. 2022, 23, 9588. [Google Scholar] [CrossRef] [PubMed]
  68. Thurber, C.; Dugas, L.R.; Ocobock, C.; Carlson, B.; Speakman, J.R.; Pontzer, H. Extreme events reveal an alimentary limit on sustained maximal human energy expenditure. Sci. Adv. 2019, 5, eaaw0341. [Google Scholar] [CrossRef] [PubMed]
  69. Pontzer, H.; Durazo-Arvizu, R.; Dugas, L.R.; Plange-Rhule, J.; Bovet, P.; Forrester, T.E.; Lambert, E.V.; Cooper, R.S.; Schoeller, D.A.; Luke, A. Constrained Total Energy Expenditure and Metabolic Adaptation to Physical Activity in Adult Humans. Curr. Biol. 2016, 26, 410–417. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Participant’s swim across the Pacific Ocean. (A) Participant’s progress was tracked using longitudinal and latitudinal coordinates. Vertical lines mark the participant’s location at approximately 20-day intervals. Each dot represents a sample collection date. (B) The participant’s daily distance is graphed over time using a linear model. Non-swimming days are marked as red triangles, while swimming days are marked as blue dots. The non-swimming days were excluded from the model to ensure an accurate representation of the change in daily distance over time. (C) A linear model shows the participant’s systolic blood pressure over time. (D) The participant’s body fat composition, measured in percent fat mass, is graphed using a loess model over time.
Figure 1. Participant’s swim across the Pacific Ocean. (A) Participant’s progress was tracked using longitudinal and latitudinal coordinates. Vertical lines mark the participant’s location at approximately 20-day intervals. Each dot represents a sample collection date. (B) The participant’s daily distance is graphed over time using a linear model. Non-swimming days are marked as red triangles, while swimming days are marked as blue dots. The non-swimming days were excluded from the model to ensure an accurate representation of the change in daily distance over time. (C) A linear model shows the participant’s systolic blood pressure over time. (D) The participant’s body fat composition, measured in percent fat mass, is graphed using a loess model over time.
Biomedicines 12 02309 g001
Figure 2. Machine learning predicts swim day based on GM composition over time. The prediction model employed RF and was validated using OOB of the GM composition data collected on 94 separate sample days. (A) The top 10 taxa that have the most influence on predicting model accuracy are presented. The taxa’s orders are categorized by color, and their genera are labeled on the x-axis. The “Unassigned” category includes bacteria not yet identified at the genus level. (B) The relative abundances of the identifiable top genera are depicted in a Spearman rank correlation over time via a linear regression shaded with 95% confidence intervals.
Figure 2. Machine learning predicts swim day based on GM composition over time. The prediction model employed RF and was validated using OOB of the GM composition data collected on 94 separate sample days. (A) The top 10 taxa that have the most influence on predicting model accuracy are presented. The taxa’s orders are categorized by color, and their genera are labeled on the x-axis. The “Unassigned” category includes bacteria not yet identified at the genus level. (B) The relative abundances of the identifiable top genera are depicted in a Spearman rank correlation over time via a linear regression shaded with 95% confidence intervals.
Biomedicines 12 02309 g002
Figure 3. Unsupervised machine learning identifies two distinct microbial subcommunities. Cluster/enterotype 1 is in red, while cluster/enterotype 2 is represented by blue. (A) PCoA reveals the separation of samples based on microbial community structure and sampling days. Each point represents a sample day, with both axes explaining a specific percentage of variation in the data. The microbial community structure accounts for 29.4% of the variance, while the sampling day explains 12.2% of the distance between points. (B) Density plot visualizes the enterotypes’ distribution patterns to examine their prevalence and dominance throughout the participant’s swim.
Figure 3. Unsupervised machine learning identifies two distinct microbial subcommunities. Cluster/enterotype 1 is in red, while cluster/enterotype 2 is represented by blue. (A) PCoA reveals the separation of samples based on microbial community structure and sampling days. Each point represents a sample day, with both axes explaining a specific percentage of variation in the data. The microbial community structure accounts for 29.4% of the variance, while the sampling day explains 12.2% of the distance between points. (B) Density plot visualizes the enterotypes’ distribution patterns to examine their prevalence and dominance throughout the participant’s swim.
Biomedicines 12 02309 g003
Figure 4. Machine learning reveals important taxa within each enterotype. The prediction model employed RF, was trained on the K-means categorizations, and was validated with OOB. (A) MDA of the RF highlights the top five taxa that exhibit shifts between the subcommunities. The taxa’s orders are categorized by color, and their genera are labeled on the x-axis. The “Unassigned” category includes bacteria not yet identified at the genus level. (B) Comparative analysis of relative abundance for the identified genera between cluster/enterotype 1 and cluster/enterotype 2.
Figure 4. Machine learning reveals important taxa within each enterotype. The prediction model employed RF, was trained on the K-means categorizations, and was validated with OOB. (A) MDA of the RF highlights the top five taxa that exhibit shifts between the subcommunities. The taxa’s orders are categorized by color, and their genera are labeled on the x-axis. The “Unassigned” category includes bacteria not yet identified at the genus level. (B) Comparative analysis of relative abundance for the identified genera between cluster/enterotype 1 and cluster/enterotype 2.
Biomedicines 12 02309 g004
Table 1. Subject physical characteristics and measures.
Table 1. Subject physical characteristics and measures.
VariableQuartile 1Quartile 2Quartile 3Quartile 4CountsM ± SDp Value
Daily swim distance22.39 ± 11.7820.24 ± 6.9214.66 ± 6.8612.58 ± 5.046917.32 ± 8.44**
Heart rate64.4 ± 9.0767.35 ± 5.4364.09 ± 6.6364.83 ± 8.333965.67 ± 6.640.59
Blood pressure115.4 ± 5.27127.24 ± 9.71133.45 ± 10.73132.5 ± 14.1839128.28 ± 11.50*
Body fat composition21.00 ± 0.0020.42 ± 1.0118.19 ± 0.2618.00 ± 0.002719.31 ± 1.41***
The participant’s daily swim distance, heart rate, blood pressure, and body fat composition were analyzed across the quartiles. Each quartile represents a segment of the study period, with the number of days as follows: Quartile 1 (27 days), Quartile 2 (27 days), Quartile 3 (26 days), and Quartile 4 (27 days). Summary statistics were calculated for each quartile, including observed counts, means, and standard deviations. An analysis of variance (ANOVA) was performed to compare the means between quartiles for each variable. The observed counts indicate the number of measurements available for each variable out of 94 data points. Significance levels are indicated: * p < 0.05, ** p < 0.01, *** p < 0.001.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lewis, G.; Reczek, S.; Omozusi, O.; Hogue, T.; Cook, M.D.; Hampton-Marcell, J. Machine Learning Reveals Microbial Taxa Associated with a Swim across the Pacific Ocean. Biomedicines 2024, 12, 2309. https://doi.org/10.3390/biomedicines12102309

AMA Style

Lewis G, Reczek S, Omozusi O, Hogue T, Cook MD, Hampton-Marcell J. Machine Learning Reveals Microbial Taxa Associated with a Swim across the Pacific Ocean. Biomedicines. 2024; 12(10):2309. https://doi.org/10.3390/biomedicines12102309

Chicago/Turabian Style

Lewis, Garry, Sebastian Reczek, Osayenmwen Omozusi, Taylor Hogue, Marc D. Cook, and Jarrad Hampton-Marcell. 2024. "Machine Learning Reveals Microbial Taxa Associated with a Swim across the Pacific Ocean" Biomedicines 12, no. 10: 2309. https://doi.org/10.3390/biomedicines12102309

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop