Next Article in Journal
Empirical Research on Male Preference in China: A Result of Gender Imbalance in the Seventh Population Census
Next Article in Special Issue
Six HIT Sessions Improve Cardiorespiratory Fitness and Metabolic Flexibility in Insulin Resistant and Insulin Sensitive Adolescents with Obesity
Previous Article in Journal
The Use of Static Posturography Cut-Off Scores to Identify the Risk of Falling in Older Adults
Previous Article in Special Issue
Exercise Fat Oxidation Is Positively Associated with Body Fatness in Men with Obesity: Defying the Metabolic Flexibility Paradigm
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Systematic Review

A Systematic Review and Meta-Analysis on the Association and Differences between Aerobic Threshold and Point of Optimal Fat Oxidation

by
Ratko Peric
1,2,*,
Zoran Nikolovski
2,
Marco Meucci
3,
Philippe Tadger
4,
Carlo Ferri Marini
5,† and
Francisco José Amaro-Gahete
6,7,*,†
1
Department for Exercise Physiology, Orthopedic Clinic Orthosport, 78000 Banja Luka, Bosnia and Herzegovina
2
Faculty of Kinesiology, University of Split, 21000 Split, Croatia
3
Department of Health and Exercise Science, Appalachian State University, Boone, NC 28608, USA
4
Real World Evidence, IQVIA, 3600 Genk, Belgium
5
Department of Biomolecular Sciences, Division of Exercise and Health Sciences, University of Urbino Carlo Bo, 61029 Urbino, Italy
6
Department of Physiology, Faculty of Medicine, University of Granada, 18001 Granada, Spain
7
PROFITH “PROmoting FITness and Health through Physical Activity” Research Group, Department of Physical Education and Sport, Faculty of Sport Sciences, University of Granada, 18001 Granada, Spain
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Int. J. Environ. Res. Public Health 2022, 19(11), 6479; https://doi.org/10.3390/ijerph19116479
Submission received: 6 April 2022 / Revised: 19 May 2022 / Accepted: 23 May 2022 / Published: 26 May 2022
(This article belongs to the Special Issue Maximal Fat Oxidation: Determinant Factors and Metabolic Flexibility)

Abstract

:
Over the past two decades, scientists have attempted to evaluate whether the point of maximal fat oxidation (FATmax) and the aerobic threshold (AerT) are connected. The existence of such a relationship would allow a more tailored training approach for athletes while improving the efficacy of individualized exercise prescriptions when treating numerous health-related issues. However, studies have reported conflicting results, and this issue remains unresolved. This systematic review and meta-analysis aimed: (i) to examine the strength of the association between FATmax and AerT by using the effect size (ES) of correlation coefficient (r) and standardized mean difference (SMD); (ii) to identify potential moderators and their influence on ES variability. This study was registered with PROSPERO (CRD42021239351) and ClinicalTrials (NCT03789045). PubMed and Google Scholar were searched and fourteen articles, consisting of overall 35 ES for r and 26 ES for SMD were included. Obtained ESs were analyzed using a multilevel random-effects meta-analysis. Our results support the presence of a significant association between FATmax and AerT exercise intensities. In conclusion, due to the large ES variance caused by clinical and methodological differences among the studies, we recommend that future studies follow strict standardization of data collection and analysis of FATmax and AerT-related outcomes.

1. Introduction

Lipids and carbohydrates are the dominant fuels utilized by humans during exercise with their absolute and relative contribution being influenced by sex, diet, exercise intensity and duration, time of the day, and fitness level [1]. During moderate exercise intensities, the energy contribution from lipids increases and then markedly declines to zero at heavy to severe exercise intensities; from that point on, carbohydrates become the dominant energy substrate [2,3]. Carbohydrates, due to their limited stores, can reduce performance during prolonged and/or heavy intensity activities; yet, as little as 1% of body fat can supply sufficient energy for up to 90 km of physical movement, making fat a more suitable fuel source [2]. Maximal fat oxidation point (FATmax) is commonly used to describe an exercise intensity at which fat oxidation is at its highest, whereas exercise intensity matching negligible fat oxidation is labeled FATmin [3,4].
Regular exercise at FATmax intensity has been proposed as a key factor to optimize the body’s ability to oxidize lipids, which is of the highest interest to athletes [5]. Moreover, with the current obesity epidemic representing a serious medical problem due to its association with numerous chronic diseases, (e.g., cardiovascular diseases, hypertension, diabetes), exercising at FATmax intensity has also gained a great deal of attention among public health professionals and has been recommended for treating a number of chronic health issues [5,6,7]. Accurately prescribing exercise intensity is a complex task and there is controversy among both researchers and professionals regarding which of the methods used to design an efficient training plan is the most appropriate [8,9,10] The traditional approach is based on the prescription of exercise intensity as a percentage of maximal oxygen uptake (%VO2max) or maximal heart rate (%HRmax), with these methods commonly represented in literature [8,9]. However, exercise intensity prescriptions based on %VO2max have revealed moderate to large inter-subject variability (35–75%VO2max) at the intensities yielding FATmax [1,8]. The variability of the FATmax intensities becomes lower when %HRmax is used (55–65%HRmax) yet remains ambiguous when it comes to individualized exercise prescription [11,12]. Hence, exercise intensities expressed as a fixed percentage of maximal values might not accurately reflect the metabolic responses of the human body [11,13,14]. For these reasons, some authors recommend that exercise intensity should be prescribed using a more standardized method, such as individual metabolic thresholds since traditional methods fail to account for differences in the subject’s metabolic stress [9,10,11]. In contrast to the relative percentage of VO2max or HRmax, an individualized approach to exercise intensity prescription based on metabolic thresholds describes specific metabolic phases during exercise and thus, intends to account for differences in the body’s physiological and functional capacity [11]. This approach might also homogenize the elicited metabolic stress and consequently reduce individual variability in metabolic responses despite differences in their phenotype [9,15].
During exercise with increasing intensity, three phases of the body’s energy production and two threshold points delineating these phases can be distinguished [15]. These threshold points have been termed the metabolic thresholds and can be determined by either gas analysis or blood lactate techniques [15,16]. Throughout the years, scientists used different terms to identify these two thresholds, whether they wanted to refer to the physiological processes occurring in the body or to the methods used to identify them [11,15,16]. For additional clarification of the physiological and methodological significance of the thresholds, we suggest further reading [11,15,16]. In this paper, we will mention only the first threshold, whereas the term aerobic threshold (AerT) will be used to refer to it. Our goal is to align with the conceptual framework for performance diagnosis and training prescription proposed and clearly described by Meyer et al. (2005) [15].
Ever since the term FATmax was introduced, scientists have tried to determine the existence of a relationship between exercise intensities matching FATmax and AerT, with the aim of assuring a more individualized exercise prescription [17,18,19]. If such a connection exists, it would integrate the most relevant indices for planning and assessing an effective exercise program [9,10]. Hence, over the last two decades, conflicting results with high inter-study variability on the association between exercise intensities matching FATmax and AerT have been reported [20,21,22]. These variations may have resulted from both methodological and clinical differences within and between the studies [12]. To our knowledge, no studies have systematically explained this variability.
Hence, this systematic review and meta-analysis aimed to examine the association between the FATmax and AerT, identify relevant moderators, and examine their influence on effect size variability.

2. Materials and Methods

2.1. Study Design

This systematic review was registered in the International Prospective Register of Systematic Reviews (PROSPERO) (ID: CRD42021239351) and is part of a pre-registered trial on ClinicalTrials.gov (ID: NCT03789045). The Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) checklist and flow diagram for reporting systematic reviews and meta-analyses was used as a tool to structure this review and describe the methodology, and systematically present our search findings [23].

2.2. Search Strategy

We searched MEDLINE (via PubMed) and Google Scholar for studies exploring the FATmax and AerT relationship. The search was performed using the Boolean logic, which limits the search results with operators including AND/OR to only those documents containing relevant key terms in the scope of the review. The search combined the following key terms: “fat oxidation”, OR “maximal fat oxidation”, OR “optimal fat oxidation”, OR “peak fat oxidation”, AND “aerobic threshold”, OR “anaerobic threshold”, OR “ventilatory threshold”, OR “lactate threshold”, OR “metabolic threshold”, OR “gas exchange threshold”, representing variety in terminology used [15,16]. The search was developed and conducted by two independent researchers (PR and NZ) and reported using the PRISMA statement [23]. Furthermore, since the FATmax phrase was introduced in 2001 [2], the inclusion date for the publications was restricted from 1 January 2001, to present. The last search was run on 1 April 2021.

2.3. Data Extraction

Two investigators (PR and NZ) independently performed, in an unblinded standardized manner, screening of the retrieved records, and checked whether they met the eligibility criteria, with a third reviewer being involved when opinion differences were present (FMC). Data were extracted from each of the selected articles and summarized in an Excel spreadsheet (Microsoft, Redmond, WA, USA) for further analysis.

2.4. Inclusion/Exclusion Criteria

Research articles were selected using the defined population, intervention, comparison, outcomes, and study (PICOS) design [24], with study characteristics for inclusion established as: (i) original studies, (i.e., randomized and non-randomized controlled trials, cohort studies, case–control studies, and cross-sectional studies) in form of (ii) full text, abstracts or congress presentations. Furthermore, the studies included had to report AerT and FATmax intensities occurrence, (i.e., mean ± SD) and their association, (i.e., Pearson correlation, Kendall rank correlation, or Spearman correlation coefficient). If one of two requirements failed to be reported or was unable to be determined, the corresponding author was contacted. However, independent of its form, to be considered eligible, these studies were required to show a clear and reproducible description of the methods used to determine both AerT and FATmax. Case reports, editorials, reviews, and opinion papers were excluded. In addition, only studies that included participants with no evidence of any metabolic, pulmonary, or cardiovascular diseases (conditions potentially affecting substrate utilization) were considered acceptable. No other restrictions to participants’ characteristics were introduced other than age (18–60 years). Report criteria required studies to be written in English and to be published in a peer-reviewed journal. The grading of recommendations, assessment, development, and evaluation (GRADE) approach was applied to define the quality of a body of evidence for selected studies [25].

2.5. Statistical Analysis

All statistical analyses were performed with R software (version 4.0.4) (The R Foundation, Vienna, Austria) by using metafor and dmetar packages [26,27]. Pearson’s correlation coefficients (r) and standardized mean difference (SMD) between the exercise intensities at which AerT and FATmax appeared were used as ESs; such ESs were analyzed separately to estimate the strength of the correlation and the size difference between the two exercise intensities, respectively. Summary SMD and r estimates were determined using a random-effects model and presented as mean and 95% confidence (CI) and prediction (PI) interval [28].
Considering that r can be biased by the measurement unit used to compute it due to possible spurious correlation, a meta-analysis of r had to be performed separately for each measurement unit used to identify exercise intensity at FATmax and AerT [29]. Indeed, the correlation between variables that are non-independent due to the share of a common denominator, (i.e., variables expressed as a fraction of body weight or maximal values), can be influenced and over-estimate the correlation between the original independent variables [29,30]. On the other hand, SMD allows ESs deriving from different measurement units to be pooled in one single meta-analysis [30].
To account for dependencies among ESs, which in certain cases were clustered within the same study, multilevel meta-analyses (MA) were performed where the level 1 variance is attributed to pure sampling error, the level 2 described the within-studies variance, and level 3 represented the amount of between-studies variance [31,32,33]. Distribution of variance among the different levels was used to identify the required levels of modeling [33]. Moreover, the model assumptions about the identifiability of each parameter (the variances of each model) were checked using the profile likelihood (PL) of each fitted model; when the PL did not provide an identifiable profile of the parameter, a simpler model was considered [34]. Sensitivity analysis based on the leave-one-out method and statistical outlier detection test was implemented in order to explore the impact of excluding or including any ES, determine the impact of distortion on the pooled overall effect estimate as well as avoid potential pseudoreplication problems [35,36]. In a multilevel meta-analysis, the risk of bias across studies (publication bias) requires a specialized analysis; thus, publication bias was determined by an extended Funnel plot test and the adapted Funnel plot for multilevel meta-analysis [37]. The risk of bias in individual studies, (i.e., internal validity) included in this paper was examined using a ROBIS tool, with results of this assessment presented in a graphical format [38].
For each outcome parameter, degrees of heterogeneity were measured with Cochran’s test for chi-squared statistic of total (Q) and expected variance (df) and expressed as the Higgins (I2) statistic [39,40]. If heterogeneity was present, the random-effects model was the preferred model, and the weighing factor, the inverse of the between-studies and within-studies variance was used [28]. Additionally, the PI calculated for each ES provided a predicted range for the true treatment effect in an individual study, additionally describing the degrees of heterogeneity between the studies [26]. In cases of relevant heterogeneity between the studies (0% to 40%: might not be important; 30% to 60%: may represent moderate heterogeneity; 50% to 90%: may represent substantial heterogeneity; 75% to 100%: considerable heterogeneity), relevant moderators and matching subgroups were identified and an analysis of changes in variation in effects was performed [33,41]. For each SMD and r meta-analysis, potential sources of heterogeneity in a test of moderators were assessed by using an omnibus test based on F distribution [42] which was performed solely if at least three ESs were available per subgroup [33]. The standardized measure of effect size (Cohen’s F) was also used to describe the strength and practical significance of the test of moderator, with values f = 0.1 is a small effect, f = 0.25 is a medium effect, and f = 0.4 is a large effect, respectively [43]. The critical value for the F distribution (f2) was determined from the table with the α = 0.05 [43].
When r was used as the ES, if measures of association other than Pearson’s correlation (r) were reported, they were converted to r as previously described [44]. Moreover, to obtain unbiased weights for each study, Fisher’s z-transformation was used to convert r values derived from the original studies to z values, which were used for the statistical analyses (z values were transformed back to r for presentation purposes) [28,45]. Correlations were classified as weak (𝑟 ≤ 0.30), moderate (𝑟 = 0.30–0.50), significant (𝑟 = 0.50–0.70), and strong (𝑟 > 0.70) [46].
When SMD was used as the ES, means and SD values of exercise intensities at FATmax and AerT were extracted from all the retrieved studies. The SMD was computed to account for the differences between FATmax and AerT since the studies assessed the same outcome, (i.e., exercise intensity at FATmax and AerT) but measured them by using different measurement units [47]. Additionally, since FATmax and AerT exercise intensities were paired data deriving from the same individual, the SMD was computed to account for paired measures by accounting for the correlation between the two exercise intensities [48]. Treating the two intensities as independent measures could rise the unit-of-analysis error by providing confidence intervals that are likely to be too wide, and reducing the trial’s weight, with the possible consequence of disguising clinically important heterogeneity [39]. For one study where r was not reported [49], r had to be calculated using a different measurement unit than that one used to report mean and SD values, (i.e., %VO2max) and therefore used to compute SMD which was assumed to be equal to the pooled r derived from the same measurement unit as ES. Cohen’s rule of thumb for interpretation of the SMD statistic was followed: a value of ≤0.2 indicates no effect, a value of 0.21 to 0.5 indicates a small effect, a value of 0.51 to 0.8 indicates a medium effect and a value of >0.81 indicates a large effect [43].
A descriptive exploration of the plotted ESs (r and SMD) was used as a proxy of agreement, where high agreement was assumed in case of high r and small effect SMD (r > 0.7 and SMD < 0.5) and low agreement in case of low r and medium to large effect SMD (r < 0.7 or SMD > 0.51). Furthermore, this method was used to explore measurement units as a potential source of spuriousness [29].

3. Results

3.1. Descriptive Results

The systematic search retrieved a total of 71,400 papers with 18,300 identified as duplicates. Thereafter, screening of the paper’s title and language was performed with a total of 575 potentially relevant papers included in the abstract screening. Subsequently, eighty records were selected to be read in full as potentially eligible. An additional sixty-six articles were excluded after reading the full text as they did not fulfill the inclusion criteria. Data extraction was performed by two reviewers (PR and NZ) independently, using a data extraction Excel form (Microsoft, Redmond, WA, USA). Any disagreements about data extraction were solved by consensus or by the decision of a third reviewer. For one study, correlation coefficients were converted from Kendall’s tau to Pearson r [20]. The literature search and study selection process are reported using the PRISMA statement in Figure 1. The internal validity of included studies is presented in Figure 2. One correlation ES was excluded as it was identified as an outlier after sensitivity analysis (results not shown) due to its ES estimate being too extreme to fit within pooled ES and not overlapping with the 95% CI of the pooled effect [50]. Moreover, its ES overly contributed to the heterogeneity, while at the same time was not very influential concerning the overall pooled weight (presumably due to the small sample size). Overall, fourteen papers were included in the present study with a total of 35 ES for r and 26 ES for SMD.

3.2. Exploring Heterogeneity and Subgroup Analysis

Due to anticipated heterogeneity, three investigators (PR, AGFJ, and FMC) independently performed the identification of common variables, (i.e., relevant moderators) and their subgroups allowing sources of variation to be investigated. Identified moderator variables, categorized by differences in characteristics of the studies, (i.e., methodological diversity) and by study populations, (i.e., clinical diversity), are reported in Table 1.

3.3. Clinical Diversity

When the selected studies were examined for their clinical diversity, two moderators were identified: (i) sex and (ii) physical activity level, each with two subgroups (Table 1). All included papers collectively evaluated 855 participants, out of which 526 (61.52%) were males and 329 (38.48%) were females. Seven studies (50%) used males as primary participants, whereas one (7.14%) study tested only females. In six (42.86%) studies, both sexes were evaluated. When classified by physical activity level, 296 (34.62%) subjects were active while 559 (65.38%) were inactive, (i.e., sedentary or obese). Seven studies (50%) examined the active population whereas six (42.86%) studies examined inactive participants. Only one study (7.14%) examined both groups.

3.4. Methodological Diversity

When selected studies were examined for their methodological diversity, six moderators with relevant subgroups were identified (Table 1). Cycle ergometry was the preferred method in nine (64.29%) studies, whereas treadmill was employed in five (35.71%). When considering the methods used to determine the AerT, five (35.71%) studies preferred the lactate method whereas gas analysis was used in seven (50%). Two (14.29%) times, both methods were used. When evaluating the measurement methods used to establish correlation, five methods were identified: thirteen (37.14%) cases preferred mL/min/kg with L/min, %VO2max and b/min were used in seven (20%), and %HRmax in two (5.71%) cases. The analysis of test protocols used to assess VO2max/AerT, showed that all studies favored a graded exercise test (GXT) type protocol. Twelve (85.71%) studies preferred shorter stages (≤3-min) while longer stages (>3-min) were used in only one study (7.14%). One (7.14%) study examined the association by using both short and long stages. FATmax was identified by using visual inspection of the appropriate plots in nine (64.29%) studies while five (35.71%) studies approached this issue by using a mathematical model. Finally, ten (71.43%) studies determined FATmax during the same GXT used to assess VO2max/AerT, whereas four (28.57%) studies used an additional test.

3.5. Meta-Analysis Study of Correlation Coefficients

3.5.1. Overall Effect

All data provided by the selected studies met the inclusion standard for correlation meta-analysis with the studies evaluated for their heterogeneity and bias. Pooled ESs calculated using random effects, 95% CI, and PI with the results of heterogeneity and distribution of variance results for each measurement unit are presented in Table 2. By observing the adapted Funnel plot for multilevel meta-analysis (results not shown) for each measurement unit used to assess r, we observed that the distribution of study-specific effects was actually quite symmetrical, with no indication of the existence of publication bias.

3.5.2. B/min Measurement Method

For the observed measurement method, only one moderator met the criteria set to perform the test of moderators.

Sex

In the observed data (with the sub-grouping) the PL for the three-level MA was non- informative, so a two-level MA was fitted. When sex was examined as a moderator, the test of moderators revealed no statistical differences (F1,4 = 0.54, critical f2 = 7.71, p = 0.502). For this moderator, pooled values for a common estimate of heterogeneity showed Q = 46.57, df = 4, I2 = 91.19, p < 0.001. For males, mean ES was 0.50 (95% CI −0.03 to 0.81; PI −0.35 to 0.90) whereas for females, mean ES was 0.71 (95% CI −0.75 to 0.99; PI −0.99 to 1.00). Total heterogeneity for females was Q = 42.42, df = 2, I2 = 94.56%, p < 0.001. For males, heterogeneity was Q = 4.16, df = 2, I2 = 52.14%, p < 0.125.

3.5.3. mL/kg/min Measurement Method

For the observed measurement method, the following moderators met the inclusion criteria set to perform the test of moderators.

Sex

In the observed data (with the sub-grouping) the PL for the three-level MA was non-informative, so a two-level MA was fitted. When sex was examined as a moderator, no statistical differences were noticed (F1,11 = 0.01, critical f2 = 4.84, p = 0.994). For this moderator, pooled values for a common estimate of heterogeneity showed Q = 44.70, df = 11, I2 = 75.87, p < 0.001. For males, mean ES was 0.73 (95% CI 0.50 to 0.86; PI −0.03 to 0.96) whereas for females, mean ES was 0.74 (95% CI 0.56 to 0.85; PI 0.29 to 0.92). Total heterogeneity for females was Q = 12.22, df = 5, I2 = 59.71%, p < 0.032. For males, heterogeneity was Q = 32.47, df = 6, I2 = 81.95%, p < 0.001, respectively.

Physical Activity

For this moderator, the PL for the three-level MA was informative, so this was the fitted model. Test of moderators (F1,11 = 0.03, critical f2 = 4.84, p = 0.874) concluded that both subgroups had similar variances. Pooled heterogeneity was high (Q = 40.12, df = 11, I2 = 78.93%, p < 0.001). The analysis of this moderator and its subgroups showed mean ES for active (0.77, 95% CI 0.40 to 0.92; PI −0.11 to 0.97) and inactive (0.79, 95% CI 0.58 to 0.90; PI 0.25 to 0.95) individuals to be similar. Substantial I2 (93.16%) for the active group (Q = 26.32, df = 5, p < 0.001) originated mostly from level 3 (79.31%) and was negligible from level 2 (3.85%). For inactive individuals, heterogeneity (Q = 13.81, df = 6, p < 0.032, I2 = 71.5%) derived negligible from level 2 (5.43%) and considerably from level 3 (66.07%).

Ergometer

The PL for the three-level MA was informative, so this was a fitted model. When the type of ergometer was examined for existence of variance, no statistical differences were revealed (F1,11 = 0.47, critical f2 = 4.84, p = 0.053). For this moderator, the presence of heterogeneity was (Q = 29.87, df = 11, p < 0.002) divided between level 2 (7.45%) and level 3 (55.59%), respectively. For the treadmill, mean ES was 0.86 (95% CI 0.71 to 0.93; PI 0.72 to 0.93) whereas for the cycle ergometer, mean ES was 0.69 (95% CI 0.48 to 0.83; PI 0.15 to 0.92). Identified heterogeneity for the treadmill was (Q = 1.54, df = 2, I2 = 0.00%, p < 0.462). For cycling, observed heterogeneity was (Q = 28.33, df = 9, p < 0.001) and originated from level 3 (63.38%) and minimally from level 2 (7%).

AerT Detection Method

The PL for the three-level MA was informative, so this was the fitted model. Test for subgroup differences showed F1,11 = 1.55, critical f2 = 4.84, p = 0.240. Pooled ES for gas analysis was 0.82 (95% CI 0.66 to 0.90; PI 0.41 to 0.95) and for lactate method 0.67 (95% CI 0.33 to 0.86; PI −0.07 to 0.94). With an overall high heterogeneity identified (Q = 11.18, df = 11, p = 0.001) divided between level 2 (6.22%) and level 3 (65.21%), moderate heterogeneity (I2 = 58.14%) was observed in cases using gas analysis, whereas high heterogeneity (I2 = 79.12%), unevenly distributed between level 2 (8.57%) and level 3 (70.55%) in cases using the lactate method was detected.

FATmax Test

For this moderator, the PL for the three-level MA was informative, so this was the fitted model. When the test used to detect FATmax was analysed, two subgroups were identified: (i) FATmax determined during VO2max/AerT test, (ii) FATmax determined with an additional test (F1,11 = 2.38, critical f2 = 4.84, p = 0.151). For this moderator, pooled values for a common estimate reported substantial heterogeneity (Q = 40.05, df = 11, I2 = 71.44% (level 2 = 5.77% and level 3 = 65.67%), p < 0.001). For the first subgroup, mean ES was 0.80 (95% CI 0.68 to 0.88 and PI 0.40 to 0.95) whereas for additional test group, mean ES was 0.53 (95% CI −0.15 to 0.87 and PI −0.15 to 0.87). Even though total I2 for the identical VO2max protocol subgroup was moderate (Q = 39.20, df = 9, I2 = 75.54%, p < 0.001), it originated mainly from level 3 (67.47%) with small contribution from level 2 (8.08%). In the case of the additional test, no heterogeneity was noted (Q = 0.00, df = 2, I2 = 0.00%, p < 0.653).

3.5.4. %O2max Measurement Method

For the observed measurement method, the following moderators met the inclusion criteria set to perform the test of moderators.

Physical Activity

For this moderator, the PL for the three-level MA was non-informative, so the fitted model was a two-level. Results of test of moderators were F1,5 = 0.01, critical f2 = 6.61, p = 0.982. Pooled heterogeneity was Q = 74.79, df = 6, I2 = 93.93%, p < 0.001. Subgroups analysis showed mean ESs for active (0.55, 95% CI −0.83 to 0.96; PI −0.99 to 1.00) and inactive (0.55, 95% CI −0.19 to 0.89; PI −0.81 to 0.98) individuals to be similar. Substantial I2 (85.19%) was observed for the both active (Q = 16.34, df = 2, p < 0.001) and inactive individuals (Q = 33.07, df = 3, p < 0.001, I2 = 94.47%), respectively.

Ergometer

The PL for the three-level MA was non-informative, so the fitted model was a two-level. Results for the test of moderators were F1,5 = 0.76, critical f2 = 6.61, p = 0.793. For this moderator, the presence of heterogeneity was high (Q = 74.45, df = 5, I2 = 93.81%, p < 0.001). For the treadmill, mean ES was 0.59 (95% CI −0.21 to 0.92; PI −0.86 to 0.99) whereas for the cycle ergometer, mean ES was 0.50 (95% CI −0.79 to 0.98; PI −0.98 to 1.00). Total identified heterogeneity for the treadmill was (Q = 67.21, df = 3, I2 = 96.42%, p < 0.001). For cycling, observed heterogeneity was Q = 7.44, df = 2, I2 = 72%, p < 0.001.

FATmax Detection Method

The PL for the three-level MA was non- informative, so a two-level MA was fitted. When evaluating the FATmax detection method as a moderator, two subgroups were identified: visual and mathematical methods (F1,5 = 0.06, critical f2 = 6.61, p = 0.823). For the visual method of detection, mean ES was 0.52 (95% CI −0.76 to 0.97 and PI −0.99 to 1.00) whereas for the mathematical method, mean ES was 0.60 (95% CI −0.18 to 0.92 and PI −0.77 to 0.98). For this moderator, pooled values for a common estimate confirmed the existence of substantial heterogeneity (Q = 60.71, df = 5, I2 = 93.86%, p < 0.001). Total I2 for the visual inspection method was considerably high (Q = 50.48, df = 2, I2 = 97.61%, p < 0.001). For the mathematical method, heterogeneity was substantial (Q = 10.22, df = 3, I2 = 73.40%, p < 0.001). The expected effect for future studies in both groups varied from very weak to very strong, reflecting high heterogeneity.

3.5.5. L/min Measurement Method

For the observed measurement method, no moderators met the inclusion criteria set to perform the test of moderators.

3.6. Meta-Analysis Study of Standardised Mean Differences

3.6.1. Overall Effect

Since SMD allows adjustment for different effect sizes deriving from different measurement units without bias presence, calculated SMDs were pooled in one single meta-analysis. ESs were calculated using random effects, and 95% CI with the results of heterogeneity, and the distribution of variance results for each measurement unit is presented in Figure 3. Polled PI was −1.43 to 0.97 with SE = 0.16. Residual heterogeneity was substantial and similarly divided between level 2 (39.75%) and level 3 (52.73%). In 69.23%, FATmax preceded AerT whereas the test of moderators revealed that only in the case of long stage GXT did AerT tend to precede FATmax (Table 3). By observing the adapted funnel plot for multilevel meta-analysis (results not shown) for each measurement unit used to assess SMD, we observed that the distribution of study-specific effects was actually quite symmetrical, with no indication of the existence of publication bias.

3.6.2. Sex

For this moderator, the PL for the three-level MA was informative, so this was the fitted model. Test of moderators revealed F1,22 = 0.01, critical f2 = 4.30, p = 0.936. For this moderator, pooled values for a common estimate of heterogeneity showed Q = 290.54, df = 22, p < 0.001 with variance equally being distributed between level 2 (41.65%) and level 3 (51.49%).

3.6.3. Physical Activity

For this moderator, the PL for the three-level MA was informative, so this was the fitted model. Test of moderators results were F1,24 = 1.20, critical f2 = 4.260, p = 0.285. Residual heterogeneity (Q = 223.48, df = 24, p < 0.001) variance was distributed across level 2 (39.24%) and level 3 (53.11%).

3.6.4. Ergometer

The PL for the three-level MA was informative, so a three-level MA was used. When the type of ergometer was examined, the results were F1,24 = 0.33, critical f2 = 4.260, p = 0.572. For this moderator, the presence of residual heterogeneity was confirmed (Q = 285.26, df = 24, p < 0.001) with I2 originating from level 2 (35.06%) and level 3 (58.09%).

3.6.5. VO2max/AerT Test

The PL for the three-level MA was non-informative for the VO2max/AerT test subgroups, so a two-level MA was fitted. The test for subgroup differences showed F1,24 = 3.19, critical f2 = 4.260, p = 0.087 with substantial residual heterogeneity present (Q = 237.21, df = 24, I2 = 89.9%, p = 0.001).

3.6.6. AerT Detection Method

The PL for the three-level MA was informative, so this was the fitted model. Tests for subgroup differences in AerT detection methods showed F1,24 = 0.03, critical f2 = 4.260, p = 0.857. With an overall high heterogeneity identified (Q = 281.50, df = 24, p < 0.001), substantial heterogeneity (I2 = 95.23%) was distributed within (1.3%) and between (93.93%) studies using gas analysis, whereas moderate heterogeneity (Q = 92.03, df = 13, I2 = 89.41%, p < 0.001), unevenly distributed between level 2 (71.62%) and level 3 (17.79%) in studies using the lactate method were detected.

3.6.7. FATmax Detection Method

The PL for the three-level MA was informative. When evaluating the FATmax detection method, two subgroups were identified: visual and mathematical methods, with following results (F1,24 = 1.76, critical f2 = 4.260, p = 0.197). For this moderator, pooled values for a common estimate confirmed the existence of substantial heterogeneity (Q = 300.77, df = 24, I2 = 92.39% (level 2 38.67% and level 3 53.72%), p < 0.001). The expected effect for future studies in both groups varied from very weak to very strong, reflecting high heterogeneity.

3.7. Correlation Coefficients and Standardized Mean Differences

Finally, r vs. SMD between the exercise intensities at FATmax and AerT were plotted in Figure 4; providing an integrated view of all ESs. The combination of r and SMD can be seen as a proxy of agreement, where high agreement is observed in the case of high r and low SMD (r > 0.7 and SMD < 0.5, represented by the dark grey square in Figure 4) and low agreement related to low r or high SMD (r < 0.7 and SMD > 0.51, white background in Figure 4).

4. Discussion

The main conclusion of our study was that the FATmax and AerT connection exhibits a consistent and strong association. Moreover, the variations in methodological designs of the studies, combined with the clinical diversity seem to contribute to the apparently inconsistent, (i.e., heterogeneous) results. These findings represent an important practical application that allows a more tailored and individualized approach to exercise prescription.
The Strongest r, with the narrowest 95% CI and PI, was observed in the case of mL/min/kg, implying the highest association when this method is used. Test of moderators followed by the subgroups analysis revealed no influence of clinical differences on FATmax and AerT correlation. However, substantial heterogeneity was still noticed for both clinical moderators which remained considerable even after subgroup analysis. If heterogeneity is lower within the subgroups than across the pooled data, we can assume that the subgroup analysis contributes toward explaining heterogeneity in the overall analysis. However, this was not the case. Potential explanations could be drawn from the characteristics of study participants, (i.e., demographics, sex, age range, body composition, etc.), which in our study could not be controlled. Furthermore, stronger correlation and narrower 95% CI and PI were noticed in the studies using a treadmill, gas analysis for AerT detection, and a single GXT used to determine AerT and FATmax, respectively. These findings suggest more robust and reliable results since CIs can be interpreted as an estimate of the “true” effect. A sufficient number of participants were included in each subgroup, so the covariate distribution is not concerning in these results. As a general rule, the larger the differences between the subgroups and narrower their CI’s, the more robust and reliable the results are. Concerning methods used to detect AerT, results for the lactate subgroup indicate large dispersion in the correlations; hence, the results observed cannot be considered deriving from a single population. Contrary, gas analysis was revealed to be a more reliable method when comparing exercise intensities of AerT and FATmax, with the lactate method significantly reducing the strength of the association. However, it is worth noting that the observed presence of considerable heterogeneity for each subgroup could originate from multiple methods associated with either gas analysis or lactate tests, which require further examination [56]. Based on the available data, it is recommended that future trials should prefer gas analysis over lactate methods when aiming to assess the correlation between AerT and FATmax. When considering the test used to determine FATmax, no heterogeneity was identified for an additional test, presumably due to the small number of included studies. However, in the group where FATmax and AerT were determined during a single GXT, substantial heterogeneity was revealed, presumably as a result of methodological differences among the studies. These differences can probably be attributed to different GXT protocol methodologies (stages length and incline differences), contributing toward subgroup variability. In our MA, stages ranging from 1 min to 6 min were reported (Table 1). However, due to the small number of included studies and to adequately assess statistical significance, stages ≤3 min were classified as short whereas, stages >3 min were classified as long [13,14]. Even though reliability was higher for the short stages group, it still remains vague if 1-min, 2-min or 3-min stages are more adequate as no consensus exists regarding the optimal GXT methodology when determining VO2max/AerT [1,8,13]. Hence, using a single GXT should be a preferable method, which, in the long run, would allow further insight into the variability within the short stages. Furthermore, this strategy could inhibit potential pseudoreplication problems, while at the same time being less time-consuming and more economical for both researchers and subjects [12,36].
Even though the L/min measurement method was revealed to have the second strongest correlation coefficient, no moderators met the inclusion criteria set to perform the test of moderators. However, this method revealed robust and reliable results due to narrow 95% CI and associated PI, demonstrating a possible future application of this method. However, since a non-sufficient number of participants were included in each subgroup, the covariate distribution is a concerning factor for this analysis.
When the b/min measurement method was used, lower r with a wider 95% CI and PI were observed, compared to previously discussed methods, with only one moderator meeting the required standards for the test of moderators. Nevertheless, sex was revealed to have no influence on the correlation strength with a somewhat stronger correlation and wider 95% CI and PI observed in females compared to males. This result could be attributed to the substantially larger number of males included in this test of moderators, allowing more robust results. Considering this, future studies should expect a high variability in their results when this method of correlation assessment is used.
Even though the %VO2max was observed as the method with the weakest correlation, none of the tested moderators were revealed to have a significant influence on the correlation strength. This result, in line with our original framework, goes to show that the individual metabolic responses to exercise cannot accurately be identified by calculating fixed percentages of maximal values [11,13,14]. Moreover, future studies using this method could expect large heterogeneity in their results since 95% CI and PI observed were wide, leading to low reliability of obtained results since obtained CIs cannot be interpreted as an estimate of the “true” effect.
The SMD analysis showed a small negative pooled effect size, revealing no significant differences between the intensities at which FATmax and AerT occur, with a strong tendency of FATmax to precede AerT. Furthermore, it seems that the duration of the stages used during an exercise test for VO2max/AerT determination may affect the association between FATmax and the AerT. When long stages are used in a GXT, AerT tended to precede FATmax, demonstrating a direct influence of the specific moderator on the order. The fact that longer stages tend to underestimate FATmax could be a potential reason for this occurrence [1,12]. This finding advises—when prescribing individualized exercise—researchers and professionals should devote their attention toward methods standardization considering the VO2max/AerT identification as that will also optimize the oxidation of fat [9,10,11]. The heterogeneity for the pooled SMD meta-analysis was high, with a tendency of lowering during the subgroup SMDs analysis; indicating an influence of the methodological differences on the exercise intensities matching FATmax and AerT. The test of moderators showed no significant effect for any of the moderators considered, which, as for the r ES analyses, seems to highlight that most of the heterogeneity was due to multiple factors caused by the lack of standardized methods and guidelines to assess the association between FATmax and AerT.
When r and SMD ESs were plotted (see Figure 4), no evident pattern in relation to the moderators tested was noticed. Out of the 25 matched correlation coefficients and SMDs, 10 showed high r and low SMD, which seemed to be relatively equally distributed between the moderators fulfilling the requirement for the test of moderators. Moreover, although high r and low SMD Ess seemed to be more frequent in certain subgroups, (i.e., short stages VO2max protocol, single GXT, and gas analysis method), there are several other matched ESs computed from studies that used mentioned moderators, yet not reported high r and low SMD. Interestingly, when the studies reporting high r and low SMD were considered, there seemed to be a pattern. In fact, if a matched pair of ESs showed high r and low SMD, this was also true for all the other ESs of that specific study. This implies that the combination of low SMD and high r was relatively uniform within the study, suggesting the existence of possible further methodological or clinical differences that remained uncovered during the moderators’ analysis. Studies with measurement units such as b/min or L/min, which can be considered as non-spurious measures (not normalized to subjects), show a tendency to have weak to moderate correlation values or high SMDs; with one study as the only exception [18]. Contrary, the studies with low SMD and high correlation values tend to be studies that use a potentially spurious measures such as %HRmax, %V02max, and mL/min/kg (see the dark grey rectangle in Figure 4). This may be due to the fact that when the exercise intensity is expressed in relative terms, the interindividual variability is reduced as the exercise intensity is normalized and uniform among individuals.
The present study is not without limitations. First, this systematic review included only English written peer-reviewed published studies restricted to healthy adult subjects pooling data from nonrandomized studies. Therefore, many confounding factors that might affect the correlation between FATmax and AerT could not be controlled. Second, although the number of participants included in the study was large, numbers in the several subgroups were relatively limited, inhibiting appropriate statistical analysis. Third, although these findings strongly suggest that a relationship exists between the two intensities, correlations and SMD are not sufficient to confirm that the intensities necessarily coincide and that the error between the two measures is small. Indeed, agreement and correlation are widely-used concepts that assess the relationship between variables and although similar and related, they represent different notions of connection [57]. Due to the lack of the information needed to directly assess agreement in the selected studies, our results could not provide direct information regarding the associated levels of agreement and error between the parameters. However, since high correlations and low SMD are two conditions necessary and related to high levels of agreement, our study was able to provide a proxy of agreement. Nevertheless, a future study exploring the topic of agreement is required.
The current results have wide practical implications. Development of affordable and unconventional field test strategies, such as methods using heart rate variability [58], perceived exertion [59], and respiratory frequency [60], could allow the identification of the AerT with relatively good accuracy, simplicity, and velocity. From a physiological perspective, AerT can be considered as a “biomarker” for FATmax identification (vice versa), which in turn could replace costly tests aiming to detect mentioned indices. From a practical point of view, the existence of a FATmax and AerT association has important implications for both athletes and clinical populations, providing a basis for more applied and individualized training prescriptions since FATmax training is defined as an effective method to enhance physical activity and fat oxidation [6,7]. This knowledge can improve training effectiveness by helping to recognize each subject’s unique physiological and energetic demands, allowing an accurate and highly tailored aerobic training prescription.

5. Conclusions

Several recommendations could be drawn from our results. First, the existence of methodological and clinical differences between and within the studies can play a decisive role when establishing the correlation between FATmax and AerT. Using mL/min/kg as a measurement unit, short GXT stages, and gas analysis are recommended for standardization of future trials. In addition, the assessment should preferably be performed during one GXT session, whereas clinical differences, ergometer type, and FATmax detection method, all have a trivial influence on correlation strength. Likewise, our MA showed non-homogenous PI results ranging from almost 0 to 1. Therefore, when non-standardized, r of the future studies is expected to range from almost no correlation to absolute correlation. Hence, we concluded that when methodological differences are standardized, a strong correlation between FATmax and AerT is present.

Author Contributions

R.P. and C.F.M. participated in the design of the study, contributed to data collection, data reduction/analysis, and interpretation of results; Z.N., M.M., and F.J.A.-G. participated in the design of the study and contributed to data collection; P.T. contributed to data analysis and interpretation of results. All authors equally contributed to the manuscript writing. All authors have read and agreed to the published version of the manuscript, and agree with the order of presentation of the authors.

Funding

This research received no external funding.

Institutional Review Board Statement

Ethical review and approval were waived for this study due to this study being a Systematic Review and Meta-Analysis.

Informed Consent Statement

Patient consent was waived due to this study being a Systematic Review and Meta-Analysis.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Amaro-Gahete, F.J.; Sanchez-Delgado, G.; Jurado-Fasoli, L.; De-la-O, A.; Castillo, M.J.; Helge, J.W.; Ruiz, J.R. Assessment of maximal fat oxidation during exercise: A systematic review. Scand. J. Med. Sci. Sports 2019, 29, 910–921. [Google Scholar] [CrossRef] [PubMed]
  2. Jeukendrup, A.; Achten, J. FATmax: A new concept to optimize fat oxidation during exercise? Eur. J. Sport Sci. 2001, 1, 1–5. [Google Scholar] [CrossRef]
  3. Achten, J.; Jeukendrup, A.E. Maximal fat oxidation during exercise in trained men. Int. J. Sports Med. 2003, 24, 603–608. [Google Scholar] [CrossRef] [PubMed]
  4. Achten, J.; Gleeson, M.; Jeukendrup, A.E. Determination of the exercise intensity that elicits maximal fat oxidation. Med. Sci. Sports Exerc. 2002, 34, 92–97. [Google Scholar] [CrossRef]
  5. Sahlin, K.; Sallstedt, E.; Bishop, D.; Tonkonogi, M. Turning down lipid oxidation during heavy exercise—What is the mechanism. J. Physiol. Pharmacol. 2008, 59, 19–30. [Google Scholar]
  6. Chávez-Guevara, I.A.; Urquidez-Romero, R.; Pérez-León, J.A.; González-Rodríguez, E.; Moreno-Brito, V.; Ramos-Jiménez, A. Chronic Effect of FATmax Training on Body Weight, Fat Mass, and Cardiorespiratory Fitness in Obese Subjects: A Meta-Analysis of Randomized Clinical Trials. Int. J. Environ. Res. Public Health 2020, 17, 7888. [Google Scholar] [CrossRef]
  7. Wang, J.; Tan, S.; Cao, L. Exercise training at the maximal fat oxidation intensity improved health-related physical fitness in overweight middle-aged women. J. Exerc. Sci. Fit. 2015, 13, 111–116. [Google Scholar] [CrossRef] [Green Version]
  8. Ferri-Marini, C.; Sisti, D.; Leon, A.S.; Skinner, J.S.; Sarzynski, M.A.; Bouchard, C.; Rocchi, M.B.L.; Piccoli, G.; Stocchi, V.; Federici, A.; et al. HR and VO2 fractions Are Not Equivalent: Is It time to Rethink Aerobic Exercise Prescription Methods? Med. Sci. Sports Exerc. 2001, 53, 174–182. [Google Scholar] [CrossRef]
  9. Mann, T.; Lamberts, R.P.; Lambert, M.I. Methods of prescribing relative exercise intensity: Physiological and practical considerations. Sports Med. 2013, 43, 613–625. [Google Scholar] [CrossRef]
  10. Iannetta, D.; Inglis, E.C.; Mattu, A.T.; Fontana, F.Y.; Pogliaghi, S.; Keir, D.A.; Murias, J.M. Critical Evaluation of Current Methods for Exercise Prescription in Women and Men. Med. Sci. Sports Exerc. 2020, 52, 466–473. [Google Scholar] [CrossRef]
  11. Emerenziani, G.P.; Gallotta, M.C.; Meucci, M.; Di Luigi, L.; Migliaccio, S.; Donini, L.M.; Strollo, F.; Guidetti, L. Effects of Aerobic Exercise Based upon Heart Rate at Aerobic Threshold in Obese Elderly Subjects with Type 2 Diabetes. Int. J. Endocrinol. 2015, 2015, 695297. [Google Scholar] [CrossRef]
  12. Nikolovski, Z.; Barbaresi, S.; Cable, T.; Peric, R. Evaluating the influence of differences in methodological approach on metabolic thresholds and fat oxidation points relationship. Eur. J. Sport Sci. 2021, 21, 61–68. [Google Scholar] [CrossRef]
  13. Beltz, N.M.; Gibson, A.L.; Janot, J.M.; Kravitz, L.; Mermier, C.M.; Dalleck, L.C. Graded Exercise Testing Protocols for the Determination of VO2max: Historical Perspectives, Progress, and Future Considerations. J. Sports Med. 2016, 2016, 3968393. [Google Scholar] [CrossRef] [Green Version]
  14. Muscat, K.M.; Kotrach, H.G.; Wilkinson-Maitland, C.A.; Schaeffer, M.R.; Mendonca, C.T.; Jensen, D. Physiological and perceptual responses to incremental exercise testing in healthy men: Effect of exercise test modality. Appl. Physiol. Nutr. Metab. 2015, 40, 1199–1209. [Google Scholar] [CrossRef] [Green Version]
  15. Meyer, T.; Lucía, A.; Earnest, C.P.; Kindermann, W. A conceptual framework for performance diagnosis and training prescription from submaximal gas exchange parameters-theory and application. Int. J. Sports Med. 2005, 26, 38–48. [Google Scholar] [CrossRef] [PubMed]
  16. Poole, D.C.; Rossiter, H.B.; Brooks, G.A.; Gladden, L.B. The anaerobic threshold: 50+ years of controversy. J. Physiol. 2021, 599, 737–767. [Google Scholar] [CrossRef]
  17. Bircher, S.; Knechtle, B. Relationship between Fat Oxidation and Lactate Threshold in Athletes and Obese Women and Men. J. Sports Sci. Med. 2004, 3, 174–181. [Google Scholar]
  18. Emerenziani, G.P.; Ferrari, D.; Marocco, C.; Greco, E.A.; Migliaccio, S.; Lenzi, A.; Baldari, C.; Guidetti, L. Relationship between individual ventilatory threshold and maximal fat oxidation (MFO) over different obesity classes in women. PLoS ONE 2019, 14, e0215307. [Google Scholar] [CrossRef]
  19. Michallet, A.S.; Tonini, J.; Regnier, J.; Guinot, M.; Favre-Juvin, A.; Bricout, V.; Halimi, S.; Wuyam, B.; Flore, P. Methodological aspects of crossover and maximum fat-oxidation rate point determination. Diabetes Metab. 2008, 34, 514–523. [Google Scholar] [CrossRef]
  20. Bircher, S.; Knechtle, B.; Knecht, H. Is the intensity of the highest fat oxidation at the lactate concentration of 2 mmol L−1? A comparison of two different exercise protocols. Eur. J. Clin. Investig. 2005, 35, 491–498. [Google Scholar] [CrossRef]
  21. González-Haro, C. Maximal fat oxidation rate and cross-over point with respect to lactate thresholds do not have good agreement. Int. J. Sports Med. 2011, 32, 379–385. [Google Scholar] [CrossRef] [PubMed]
  22. Peric, R.; Meucci, M.; Bourdon, P.C.; Nikolovski, Z. Does the aerobic threshold correlate with the maximal fat oxidation rate in short stage treadmill tests? J. Sports Med. Phys. Fit. 2018, 58, 1412–1417. [Google Scholar] [CrossRef] [PubMed]
  23. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. Syst. Rev. 2021, 10, 89. [Google Scholar] [CrossRef] [PubMed]
  24. Schardt, C.; Adams, M.B.; Owens, T.; Keitz, S.; Fontelo, P. “Utilization of the PICO framework to improve searching PubMed for clinical questions”. BMC Med. Inform. Decis. Mak. 2007, 7, 16. [Google Scholar] [CrossRef] [Green Version]
  25. Guyatt, G.H.; Oxman, A.D.; Vist, G.E.; Kunz, R.; Falck-Ytter, Y.; Alonso-Coello, P.; Schünemann, H.J.; GRADE Working Group. GRADE: An emerging consensus on rating quality of evidence and strength of recommendations. BMJ 2008, 336, 924–926. [Google Scholar] [CrossRef] [Green Version]
  26. Julian, P.T.H.; Thompson, G.S.; Spiegelhalter, J.D. A Re-Evaluation of Random-Effects Meta-Analysis. J. R. Stat. Soc. 2009, 172, 137–159. [Google Scholar]
  27. Harrer, M.; Cuijpers, P.; Furukawa, T.; Ebert, D.D. dmetar: Companion R Package for The Guide ‘Doing Meta-Analysis in R’. R Package Version 0.0.9000. Available online: http://dmetar.protectlab.org/ (accessed on 30 June 2021).
  28. Hedges, L.V.; Vevea, J.L. Fixed- and random-effects models in meta-analysis. Psychol. Methods 1998, 3, 486–504. [Google Scholar] [CrossRef]
  29. Simon, H.A. Spurious Correlation: A Causal Interpretation. J. Am. Stat. Assoc. 1954, 49, 467–479. [Google Scholar] [CrossRef]
  30. Dunlap, P.W.; Dietz, J.; Cortina, M.J. The Spurious Correlation of Ratios that Have Common Variables: A Monte Carlo Examination of Pearson’s Formula. J. Gen. Psychol. 1997, 124, 182–193. [Google Scholar] [CrossRef]
  31. Higgins, J.P.T.; Thomas, J.; Chandler, J.; Cumpston, M.; Li, T.; Page, M.J.; Welch, V.A. Cochrane Handbook for Systematic Reviews of Interventions, 2nd ed.; John Wiley & Sons: Hoboken, NJ, USA, 2019. [Google Scholar]
  32. Van den Noortgate, W.; López-López, J.A.; Marín-Martínez, F.; Sánchez-Meca, J. Three-level meta-analysis of dependent effect sizes. Behav. Res. Methods 2013, 45, 576–594. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Douglas, B.; Mächler, M.; Bolker, B.; Walker, S. Fitting Linear Mixed-Effects Models Using lme4. J. Stat. Softw. 2015, 67, 1–48. [Google Scholar]
  34. Spiegelhalter, D.J.; Abrams, K.R.; Myles, J. Bayesian Approaches to Clinical Trials and Health-Care Evaluation, 1st ed.; John Wiley & Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
  35. Viechtbauer, W.; Cheung, M.W. Outlier and influence diagnostics for meta-analysis. Res. Synth. Methods 2010, 1, 112–125. [Google Scholar] [CrossRef] [PubMed]
  36. Scammacca, N.; Roberts, G.; Stuebing, K.K. Meta-Analysis with Complex Research Designs: Dealing with Dependence from Multiple Measures and Multiple Group Comparisons. Rev. Educ. Res. 2014, 84, 328–364. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Fernández-Castilla, B.; Declercq, L.; Jamshidi, S.L.; Beretvas, N.; Onghena, P.; Van den Noortgate, W. Detecting Selection Bias in Meta-Analyses with Multiple Outcomes: A Simulation Study. J. Exp. Educ. 2021, 89, 125–144. [Google Scholar] [CrossRef]
  38. Whiting, P.; Savović, J.; Higgins, J.P.; Caldwell, D.M.; Reeves, B.C.; Shea, B.; Davies, P.; Kleijnen, J.; Churchill, R.; ROBIS Group. ROBIS: A new tool to assess risk of bias in systematic reviews was developed. J. Clin. Epidemiol. 2016, 69, 225–234. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Higgins, J.P.; Thompson, S.G.; Deeks, J.J.; Altman, D.G. Measuring inconsistency in meta-analyses. BMJ 2003, 327, 557–560. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  40. Sterne, J.A.; Sutton, A.J.; Ioannidis, J.P.; Terrin, N.; Jones, D.R.; Lau, J.; Carpenter, J.; Rücker, G.; Harbord, R.M.; Schmid, C.H.; et al. Recommendations for examining and interpreting funnel plot asymmetry in meta-analyses of randomized controlled trials. BMJ 2011, 343, d4002. [Google Scholar] [CrossRef] [Green Version]
  41. Song, F.; Sheldon, T.A.; Sutton, A.J.; Abrams, K.R.; Jones, D.R. Methods for exploring heterogeneity in meta-analysis. Eval. Health Prof. 2001, 24, 126–151. [Google Scholar] [CrossRef]
  42. Kao, L.S.; Green, C.E. Analysis of variance: Is there a difference in means and what does it mean? J. Surg. Res. 2008, 144, 158–170. [Google Scholar] [CrossRef] [Green Version]
  43. Cohen, J. Statistical Power Analysis for the Behavioural Sciences, 2nd ed.; Routledge: New York, NY, USA, 1988. [Google Scholar]
  44. Gilpin, A.R. Table for Conversion of Kendall’S Tau to Spearman’S Rho Within the Context of Measures of Magnitude of Effect for Meta-Analysis. Educ. Psychol. Meas. 1993, 53, 87–92. [Google Scholar] [CrossRef]
  45. David, F.N. The Moments of the z and F Distributions. Biometrika 1949, 36, 394–403. [Google Scholar] [CrossRef]
  46. Akoglu, H. User’s guide to correlation coefficients. Turk. J. Emerg. Med. 2018, 18, 91–93. [Google Scholar] [CrossRef]
  47. Andrade, C. Mean Difference, Standardized Mean Difference (SMD), and Their Use in Meta-Analysis: As Simple as It Gets. J. Clin. Psychiatry 2020, 22, 20f13681. [Google Scholar] [CrossRef]
  48. Gibbons, R.D.; Hedeker, D.R.; Davis, J.M. Estimation of effect size from a series of experiments involving paired comparisons. J. Educ. Stat. 1993, 18, 271–279. [Google Scholar] [CrossRef] [Green Version]
  49. Achten, J.; Jeukendrup, A.E. Relation between plasma lactate concentration and fat oxidation rates over a wide range of exercise intensities. Int. J. Sports Med. 2004, 25, 32–37. [Google Scholar] [CrossRef]
  50. Bircher, S.; Knechtle, B.; Müller, G.; Knecht, H. Is the highest fat oxidation rate coincident with the anaerobic threshold in obese women and men? Eur. J. Sport Sci 2005, 5, 79–87. [Google Scholar] [CrossRef]
  51. Gmada, N.; Marzouki, H.; Haj-Sassi, R.; Tabka, Z.; Shephard, R.; Brun, J.F.; Bouhlel, E. Relative and absolute reliability of the crossover and maximum fat oxidation points and their relationship to ventilatory threshold. Sci. Sports 2013, 28, 99–105. [Google Scholar] [CrossRef]
  52. Meucci, M.; Peric, R. Correlation between Aerobic Threshold and Point of Maximal Fat Utilization in Male Runners. Med. Sci. Sports Exerc. 2016, 48 (Suppl. 1), 466–467. [Google Scholar] [CrossRef]
  53. Peric, R.; Nikolovski, Z. Can metabolic thresholds be used as exercise markers in adult men with obesity. Comp. Exerc. Physiol. 2020, 16, 113–119. [Google Scholar] [CrossRef]
  54. Rynders, C.A.; Angadi, S.S.; Weltman, N.Y.; Gaesser, G.A.; Weltman, A. Oxygen uptake and ratings of perceived exertion at the lactate threshold and maximal fat oxidation rate in untrained adults. Eur. J. Appl. Physiol. 2011, 111, 2063–2068. [Google Scholar] [CrossRef] [Green Version]
  55. Venables, M.C.; Achten, J.; Jeukendrup, A.E. Determinants of fat oxidation during exercise in healthy men and women: A cross-sectional study. J. Appl. Physiol. 2005, 98, 160–167. [Google Scholar] [CrossRef] [Green Version]
  56. Caiozzo, V.J.; Davis, J.A.; Ellis, J.F.; Azus, J.L.; Vandagriff, R.; Prietto, C.A.; McMaster, W.C. A comparison of gas exchange indices used to detect the anaerobic threshold. J. Appl. Physiol. 1982, 53, 1184–1189. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  57. Liu, J.; Tang, W.; Chen, G.; Lu, Y.; Feng, C.; Tu, X.M. Correlation and agreement: Overview and clarification of competing concepts and measures. Shanghai Arch. Psychiatry 2016, 28, 115–120. [Google Scholar] [PubMed]
  58. Zimatore, G.; Falcioni, L.; Gallotta, M.C.; Bonavolontà, V.; Campanella, M.; De Spirito, M.; Guidetti, L.; Baldari, C. Recurrence quantification analysis of heartrate variability to detect both ventilatory thresholds. PLoS ONE 2021, 16, e0249504. [Google Scholar] [CrossRef] [PubMed]
  59. Nicolò, A.; Massaroni, C.; Passfield, L. Respiratory Frequency during Exercise: The Neglected Physiological Measure. Front. Physiol. 2017, 8, 922. [Google Scholar] [CrossRef] [PubMed]
  60. Cannon, D.T.; Kolkhorst, F.W.; Buono, M.J. On the determination of ventilatory threshold and respiratory compensation point via respiratory frequency. Int. J. Sports Med. 2009, 30, 157–162. [Google Scholar] [CrossRef]
Figure 1. PRISMA flow diagram of the literature search and study selection process.
Figure 1. PRISMA flow diagram of the literature search and study selection process.
Ijerph 19 06479 g001
Figure 2. Graphical presentation for ROBIS results.
Figure 2. Graphical presentation for ROBIS results.
Ijerph 19 06479 g002
Figure 3. Forest plot reporting SMD. NOTE: Reported values for each study are presented by using random-effect model weights, ES, and 95% CI [12,18,19,20,21,22,49,52,53,54,55].
Figure 3. Forest plot reporting SMD. NOTE: Reported values for each study are presented by using random-effect model weights, ES, and 95% CI [12,18,19,20,21,22,49,52,53,54,55].
Ijerph 19 06479 g003
Figure 4. Correlation coefficients vs. SMD plot for exercise intensities methods, categorized by measurement units. NOTE: Bluish colors are used for display of potentially non-spurious measurement units, (i.e., bpm and L/min), and the reddish colors are used for display of potentially spurious measurement units, (i.e., %HRmax, %VO2max, and mL/min/kg), respectively [12,18,19,20,21,22,49,52,53,54,55].
Figure 4. Correlation coefficients vs. SMD plot for exercise intensities methods, categorized by measurement units. NOTE: Bluish colors are used for display of potentially non-spurious measurement units, (i.e., bpm and L/min), and the reddish colors are used for display of potentially spurious measurement units, (i.e., %HRmax, %VO2max, and mL/min/kg), respectively [12,18,19,20,21,22,49,52,53,54,55].
Ijerph 19 06479 g004
Table 1. Descriptive statistics of included studies with identified categorical moderator variables, categorised by differences in characteristics of the studies (methodological diversity) and study populations (clinical diversity).
Table 1. Descriptive statistics of included studies with identified categorical moderator variables, categorised by differences in characteristics of the studies (methodological diversity) and study populations (clinical diversity).
Study IDNrParticipants CharacteristicsPhysical Activity LevelErgometerExercise Intensity of FATmaxExercise Intensity of AerTCorrelation Made UsingAeT Detection MethodVO2max ProtocolFATmax Detection MethodFATmax Protocol
Achten et al., (2004) [49]a330.69MaleActiveCycle73.4 ± 8.0%HRmaxN/Nb/minLactateShortVisualIdentical as VO2max
b0.65MaleActiveCycle63 ± 8.6%VO2max60.9 ± 5.2%VO2maxL/minLactateShortVisualIdentical as VO2max
Bircher et al., (2004) [17]a100.75MaleActiveCycle75%VO2max77.29 ± 0.09%VO2maxL/minLactateShortVisualAdditional
b0.32MaleInactiveCycle65%VO2max49.45 ± 0.11%VO2maxL/minLactateShortVisualAdditional
c100.67FemaleActiveCycle75%VO2max76.44 ± 0.08%VO2maxL/minLactateShortVisualAdditional
d0.43FemaleInactiveCycle65%VO2max49.74 ± 0.12%VO2maxL/minLactateShortVisualAdditional
Bircher et al., (2005) [20]a480.56MaleActiveCycle22.47 ± 8.01 mL/kg/min27.73 ± 11.00 mL/min/kgmL/min/kgLactateShortVisualIdentical as VO2max
b0.43MaleActiveCycle106 ± 21 b/min118 ± 30 b/minb/minLactateShortVisualIdentical as VO2max
c0.37MaleActiveCycle135 ± 26 b/min127 ± 23 b/minb/minLactateLongVisualIdentical as VO2max
d0.43MaleActiveCycle32.5 ± 11.84 mL/kg/min27.39 ± 12.56 mL/min/kgmL/min/kgLactateLongVisualIdentical as VO2max
e300.57FemaleActiveCycle21.57 ± 6.09 mL/kg/min25.76 ± 8.99 mL/min/kgmL/min/kgLactateShortVisualIdentical as VO2max
f0.39FemaleActiveCycle113 ± 19 b/min123.9 ± 20.3 b/minb/minLactateShortVisualIdentical as VO2max
g0.46FemaleActiveCycle140 ± 23 b/min125 ± 18.7 b/minb/minLactateLongVisualIdentical as VO2max
h0.73FemaleActiveCycle31.36 ± 9.28 mL/kg/min26.32 ± 10.00 mL/min/kgmL/min/kgLactateLongVisualIdentical as VO2max
Emerenziani et al., (2019) [18]a520.94FemaleInactiveTreadmill121.6 ± 17.33 b/min123.3 ± 16 b/minb/minGas analysisShortMathematicalIdentical as VO2max
b0.82FemaleInactiveTreadmill14.7 ± 2.37 mL/kg/min15.13 ± 2.17 mL/min/kgmL/min/kgGas analysisShortMathematicalIdentical as VO2max
c0.93FemaleInactiveTreadmill69.13 ± 9.07%HRmax68.87 ± 8.87%HRmax%HRmaxGas analysisShortMathematicalIdentical as VO2max
d0.76FemaleInactiveTreadmill70.43 ± 11.43%VO2max72.87 ± 10.4%VO2max%VO2maxGas analysisShortMathematicalIdentical as VO2max
Gmada et al., (2013) [51]a120.85MaleInactiveCycleN/NN/N%VO2maxGas analysisShortMathematicalAdditional
Gonzalez-Haro (2011) [21]a11−0.02MaleActiveCycle52.3 ± 7.2%VO2max67.3 ± 12.9%VO2max%VO2maxLactateLongMathematicalIdentical as VO2max
b110.37MaleActiveCycle52 ± 5.7%VO2max63.7 ± 12.9%VO2max%VO2maxLactateLongMathematicalIdentical as VO2max
Meucci et al., (2016) [52]a500.87MaleActiveTreadmill47.4 ± 4.25%VO2max45.54 ± 4.07%VO2max%VO2maxGas analysisShortVisualIdentical as VO2max
Michallet et al., (2008) [19]a90.95MaleActiveCycle1.4 ± 0.4 L/min1.7 ± 0.6 L/minL/minGas analysisShortMathematicalAdditional
b50.52FemaleActiveCycle1.4 ± 0.4 L/min1.6 ± 0.4 L/minL/minLactateShortMathematicalAdditional
Nikolovski et al., (2021) [12]a220.80MaleActiveCycle21.34 ± 3.64 mL/min/kg22.15 ± 4.84 mL/min/kgmL/min/kgGas analysisShortMathematicalIdentical as VO2max
b0.73MaleActiveCycle61.52 ± 7.24%HRmax63.38 ± 9.75%HRmax%HRmaxGas analysisShortMathematicalIdentical as VO2max
Peric et al., (2018) [22]a570.88MaleActiveTreadmill24.74 ± 5.52 mL/min/kg23.95 ± 5.32 mL/min/kgmL/min/kgGas analysisShortVisualIdentical as VO2max
Peric et al., (2020) [53]a190.89MaleInactiveTreadmill17.23 ± 1.02 mL/kg/min17.32 ± 1.10 mL/min/kgmL/min/kgGas analysisShortVisualIdentical as VO2max
Rynders et al., (2011) [54]a740.77MaleInactiveCycle14.0 ± 7.3 mL/min/kg15.4 ± 5.7 mL/min/kgmL/min/kgLactateShortVisualIdentical as VO2max
b740.85FemaleInactiveCycle11.4 ± 7.5 mL/min/kg12.7 ± 5.8 mL/min/kgmL/min/kgLactateShortVisualIdentical as VO2max
Venables et al., (2005) [55]a1570.20MaleInactiveTreadmill45 ± 12.53%VO2max63 ± 12.53%VO2max%VO2maxGas analysisShortVisualIdentical as VO2max
b1430.23FemaleInactiveTreadmill52 ± 11.96%VO2max67 ± 11.96%VO2max%VO2maxGas analysisShortVisualIdentical as VO2max
Bircher et al., (2005) [50]a150.495FemaleInactiveCycle65%VO2max52.08 ± 12.13%VO2maxmL/min/kgLactateShortVisualAdditional
b0.655FemaleInactiveCycle65%VO2max50.18 ± 7.14%VO2maxmL/min/kgGas analysisShortVisualAdditional
c130.377MaleInactiveCycle65%VO2max46.83 ± 6.99%VO2maxmL/min/kgGas analysisShortVisualAdditional
NOTE: N/N intensity not reported, short (≤3-min) and long (>3-min), identical as VO2max (FATmax and AerT determined during one test) and additional (FATmax determined during additional test with stages length > 3 min), different ES’s within the included studies have been marked with letters for identification and analysis purposes.
Table 2. Descriptive statistics of pooled ESs according to measurement units.
Table 2. Descriptive statistics of pooled ESs according to measurement units.
Measurement UnitES (r)CI 95%PII2 Level 2 (%)I2 Level 3 (%)Qp
Beats per minute (b/min)0.6220.165 to 0.859−0.600 to 0.97390.46-62.4<0.001
Percentage of maximal oxygen uptake (%VO2max)0.5610.124 to 0.816−0.603 to 0.96292.65-74.8<0.001
Litres per minute (L/min)0.6860.385 to 0.863−0.207 to 0.95560.95-14.70.023
Millilitres per kilogram per minute (mL/kg/min)0.7770.642 to 0.8650.320 to 0.9415.3669.945.7<0.001
NOTE: %HRmax failed to fulfill the inclusion criteria for statistical significance and to accurately estimate heterogeneity.
Table 3. Subgroups meta-analysis for SMD. NOTE: A negative mean indicates FATmax preceded AerT.
Table 3. Subgroups meta-analysis for SMD. NOTE: A negative mean indicates FATmax preceded AerT.
Moderator and GroupsSMDHeterogeneity TestI2
MeanSE(95% CI)[95% PI]Q Statisticp2-Level (%)3-Level (%)
Sex
Males−0.170.19(−0.58, 0.25)[−1.49, 1.16]Q13 = 187.1<0.00140.851.9
Females−0.200.17(−0.59, 0.18)[−1.39, 0.98]Q9 = 103.5<0.00189.42.4
Physical activity level
Active−0.120.21(−0.56, 0.33)[−1.49, 1.25]Q16 = 131.4<0.00153.437.8
Inactive−0.450.22(−0.96, 0.05)[−1.55, 0.64]Q8 = 92.1<0.0011.390.1
Ergometer
Treadmill−0.070.32(−0.81, 0.67)[−1.86, 1.72]Q8 = 184.3<0.0011.794.8
Cycling−0.290.17(−0.64, 0.06)[−1.38, 0.81]Q16 = 101.0<0.00165.422.6
VO2max test
Long stages0.090.29(−0.65, 0.82)[−1.75, 1.93]Q5 = 30.1<0.00191.2/
Short stages−0.320.11(−0.54, −0.09)[−1.27, 0.64]Q19 = 207.1<0.00189.4/
AerT detection method
Gas analysis−0.230.26(−0.80, 0.35)[−1.80, 1.35]Q24 = 189.5<0.0011.393.9
Lactate analysis−0.220.18(−0.62, 0.17)[−1.38, 0.93]Q13 = 92.03<0.00171.617.8
FATmax detection method
Visual−0.070.23(−0.56, 0.41)[−1.52, 1.37]Q15 = 281.4<0.00146.748.3
Mathematical−0.480.17(−0.86, −0.09)[−1.26, 0.31]Q9 = 19.40.0228.662.2
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Peric, R.; Nikolovski, Z.; Meucci, M.; Tadger, P.; Ferri Marini, C.; Amaro-Gahete, F.J. A Systematic Review and Meta-Analysis on the Association and Differences between Aerobic Threshold and Point of Optimal Fat Oxidation. Int. J. Environ. Res. Public Health 2022, 19, 6479. https://doi.org/10.3390/ijerph19116479

AMA Style

Peric R, Nikolovski Z, Meucci M, Tadger P, Ferri Marini C, Amaro-Gahete FJ. A Systematic Review and Meta-Analysis on the Association and Differences between Aerobic Threshold and Point of Optimal Fat Oxidation. International Journal of Environmental Research and Public Health. 2022; 19(11):6479. https://doi.org/10.3390/ijerph19116479

Chicago/Turabian Style

Peric, Ratko, Zoran Nikolovski, Marco Meucci, Philippe Tadger, Carlo Ferri Marini, and Francisco José Amaro-Gahete. 2022. "A Systematic Review and Meta-Analysis on the Association and Differences between Aerobic Threshold and Point of Optimal Fat Oxidation" International Journal of Environmental Research and Public Health 19, no. 11: 6479. https://doi.org/10.3390/ijerph19116479

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop