2.1. Mass Conservation
From the Global Carbon Project [
10], we obtain the emission and concentration data. A global mass balance representation of the yearly atmospheric CO
flow is created, as it has been performed in [
11], where they prove that the CO
adjustment time is much larger than the CO
residence time. The analysis of the airborne fraction and sink rate of anthropogenic emissions by [
6] is also based on this global carbon budget (their Equation (1)).
For our purposes the components of the global carbon budget equation are
C the CO concentration of the atmosphere at the end of year i,
the global emissions of human origin during year i,
the global land use net emissions during year i,
the global natural net emissions during year i,
other special causes of emissions, such as El Nino, volcanos, etc.,
the global net absorption of CO during year i into the oceans and biosphere ():
Without explicit external information,
cannot be discriminated from
or
. Therefore, we set
to 0 and include all inferred special causes in the unknown N
in this investigation. With
the equation becomes
This is not a model, but the formulation of the necessary mass conservation as seen from the atmosphere, similar to a bank account with cumulative yearly deposits and withdrawals, not directly dealing with the daily ups and downs or with the exact nature of the earning and spending processes. As a matter of fact, Equation (2) must be fulfilled at all time scales.
For consistency, all quantities have to be converted to the same unit (1 ppm = 2.124 GtC, 1 GtC = 3.664 Gt CO
[
10]). Here, all calculations are performed with the unit “ppm”. All CO
-related data, emissions, land use change, and CO
concentration growth are from the Global Carbon Budget 2021, covering the years 1850–2020. Land use change data are subject to considerable uncertainty, with an error range of
ppm.
We assume a value for land use change emission that is 0.2 ppm lower than the published mean value, as discussed further below. This is well within the error range and, therefore, justifiable.
Figure 1 shows that the total emissions (
, blue) come to exceed the yearly CO
concentration growth (
, green), indicating that we have an increasingly effective absorption (
, red ) with growing CO
concentration.
Another benefit of displaying these raw data is that, before 1900, the anthropogenic emissions were considerably smaller than other variations, such as land use change. In fact, there are roughly four historical phases:
The phase before 1900, where explicit emissions are smaller than implicit ones, due to land use change; however, there is a small, but increasing, CO concentration growth.
The phase between 1900 and 1950 with growing emissions, but approximately constant CO concentration growth and slightly increasing land use change.
The phase from 1950 to 2010 with growing emissions and growing concentration growth.
Recent publications indicate that emissions have remained approximately constant since 2010 [
12] and are expected to remain approximately constant for the forseeable future [
13] (their Figure 2, stated policies scenario). The challenge is to estimate reasonable projections of CO
concentration based on these emission assumptions.
2.2. Exploratory Analysis
As a first exploratory analysis of these data, the scatter plot in
Figure 2 relates the effective CO
absorption to the CO
concentration.
Qualitatively, we see a long-term linear dependence of the effective absorption on the atmospheric CO concentration with significant short-term deviations, where the effective zero-absorption line is crossed at approx. 280 ppm. This is considered to be the pre-industrial equilibrium CO concentration, where natural yearly emissions are balanced by the yearly absorptions. The average yearly absorption is approx. 2.5% of the CO concentration exceeding 280 ppm.
While the correlation coefficient of 0.97 is very high, there are clear non-random deviations from an ideal linear behaviour. With the large uncertainty of the land use change and contingent effects, such as volcano eruptions and influences such as ENSO (El Nino Southern Oscillation), it is not surprising that there are systematic deviations from a perfect line [
8].
Regarding the predictions of future CO
concentration, the key question is whether the deviations are averaging out or whether there is a systematic saturation trend that is limiting the absorption of CO
. Some climate researchers claim that the absorption will decline [
14], but there are other papers providing evidence of increasing absorption [
15].
We can see from these data that a reliable estimate of the historical equilibrium concentration requires the whole data range. An estimation based on all data above 310 ppm (year 1950) results in a smaller equilibrium value of 245 ppm.
Starting from the mass conservation Equation (2), above, on the basis of the preceding considerations, we state two hypotheses, from which the actual model is derived:
2.3. Hypothesis 1: The Absorption A Is Proportional to Previous
CO Concentration
The physical justification for this assumption is the fact that the partial pressure of CO
, which is relevant for absorption processes, increases in proportion to concentration. It is also known that C
plants, representing the majority of all plants, have a linear absorption property. The absorption property of C
plants is nearly flat, but also linear in the range of 280–560 ppm, resulting in a linear behaviour when averaging over all plants. Halparin [
7] analysed the different processes of gas transport into the ocean, with the conclusion that all relevant processes can be linearised (his Equation (14)):
Net primary production of plants,
Phytoplankton production, a smaller effect of dead biomass sinking to large depths of the ocean,
Gas exchange with the deep ocean.
Assuming that there are different absorption constants for oceanic (a
), phytoplankton (a
), and biospheric absorption (a
), under the linearity assumption, they can be added to a single constant a:
Both the oceanic and biospheric processes may consist of multiple sub-processes, e.g., the photosynthesis of C plants has a much larger proportionality constant to that of C plants in the relevant CO concentration range of 280.. 560 ppm. As long as linearity holds, the net absorption constant is reflected by the sum of all elementary absorption constants.
This is a radical simplification of the box-diffusion model [
16] referred to in [
1]. Instead of assuming separate boxes for the mixed layer and for the biosphere, we assume a one-dimensional diffusion process between the atmosphere, ocean, and biosphere, with a single diffusion constant, making no explicit assumptions about the properties of the mixed layer nor the mechanism of the absorption in the biosphere. The advantage of this model is that we do not have to make any speculative assumptions about numerous possible model parameters, some of which are quite arbitrary (e.g., the thickness of the mixed layer), but instead, restrict the whole model to a single absorption parameter and an additive constant, both of which are estimated from measured data. While we do not know the contributing components, we can measure their total effect over time.
The authors of the Bern model claim that the “net primary production of the land biosphere and the surface ocean carbon uptake depend on atmospheric CO
and surface temperature in a nonlinear way” [
4] by assuming a superposition of four absorption processes. This statement contradicts our assumption of a single absorption process. We will show in the appendix that the impulse response functions of the Bern model ([
4], their Equation (21)), can be mapped into the form of our model with a time varying absorption parameter,
a (
Appendix A). Statistical tests will decide if there is a need to actually introduce this time dependence or whether it is more appropriate to assume a constant absorption parameter, making relative absorption a linear function of CO
concentration. Any deviations from the validity of a linear model will show up in the residual error. This gives our model a method of intrinsic validation, and the model can be extended when required.
2.3.1. Temperature Dependence of the Absorption Parameter
We have to consider the possibility that the absorption parameter,
a, depends on temperature. Investigations of ice core data clearly indicate a temperature dependency of CO
concentration [
17]. The open question is whether there is a measurable dependency during the time scale and the temperature scale of our investigation. We will hypothetically test the temperature dependence of absorption parameter
a with a linear dependency on sea surface temperature anomaly T with the HadSST2 temperature dataset T
[
18]. The linear dependence of the absorption parameter on the temperature anomaly is assumed, as in the Bern model [
4]:
2.3.2. CO Concentration as a Hypothetical Proxy for Temperature
When we make predictions with hypothetical future CO
emissions, we do not know the future temperatures. Without diving into the problematic discussion about the degree, how strong the influence of CO
concentration is on temperature [
19], we assume the “worst case” of full predictability of temperature effects by CO
concentration (
Figure 3).
Without making any assumptions about C
T
causality, the estimated functional dependence of the temperature proxy from the regression on CO
concentration was found to be
We are aware that this is a very incomplete model. It ignores the obvious, significant, trend-reversing deviations between 1900 and 1975, and it also ignores the dominant contribution of cloud albedo reduction to global warming [
20], whereby 80% of recent warming is caused by albedo reduction and only 20% by increase of CO
concentration. Nevertheless, the proxy is still a suitable tool for estimating an upper bound of temperature dependence on CO
concentration. Based on measured data, it avoids speculations and discussions about hypothetical feedback factors of CO
sensitivity.
2.3.3. Corollary: Carbon Sinks Are Not Expected to Be Saturated in the Near Future
This is related to, and is a consequence of, the linearity assumption. Much of the debate about carbon sinks is concerned with numerous details about the possible saturation of various “boxes” in the models [
21]. There are strong reasons to consider the atmosphere and the mixed layer, i.e., the top 75 m of the ocean as a single “box”, which exchanges gases with the deep ocean and the biosphere [
7]. Due to the fact that the deep ocean contains more than 50 times the CO
of the atmosphere, or 4000 times the yearly global emissions, this means that the whole atmospheric content is just about 2% of the ocean content. Therefore, we do not expect this huge “box” to be saturated any time soon.
There are four indications supporting the assumption that the up-taking reservoirs are not saturated:
- (1)
We can test the past 70 years for linearity. If there was any sign of saturation, this would have shown up as a deviation from the linearity assumption. We will see that, in the residual deviations from the model, if the actual absorption parameter decreased with time, the real CO concentration at the end would be larger than estimated by the model.
- (2)
The global carbon budget [
10] clearly shows an increasing trend in both the ocean sink, as well as the (biosphere) land sink.
- (3)
A recent article revised the estimates of the ocean-atmosphere CO
flux [
15], making it consistent with the increasing ocean sink found in the global carbon budget.
- (4)
We can make a rough estimation of the expected ocean uptake. The ocean has a total carbon inventory of 38,000 GtC ≈ 140,000 Gt CO. If we assume the realistic scenario of constant future emissions at today’s level (37 Gt CO per year) and we assume that they are all absorbed by the ocean, by 2100 that would be approx. 3000 Gt CO, just about 2% of the current ocean inventory.
Whatever our subjective opinion may be regarding future absorption, the measured data provide us with the current trend and its potential changes in the near future.
2.4. Hypothesis 2: Natural Emissions and Absorptions Are Balanced
This implies that, without anthropogenic emissions, C
= C
= C
, resulting in a constant equilibrium concentration C
. Equations (1)–(3) imply that global natural emissions are constant:
This relation makes a falsifiable statement about the magnitude of those natural emissions, which are not compensated by absorptions within the time unit of measurement, which is a year in this investigation. The statement of assumed constant equilibrium concentration requires further clarification. We know, e.g., from ice core investigations that historical CO concentration is not constant and most likely depends on temperature. A linear dependence on temperature can be mapped onto a linear dependence of the absorption parameter a on temperature, which is covered by Equation (5).
As we know, there are causes for systematic changes in the natural emissions, e.g., volcanic eruptions, ocean cycles, and changes of land use. We will see from the residual deviations of the measured data how significant these influences are and if the model needs to be adapted. For the time being, we initially assume no changes of natural emissions within the investigated time range 1850–2020. As with the previous assumption, the residual error of the model will lead to possible further fine-tuning of the model. Three possible deviations are possible:
A systematic “trend” in the natural emissions. This would either increase or decrease the estimated absorption factor and the equilibrium concentration given a constant model of natural emissions,
Short-term zero centered variations within a year. These variations do not show up in our model, due to the one year sampling interval,
Long-term variations of more than a year are not averaged out. They are visible in the residual error of the predicted CO content.
2.5. The Modelling Equations
From Equations (2), (3), and (7), we obtain the final regression equation for an assumed constant absorption parameter,
a:
where the innovation or residual,
, is an unselfcorrelated random variable with zero mean. Equation (8) emphasises the fact that the effective absorption depends linearly on the difference between the actual and the equilibrium CO
concentration C
. This implicitly includes the natural carbon cycle, described by the equilibrium concentration C
. Initially, we estimate
as the constant natural emissions simultaneously with
a in this regression equation, for estimating temperature-dependent models we assume a fixed known value for the equilibrium CO
concentration
and estimate the absorption function
a.
Starting with the available data, from emissions , land use change, , CO growth, and the absorptions are modelled according to Equation (8) for the time interval 1850–2014,
For
, the emission change per year due to land use, the uncertainty is considerable [
10]. Therefore, we have a certain amount of freedom to adapt its value, in order to satisfy other given constraints.
The absorption constant,
a, and the natural equilibrium concentration
in a given time interval are obtained through estimation with the least squares method, where the dependent variable is the left hand side of Equation (8), and the independent variable is
, by means of the Python module OLS (statsmodel-OLS-0.13.5). The results are shown in
Table 1.
This results in ppm, with error bounds [277, 282] ppm, and a half-life time of an emission pulse years, with error bounds [26, 30] years.
is very close, and its error bounds contain the widely accepted pre-industrial equilibrium CO
concentration of 280 ppm. As can be seen from model Equation (8), we can substitute global variations of
and
When using the center value of the land use change error band, i.e., on average 0.55 ppm, the calculated would have been ppm, which we consider to be too small to be compatible with the accepted value of 280 ppm. Therefore, in the face of the large uncertainty of land use change estimates, we prefer to assume slightly lower average land use change-caused emissions (average 0.35 ppm) over an inconsistent equilibrium concentration. This substitution, however, only changes the equilibrium concentration.
Modelling the raw, noisy differential absorption data with this—simplest possible—model shows a fairly good approximation over the whole time span. The residual
shows variations, but no systematic trend over the time span 1850–2014 (
Figure 4).
The blue differential effective absorption data are approximated by the orange model curve, and the green curve shows the residual deviations. The standard deviation of the residuum is 0.2 ppm, the same order of magnitude as the uncertainty of emission and concentration measurements, in particular,
. We can, therefore, safely assume that the residual error reflects zero mean deviations of land use change or natural emissions, as discussed in [
8].
We notice that, before 1900, the absorptions are smaller than this error, which means that analysing absorption values at times before 1900 is not meaningful. We also observe that, after 1950, when the quality of measurements dramatically increased, there is much more variability in the differential measurements of CO concentration. This justifies doubts about the data quality before 1950 and justifies a separate analysis of the much more reliable data after 1950.
Next, we investigate the possible dependence of the absorption parameter
a on the sea surface temperature from the data set HadSST2 [
18]. Using Equation (8) with temperature-dependent variable a from Equation (5) leads to a more complex three parameter estimation problem:
This cannot be easily solved in closed form when
is variable. After showing that the data are consistent with
ppm in the case for constant absorption
a, we assume
to be constant as an
a priori condition and simplify the model equation by fixing
to this value and only estimate the absorption parameters
and
from the data. Implicitly, this means that we make use of the assumption that the equilibrium concentration is constant. The results of the temperature-dependent, two-parameter absorption estimation problem are displayed in
Table 2.
The temperature-dependent parameter is statistically significant, and we obtain a slightly negative trend of the absorption parameter with increasing sea surface temperature since 1900.
The negative temperature dependence tells us that, before 1900, the absorption has no identifiable trend and that, between 1900 and 2000, the absorption appears to have decreased. The relative absorption in 1900 of 3% corresponds to the half-life time of 23 years for an emission pulse. However, in 2014, the relative absorption is still larger than 2.3% of the CO concentration, which corresponds to a half-life time of 30 years for an emission pulse. The fact that the absorption parameter changes with time in this model variant prohibits the use of a time-invariant convolution kernel for computing the CO concentration from emissions. Before we draw conclusions from this result, we need to validate the estimation.
2.5.1. Prediction and Model Validation
In order to validate the model, we are in the comfortable situation, in that there is a long time series, so we can perform an ex-post predictions by restricting the training data of the model to the year 2000 and make predictions of the CO concentration of the years 2001–2020, which are available for comparison.
In the validation process, we compare all 3 discussed model variants:
Assumed constant absorption parameter;
Assumed temperature-dependent absorption parameter;
Assumed absorption parameter dependent on temperature modelled by CO concentration.
2.5.2. Estimation with Limited Data Range and Model Validation
The estimation results based on historical data may depend, to a certain extent, on the selected time interval, in particular, when we let the value of the equilibrium concentration of CO
be determined from the data. This explains why previous authors arrive at such different results for the equilibrium concentration [
7,
8]. There are several reasons for constraining the data range:
As stated above, there is no large variability of both CO emissions and CO concentration before the year 1900. Moreover, the measurements at that time were not really reliable. Therefore, the signal-to-noise ratio is so large that, for the determination of concentration changes as a function of CO emissions, it is better to dispense with these data.
We want to evaluate the predictive quality of the data model. Therefore, we limit the training data to 1999 and compare the predicted CO concentration of the years 2000 to 2020 with the actual measurements.
We further argue that the data of the first part of the 20th century are also not really reliable, indicated, e.g., by the nearly constant yearly change of CO
concentration, despite growing emissions, as well as the extreme uncertainty of land use change data. We also know that CO
concentration measurement methods have greatly improved in the 1950s [
22]. We will, therefore, make an evaluation with training data from 1950 to 1999 and build the model based on these data.
We allow the investigated data interval to have its own equilibrium concentration, according to the available data. The equilibrium concentration is determined by the initial model with constant absorption parameter a.
2.5.3. Estimation Based on Data from 1950 to 2000
The much better CO
concentration measurements after 1950, in conjunction with the fact that the overwhelming bulk of anthropogenic emissions were released after 1950, justify investigating the second half of the 20th century separately. The result of the parameter estimation is displayed in
Table 3.
This implies an equlibrilium CO concentration of C = 242 ppm with the error bounds [232, 251]. The half-life time of an emission pulse is 44 years with the 95% error bounds [39, 48].
The temperature-dependent estimation, according to Equation (11), with fixed C
ppm, leads to the results of
Table 4.
The temperature-dependent part of the absorption is clearly not significantly different from 0. When using the CO
temperature proxy from Equation (6), we obtain the results in
Table 5.
Using the CO
proxy for temperature, there is also no significant temperature dependence. Therefore, we are forced to take the model with constant absorption parameter as the best possible absorption model for the 50 years from 1950 to 2000 (
Figure 5).
The diagram in
Figure 5 confirms that there is no deviation from the constant absorption parameter curve when assuming a hypothetical temperature dependence.
2.5.4. Validation Based on Data from 1950 to 2000
Based on the model parameters from the 1950–2000 data, recursive evaluation of Equation (8) with future emission and land use data allows the prediction of future CO concentrations from .
Figure 6 shows an excellent prediction in the center of the 95% gray error bar. There are small apparently periodical variations between the predictions and the actual data. Spencer [
8] has explained these systematic deviations of up to 1 ppm with the Multivariate ENSO Index [
23], further improving the already excellent fit. For projections of future emission scenarios, these small deviations, which are symmetric with respect to zero, do not play a significant role. Spencer also identifies volcanic activities, e.g., the Pinatubo eruption, but also these small deviations do not change the functional dependency between anthropogenic emissions and CO
concentration in a significant way.