Machine-Learning-Based Approach to Optimize CO2-WAG Flooding in Low Permeability Oil Reservoirs

Gao, Ming; Liu, Zhaoxia; Qian, Shihao; Liu, Wanlu; Li, Weirong; Yin, Hengfei; Cao, Jinhong

doi:10.3390/en16176149

Open AccessArticle

Machine-Learning-Based Approach to Optimize CO₂-WAG Flooding in Low Permeability Oil Reservoirs

by

Ming Gao

^1,2,*,

Zhaoxia Liu

^1,2,

Shihao Qian

³

,

Wanlu Liu

^1,2,

Weirong Li

³,

Hengfei Yin

^1,2 and

Jinhong Cao

^1,2

¹

PetroChina Research Institute of Petroleum Exploration & Development, Beijing 100083, China

²

State Key Laboratory of Enhanced Oil and Gas Recovery, Beijing 100083, China

³

Department of Petroleum Engineering, Xi’an Shiyou University, Xi’an 710065, China

^*

Author to whom correspondence should be addressed.

Energies 2023, 16(17), 6149; https://doi.org/10.3390/en16176149

Submission received: 31 July 2023 / Revised: 14 August 2023 / Accepted: 19 August 2023 / Published: 24 August 2023

(This article belongs to the Special Issue Advances in Carbon Capture, Utilization and Storage Technologies (CCUS))

Download

Browse Figures

Versions Notes

Abstract

:

One of the main applications of carbon capture, utilization, and storage (CCUS) technology in the industry is carbon-dioxide-enhanced oil recovery (CO₂-EOR). However, accurately and rapidly assessing their application potential remains a major challenge. In this study, a numerical model of the CO₂-WAG technique was developed using the reservoir numerical simulation software CMG (Version 2021), which is widely used in the field of reservoir engineering. Then, 10,000 different reservoir models were randomly generated using the Monte Carlo method for numerical simulations, with each having different formation physical parameters, fluid parameters, initial conditions, and injection and production parameters. Among them, 70% were used as the training set and 30% as the test set. A comprehensive analysis was conducted using eight different machine learning regression methods to train and evaluate the dataset. After evaluation, the XGBoost algorithm emerged as the top-performing method and was selected as the optimal approach for the prediction and optimization. By integrating the production prediction model with a particle swarm optimizer (PSO), a workflow for optimizing the CO₂-EOR parameters was developed. This process enables the rapid optimization of the CO₂-EOR parameters and the prediction of the production for each period based on cumulative production under different geological conditions. The proposed XGBoost-PSO proxy model accurately, reliably, and efficiently predicts production, thereby making it an important tool for optimizing CO₂-EOR design.

Keywords:

CO₂-EOR; XGBoost regression; PSO; parameter optimization

1. Introduction

Oil resources have always been the primary source of fossil energy for global energy demand. However, extracting the remaining oil from complex reservoir formations remains a challenge, thereby making it increasingly important to enhance extraction efficiency [1]. In addition, the use of fossil fuels leads to the emission of a large amount of CO₂, which is a greenhouse gas, thereby resulting in global climate change, which has become a major challenge facing the world today [2]. Carbon capture, utilization, and storage (CCUS) technology is considered to be an important approach to mitigating global climate change [3]. Based on the challenges mentioned above, CO₂-enhanced oil recovery (CO₂-EOR) technology is a method within CCUS that can enhance oil recovery by using the CO₂ in a miscible displacement process and effectively sequestering CO₂ in the lower portion of the reservoir. This technology has relatively low requirements for the purity of the CO₂ and allows for the recycling of CO₂, thereby reducing process costs [4]. CO₂-EOR, with its potential to significantly enhance oil recovery while achieving carbon capture, utilization, and storage, represents an EOR method that offers both societal and economic benefits.

The injection of CO₂ is applied to enhance the oil recovery (EOR) due to its superior capabilities in improving the fluid properties under reservoir conditions. The fundamental mechanisms of CO₂-EOR include reducing interfacial tension (IFT), lowering oil viscosity, oil swelling, and light hydrocarbon extraction. These mechanisms contribute to the enhanced recovery of the oil in reservoirs [5,6,7]. Compared to other gases such as natural gas, air, and nitrogen (N₂), carbon dioxide (CO₂) has a lower minimum miscibility pressure (MMP) than oil. Therefore, selecting CO₂ as the injection gas for displacing oil can achieve better miscibility and more effectively recover the oil [8]. In addition to continuous CO₂ injection for miscible displacement, the CO₂-WAG (water alternating gas) technique has been proposed to improve the flowability of CO₂ in the reservoir and to prevent CO₂ fingering. This technique involves alternating injections of CO₂ and water, which enhance the efficiency of the CO₂ propagation and oil displacement [9,10]. The CO₂-WAG technique was first used by Mobil Corporation in 1957 in a sandstone reservoir in Alberta, Canada. In addition, the CO₂-WAG technique alleviates the issue of rapid CO₂ breakthrough and increases the resistance to gas phase flow. It also reduces the resistance to the water phase flow and increases the mobility ratio [11]. According to surveys, the WAG technique has achieved significant success and has been employed in 80% of the oilfield projects in the United States, thereby demonstrating its superiority in enhancing oilfield development and improving oil recovery [12]. Christensen et al. [13] conducted a study on 59 WAG fields and found that, in all WAG cases, the average oil recovery rate increased by 10%. This demonstrates the positive impact and effectiveness of WAG technology in enhancing oil field production. Al-Bayati et al. [14] investigated the impact of core-scale heterogeneity on the oil recovery efficiency of CO₂-WAG injection. The research findings indicated that CO₂-WAG injection exhibited better performance in homogeneous, layered, and composite samples. Sun et al. [15] investigated the feasibility of the CO₂ phase through porous media in WAG injection scenarios and successfully increased the oil recovery factor (RF) by approximately 46%. The gas-to-water injection ratio was identified as a crucial parameter affecting the efficiency of water-gas alternating injection [16,17]. Khather et al. [18] investigated the impact of CO₂–carbonate interaction on the oil and gas recovery in three heterogeneous carbonate rock core samples with different initial oil saturations (low and moderate permeability). Overall, CO₂-WAG injection after water flooding resulted in an increase in the recovery factor of over 30% for the three rock cores. Ren et al. [19] conducted experiments on CO₂-EOR and storage in oilfields in the Ordos Basin in China using two CO₂ injection schemes: continuous injection (CI) and water alternating gas (WAG) injection. The results showed that the equal injection of CO₂ and WAG significantly increased the crude oil production.

Currently, the optimization of CO₂-EOR technology is a focal point of attention for many oilfield and reservoir engineers. Rodrigues et al. [20] utilized the CMG reservoir numerical simulation software (Version 2021) to optimize the application of WAG in a sub-salt offshore oilfield in Brazil. They proposed a CO₂-WAG operational design method suitable for carbonate reservoirs, with a focus on economic viability, CO₂ recycling efficiency, and project risks. However, traditional parameter optimization methods are time-consuming and labor-intensive. They tend to overlook complex nonlinear relationships and the underlying influencing factors. Moreover, these methods are often based on specific models and algorithms, thus lacking the flexibility to adapt to different oilfield situations and variations. They have certain limitations and lack adaptability. Therefore, the introduction of more advanced optimization techniques, such as machine learning and metaheuristic algorithms, can better address the complexity and uncertainty of oilfields, thereby enhancing the optimization efficiency and accuracy.

At the current stage, rapidly evolving intelligent algorithms, such as machine learning, have found significant applications in the field of petroleum exploration and development. Sen et al. [21] employed a Specialized RNN Unit (SRU) model, which is a type of recurrent neural network (RNN), to optimize the parameters and predict the production in actual CO₂-EOR projects. The injection rate, injection pressure, cumulative injection volume of the injection wells, and bottom hole flowing pressure of the production wells were used as inputs for the SRU model, while the fluid production of the production wells served as the output. Li et al. [22] utilized the random forest (RF) regression algorithm to predict the performance of the CO₂-WAG technique, including oil well production, CO₂ storage volume, and CO₂ storage efficiency. The CO₂-WAG cycle, CO₂ injection rate, and water-gas ratio were identified as the main injection parameters. The prediction results showed a close approximation between the predicted values and the actual values in the test set. The average absolute prediction deviations for cumulative oil production, CO₂ storage volume, and CO₂ storage efficiency were 1.10%, 3.04%, and 2.24%, respectively. He et al. [23] proposed an optimization workflow for CO₂-EOR operations based on machine learning methods and heuristic optimization algorithms. Their workflow included a power consumption prediction using a Gaussian process regression (GPR) model, which combines a nonlinear autoregressive neural network with external inputs (NARX) model for oil production prediction and an operational optimization model. The optimization results were significant; the optimization parameters used included the duration of the water/gas alternating injection cycles, the bottom hole pressure of the production wells, and the injection rate of water.

Some researchers, in order to swiftly explore the solution space and find the global optimal solution, have combined metaheuristic algorithms with machine learning. By harnessing the predictive capability of machine learning to guide the search process of metaheuristic algorithms, they can quickly identify the optimal solution and achieve better results in parameter optimization. In 2018, Mohagheghia et al. [24] utilized a robust evolutionary algorithm to automatically optimize the performance of the hydrocarbon WAG technique used in the E segment of the Norne oilfield. They employed the net present value (NPV) as the objective function and two global semi-random search strategies, namely, the genetic algorithm (GA) and particle swarm optimization (PSO). Parameters such as the water injection volume, gas injection volume, bottom hole pressure of producing wells, cycle ratio, cycle duration, injected hydrocarbon gas fraction, and total WAG cycle were optimized. You et al. [25] combined Gaussian-SVR (support vector regression) with a Gaussian kernel to construct a surrogate model, and the hyperparameters of the surrogate model were optimized using Bayesian optimization. The trained surrogate model was then coupled with a multi-objective particle swarm optimization (MOPSO) protocol. This approach was used to optimize the complex CO₂-WAG process, which involves many control parameters. The optimization parameters included operational variables for controlling the CO₂-WAG process, such as the duration of the water/gas alternating injection cycle, the bottom hole pressure control, and the injection rates for each well. Jaber [26] utilized the genetic algorithm (GA) technique based on the surrogate model to optimize the most influential parameters in the CO₂-WAG process in the Subba-Nahr Umr reservoir. Four operational variables were considered for optimizing the CO₂-WAG displacement: the CO₂-to-water slug size ratio (WAG), cyclic length (CL), bottom hole pressure (BHP), and CO₂ slug size (SZ). The results demonstrated that the highest incremental oil recovery (∆FOE) of 9.7% in the Subba-Nahr Umr reservoir could be achieved with a WAG ratio of 1.5, a cyclic length of 3 months, a bottom hole pressure of 2221 psi, and a CO₂ slug size of 0.91. Based on the above, it can be observed that, in most cases of CO₂-EOR parameter optimization, the dataset is relatively small, and the optimization objective functions often only include specific time points of production, which cannot form a complete production curve. As a result, there are limitations and particularities. Due to the lack of complete production curves and large-scale time series data, machine learning and other prediction methods may not fully leverage their advantages and may struggle to achieve global optimization results.

This study proposes a comprehensive workflow for optimizing CO₂-EOR (WAG) parameters by combining reservoir numerical simulation with machine learning. In Section 2, the machine learning methods used in this study are described, along with the workflow. Section 3 focuses on establishing the geological and numerical models of the reservoir. In Section 4, the study conducted a correlation analysis of geological and operational parameters. The performance of production prediction models based on different machine learning models was evaluated, and the best machine learning model was selected. In Section 5, the selected machine learning model, combined with particle swarm optimization (PSO), was used for capacity prediction and parameter optimization. Discussions and conclusions are presented in Section 6.

2. Methods

This section describes the methodological principles and workflow of the main algorithms used in this study. Eight machine learning methods were employed to build the prediction models, including linear regression [27,28], ridge regression [29], decision tree (DT) [30], random forest (RF) [31,32], gradient boosting decision tree (GBDT) [33], extreme gradient boosting (XGBoost) [34], K-nearest neighbors (KNN) [35], and neural network (NN) [36]. This study proposes a coupled model of the machine learning algorithm XGBoost and particle swarm optimization (PSO) [37] to address the optimization problem. Therefore, the focus is on introducing the XGBoost algorithm and the particle swarm optimization algorithm (PSO).

2.1. XGBoost Algorithm

XGBoost is an expandable tree boosting system proposed by Chen et al. [34]. It is an improved version of the gradient boosting decision tree (GBDT) algorithm [38] and is widely used in classification and regression tasks. The basic idea of XGBoost is similar to GBDT, but it incorporates several optimizations, which include the following:

Optimizing the loss function by employing a second-order Taylor expansion to enhance computational accuracy.
Simplifying the model using regularization terms to avoid overfitting [39].
Utilizing a block storage structure to enable parallel computing and improve efficiency.

The structure of the XGBoost algorithm is illustrated in Figure 1, and the model details are described below.

Given a training dataset

T = \{(x_{1}, y_{1}), (x_{2}, y_{2}), \dots, (x_{n}, y_{n})\}

, a loss function

l (y_{i}, {\hat{y}}_{i})

, and a regularization term

Ω (f_{k})

, the objective function can be expressed as follows:

L (ϕ) = \sum_{i} l (y_{i}, {\hat{y}}_{i}) + \sum_{k} Ω (f_{k}),

(1)

where

L (ϕ)

is the representation in the linear space,

i

denotes the

i

-th sample,

k

represents the

k

-th tree,

{\hat{y}}_{i}

hat

i

is the predicted value of the

i

-th sample

x_{i}

, and

\sum_{k} Ω (f_{k})

represents the complexity of the trees.

Due to the expression of the objective function in GBDT, we can rewrite it as follows:

{\hat{y}}_{i} = \sum_{k = 1}^{K} f_{k} (x_{i}) = {\hat{y}}_{i}^{(t - 1)} + f_{t} (x_{i}) .

(2)

In this case, the expression of

L (ϕ)

can be transformed into the following form:

L^{(t)} = \sum_{i = 1}^{n} l (y_{i}, {\hat{y}}_{i}^{(t - 1)} + f_{t} (x_{i})) + \sum_{k} Ω (f_{k}) .

(3)

2.2. Particle Swarm Optimization (PSO)

Particle swarm optimization (PSO) is an evolutionary computation technique that was first introduced by Eberhart and Kennedy in 1995 [37]. The basic concept of PSO originates from the study of the foraging behavior in bird flocks and is a simplified model of swarm intelligence algorithms. The algorithm was initially inspired by the regular patterns observed in the movements of prey bird flocks, which led to the development of a simplified model using collective intelligence. PSO utilizes collaboration and information sharing among individuals within a swarm to search for the optimal solution [41].

Figure 2 shows the flow of the PSO algorithm, where each particle individually searches for the optimal solution in the search space. The optimal solution is recorded as the current individual extremum and shared with the other particles in the entire particle population. The particles move at a certain speed in the search space, wherein they dynamically adjust their respective speed and position according to their own flight experience and the flight experience of other particles [42].

The equation to update particle velocity in the PSO algorithm is as follows:

V_{n e w} = ω V_{i d} + C_{1} r a n d o m (0,1) (P_{i d} - X_{i d}) + C_{2} r a n d o m (0,1) (P_{g d} - X_{i d}),

(4)

where

V_{i d}

is the current velocity of the particle;

ω

is the inertia factor (with velocity there is motion inertia);

r a n d o m (0,1)

is the random number generation function that generates random numbers between 0 and 1;

P_{i d}

is the current position of the particle;

X_{i d}

is the global best position of this particle;

P_{g d}

represents the current best position among all particles in the population; and C₁ and C₂ denote the learning factors, which learn from the best position in the history of this particle and the best position in the population, respectively.

2.3. Workflow

As shown in Figure 3, the process of prediction and the parameters optimization of CO₂-EOR can be divided into three steps:

Step 1: Numerical Model and Database Establishment. Extensive literature research is conducted to gather knowledge on optimizing CO₂-EOR parameters and production profiles. The reservoir numerical simulation software CMG (Version 2021) was utilized to build the CO₂-EOR numerical model. By employing the Monte Carlo method, 10,000 sets of different reservoir models were randomly generated and simulated to obtain corresponding production curves for various geological parameters, fluid parameters, relative permeability parameters, and injection/production parameters.

Step 2: Machine Learning Model Selection. Firstly, a correlation analysis was conducted to assess the relationships between different CO₂-EOR parameters. Then, using the dataset generated in the first step, which consisted of 10,000 sets of diverse parameters and corresponding production curves, the machine learning models were trained and evaluated. Eight different machine learning models were employed and trained with the dataset to determine their performance in predicting CO₂-EOR production. Through thorough evaluation and comparison, the XGBoost algorithm was selected as the best-performing machine learning method for this study.

Step 3: CO₂-EOR Production Prediction and Parameter Optimization. The XGBoost-PSO proxy model was employed to predict CO₂-EOR production and optimize the CO₂-EOR parameters.

3. Establishment of Numerical Model and Database

3.1. Establishment of CO₂-EOR Numerical Model

First, based on the actual geological parameters of the oilfield, a characterization model was established, which took into account factors such as well spacing, fluid properties, and heterogeneity. The numerical model consisted of a grid with dimensions of 21 × 21 × 5, with a grid spacing of 10 m in the I direction, 10 m in the J direction, and 5 m in the K direction. Therefore, the feature model had dimensions of 210 m in length, 210 m in width, and 25 m in depth. The well pattern was deployed as a 1/4 five-spot pattern, with one injector well and one producer well per pattern, as shown in Figure 4. The basic parameters of the feature model are described in Table 1.

3.2. Establishing the Database

After building the geologic model, a large dataset needs to be generated to train the predictive model built using machine learning. In this study, a numerical model was used to randomly generate cumulative production data for 10,000 sets of geological and completion parameters. This study investigated a total of 24 parameters, including geological parameters, fluid parameters, initial conditions, and injection/production parameters. The parameters included in this study are as follows. The geological parameters included the following: initial pressure, porosity, permeability, temperature, and spacing in the I, J, and K directions. The fluid parameters included the following: oil density, gas specific gravity, residual oil saturation index, water saturation, oil saturation, oil viscosity, and phase mixing parameter. The phase saturation parameters included the following: residual water saturation, residual oil saturation in the oil-water system, residual oil saturation in the gas-liquid system, and residual gas saturation. The injection/production parameters included the following: gas injection well bottom flow pressure, water injection well bottom flow pressure, production well bottom flow pressure, WF ending time, WAG gas injection rate, and WAG water injection rate. The range of the values for each parameter is shown in Table 2, and the distribution of each parameter is illustrated in Figure 5. The applicable range for each parameter in the table was primarily based on the actual conditions of CO₂-driven oil reservoirs in China.

The objective function used in this study was the cumulative oil production, which is the output obtained by simulating the monthly production for each combination using the numerical simulation model. Figure 6 and Figure 7 respectively illustrate the cumulative oil production curve and the distribution of the cumulative oil production. Based on the data and the accompanying figures, it can be observed that the minimum cumulative oil production was 10⁴ m³, while the maximum cumulative oil production was 7.2 × 10⁵ m³. The majority of the distribution fell within the range of 0–10⁵ m³ of oil production.

4. Machine Learning Model Preference

4.1. Correlation Analysis

By observing the results of the correlation analysis in Figure 8, it can be concluded that there are strong linear correlations between cumulative oil production in CO₂-EOR and geological parameters, fluid parameters, phase saturation parameters, and injection/production parameters. The discovery of these correlations is significant for gaining a deeper understanding of reservoir characteristics and for optimizing the CO₂-EOR process.

From the perspective of the geological parameters (Figure 8a), there was a positive correlation between cumulative oil production in CO₂-EOR and certain factors. Notably, there were strong linear correlations between the cumulative oil production and the spacing in the I, J, and K directions, which had correlation coefficients of 0.748, 0.748, and 0.327, respectively. This indicates that the spacing in these directions significantly influences the oil production during the CO₂-EOR process. Additionally, the porosity and permeability showed correlations with the cumulative oil production in CO₂-EOR, which yielded correlation coefficients of 0.258 and 0.211, respectively. This suggests that, as porosity and permeability increase, the cumulative oil production in CO₂-EOR also increases. Porosity represents the void space in the reservoir, while permeability reflects the capacity for fluid flow within the reservoir. Higher porosity and permeability values indicate larger effective storage capacity and better fluid migration capability, thereby enabling CO₂ to react more fully with crude oil, which increases the cumulative oil production.

From the perspective of the fluid parameters and phase saturation parameters (Figure 8a,b), there was a weak correlation with the cumulative oil production in CO₂-EOR. For instance, in the fluid parameters, the correlation coefficients for the residual oil saturation index, gas specific gravity, and phase mixing parameter were 0.011, −0.007, and −0.008, respectively. Similarly, in the phase permeability parameters, the correlation coefficients for the residual gas saturation, residual oil saturation in the oil–water system, and residual oil saturation in the gas–liquid system were 0.006, −0.004, and −0.012, respectively. These correlation coefficients being close to zero indicate that there is a weak linear relationship between the phase permeability parameters and the cumulative oil production in CO₂-EOR.

Furthermore, from the perspective of the injection–production parameters (Figure 8d), there was a strong linear correlation between the CO₂-EOR cumulative oil production and CO₂-WAG injection volume. The correlation coefficient for the CO₂-WAG injection volume was 0.187. This indicates that increasing the CO₂-WAG injection volume can effectively enhance the displacement efficiency of the CO₂ and increase oil production in the reservoir. Optimizing these parameters can lead to more efficient oil recovery in the CO₂-EOR process.

4.2. Machine Learning Model Building

The dataset was split into 70% for training and 30% for testing. Eight machine learning models, including linear regression, ridge regression, decision tree, random forest, gradient boosting decision tree, extreme gradient boosting, K-nearest neighbors, and the neural network, were established. By comparing their accuracies, the model with the highest accuracy was selected as the optimal model.

To evaluate the prediction accuracy of the machine learning models, the coefficient of determination (R²) was selected as the metric [43]. The R² value ranges from 0 to 1, with a higher value indicating a better fit of the model. The specific formula to calculate R² is as follows:

R^{2} = 1 - \frac{\sum_{i = 1}^{K} {(\hat{y_{i}} - y_{i})}^{2}}{\sum_{i = 1}^{K} {(\bar{y_{i}} - y_{i})}^{2}}

(5)

where

\bar{y_{i}}

is the mean value of

y_{i}

.

Figure 9 presents the scatter plots of the predicted results versus the true results for the eight predictive models investigated in this study. The corresponding coefficient of determination (R²) values for the selected predictive models are shown in Table 3. Among them, the linear regression, ridge regression, K-nearest neighbors, and decision tree models exhibited scattered predicted points and true points around the 45-degree line, thereby indicating poor predictive performance, with test R² values of 0.95, 0.81, 0.79, and 0.91, respectively. In contrast, the extreme gradient boosting (XGBoost) model showed a concentration of predicted points and true points along the 45-degree line, with a high test R² value of 0.98, thereby indicating a low error and good predictive performance.

Based on the results, it can be observed that, among the eight machine learning methods, the extreme gradient boosting (XGBoost) model exhibited the best predictive performance. It achieved a training R² of 0.99 and a test R² of 0.98. This model is suitable for use as a predictive optimization model to optimize the injection and production parameters, as well as to predict cumulative oil production. Table 4 provides the hyperparameters of the XGBoost predictive model.

5. CO₂-EOR Parameter Optimization

5.1. Coupling of PSO Optimization and XGBoost Model

Particle swarm optimization (PSO) is a widely recognized metaheuristic algorithm that is known for its ability to effectively explore the solution space and find global optima by simulating the collective behavior of a swarm of particles. In this study, PSO was used to search for the optimal combination of CO₂-WAG parameters that maximizes the cumulative oil production. When applying the PSO algorithm, a trained XGBoost model was used to evaluate the suitability of a large number of project design parameters. With the aid of the surrogate model, the computational burden of the optimization procedure was significantly reduced, thereby allowing for more iterations of the PSO algorithm. The parameters of the final PSO model, along with the XGBoost parameters, are provided in Table 5, and the optimization process is depicted in Figure 10.

5.2. Production Prediction and Parameter Optimization

In the process of exploiting reservoirs using CO₂-EOR technology, various operational and injection/production parameters have an impact on the cumulative oil production. Therefore, the XGBoost-PSO optimization model was employed to optimize the operational parameters, thereby aiming to enhance the cumulative oil production and recovery factor of the reservoir. During the optimization process, a set of key parameters was considered, including water injection well bottom flow pressure, gas injection well bottom flow pressure, production well bottom flow pressure, WAG gas injection, and WAG water injection.

By optimizing these parameters, the final optimization results were obtained, and they are shown in Table 6. Figure 11 and Figure 12 illustrate the optimized cumulative oil production and daily oil production, respectively. From the figures, it is evident that the cumulative oil production and daily oil production under the CO₂-WAG method were significantly higher than under the WF method. This finding indicates the immense potential of CO₂-EOR technology in improving oil recovery. The optimized cumulative oil production successfully increased from 425,916 m³ to 475,047 m³. This implies that, by optimizing the operational parameters, the oil production potential of the reservoir can be further enhanced.

By establishing the XGBoost-PSO optimization model and optimizing the operational parameters, this process can provide reliable guidance and decision support for reservoir development. This optimization model not only improved the cumulative oil production and recovery factor, but also provides crucial support for the long-term sustainable development of the oilfield. Therefore, further research and optimization of this model are necessary to further enhance the efficiency and benefits of reservoir development.

6. Discussion and Conclusions

Machine learning is a popular research method used in data processing, and this study utilized a predictive model that has significant room for improvement. In this study, we used the XGBoost model, which allows for the customization of parameters such as the number of hidden layers, the number of neurons, and the learning rate to suit specific needs. Additionally, these hyperparameters can be adjusted to enhance the model’s predictive accuracy. Optimization techniques such as PSO or GA can also be introduced to further strengthen the model’s hyperparameters. Furthermore, the evaluation and optimization in this study only considered the cumulative production, daily production, and recovery rate as the objective functions. Future research can consider incorporating other metrics such as net present value (NPV) as objective functions.

This study utilized the XGBoost machine learning algorithm to establish a workflow for evaluating the cumulative gas production in CO₂-EOR modeling. This workflow was used for capacity prediction and parameter optimization in CO₂-EOR. The following conclusions were drawn:

(1): Compared to traditional simulation and prediction methods, machine learning approaches can effectively handle reservoir data and address non-linear problems. By incorporating multiple factors such as geology and operations, they significantly improve the efficiency and accuracy of the models.
(2): The investigation of the correlation between various factors and the cumulative oil production reveals that, from a geological perspective, there is a strong linear correlation between the porosity and permeability with the CO₂-EOR cumulative oil production. From an injection/production parameter perspective, there is a strong linear correlation between the CO₂-WAG gas injection rate and the CO₂-EOR cumulative oil production.
(3): Different machine learning models exhibited varying performance results in predicting production. By comparing eight different production prediction models, it can be concluded that the extreme gradient boosting (XGBoost) model outperforms other machine learning models in terms of predictive performance. The XGBoost model achieved an R² score of 0.99 on the training set and 0.98 on the testing set.
(4): The cumulative oil production, daily oil production, and recovery factor under the CO₂-WAG method were significantly higher than those under the WF method. This finding suggests that CO₂-EOR technology has great potential in improving the recovery factor of oil reservoirs.
(5): During the optimization of the CO₂-EOR parameters, PSO was coupled with the trained XGBoost model. PSO efficiently searches the parameter space to find the optimal CO₂-EOR parameters that maximize the cumulative oil production, thus saving computational costs. The optimized parameters resulted in a higher cumulative oil production and recovery factor when compared to previous results.

Author Contributions

Conceptualization, M.G. and Z.L.; methodology, M.G. and W.L. (Weirong Li); software, S.Q.; validation, Z.L., W.L. (Wanlu Liu) and H.Y.; formal analysis, W.L. (Wanlu Liu); investigation, J.C.; resources, H.Y.; data curation, J.C.; writing—original draft preparation, S.Q.; writing—review and editing, M.G. and W.L. (Weirong Li); visualization, J.C.; supervision, M.G.; project administration, M.G.; funding acquisition, M.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Major Science and Technology project of the CNPC in China (grant No. 2021ZZ01-03 and No. 2021ZZ01-06).

Data Availability Statement

The raw/processed data required to reproduce these findings cannot be shared at this time, as the data also forms part of an ongoing study.

Acknowledgments

The authors are grateful for the financial support of the CNPC in China.

Conflicts of Interest

The authors declare that the publication of this paper has no conflict of interest.

References

Kondori, J.; Miah, M.I.; Zendehboudi, S.; Khan, F.; Heagle, D. Hybrid Connectionist Models to Assess Recovery Performance of Low Salinity Water Injection. J. Pet. Sci. Eng. 2021, 197, 107833. [Google Scholar] [CrossRef]
Guo, J.-X.; Huang, C.; Wang, J.-L.; Meng, X.-Y. Integrated Operation for the Planning of CO₂ Capture Path in CCS–EOR Project. J. Pet. Sci. Eng. 2020, 186, 106720. [Google Scholar] [CrossRef]
Allahyarzadeh Bidgoli, A.; Hamidishad, N.; Yanagihara, J.I. The Impact of Carbon Capture Storage and Utilization on Energy Efficiency, Sustainability, and Production of an Offshore Platform: Thermodynamic and Sensitivity Analyses. J. Energy Resour. Technol. 2022, 144, 112102. [Google Scholar] [CrossRef]
Song, C. Global Challenges and Strategies for Control, Conversion and Utilization of CO₂ for Sustainable Development Involving Energy, Catalysis, Adsorption and Chemical Processing. Catal. Today 2006, 115, 2–32. [Google Scholar] [CrossRef]
Jiang, J.; Rui, Z.; Hazlett, R.; Lu, J. An Integrated Technical-Economic Model for Evaluating CO₂ Enhanced Oil Recovery Development. Appl. Energy 2019, 247, 190–211. [Google Scholar] [CrossRef]
Kong, S.; Huang, X.; Li, K.; Song, X. Adsorption/Desorption Isotherms of CH₄ and C₂H₆ on Typical Shale Samples. Fuel 2019, 255, 115632. [Google Scholar] [CrossRef]
Huang, X.; Gu, L.; Li, S.; Du, Y.; Liu, Y. Absolute Adsorption of Light Hydrocarbons on Organic-Rich Shale: An Efficient Determination Method. Fuel 2022, 308, 121998. [Google Scholar] [CrossRef]
Tang, Y.; Hou, C.; He, Y.; Wang, Y.; Chen, Y.; Rui, Z. Review on Pore Structure Characterization and Microscopic Flow Mechanism of CO₂ Flooding in Porous Media. Energy Technol. 2021, 9, 2000787. [Google Scholar] [CrossRef]
Kulkarni, M.M.; Rao, D.N. Experimental Investigation of Miscible and Immiscible Water-Alternating-Gas (WAG) Process Performance. J. Pet. Sci. Eng. 2005, 48, 1–20. [Google Scholar] [CrossRef]
Karimaie, H.; Nazarian, B.; Aurdal, T.; Nøkleby, P.H.; Hansen, O. Simulation Study of CO₂ EOR and Storage Potential in a North Sea Reservoir. Energy Procedia 2017, 114, 7018–7032. [Google Scholar] [CrossRef]
Tang, R.; Wang, H.; Yu, H.; Wang, W.; Chen, L. Effect of Water and Gas Alternate Injection on CO₂ Flooding. Fault Block Oil Gas Field 2016, 23, 358–362. [Google Scholar] [CrossRef]
Sanchez, N.L. Management of Water Alternating Gas (WAG) Injection Projects; OnePetro: Richardson, TX, USA, 1999. [Google Scholar]
Christensen, J.R.; Stenby, E.H.; Skauge, A. Review of WAG Field Experience. SPE Reserv. Eval. Eng. 2001, 4, 97–106. [Google Scholar] [CrossRef]
Al-Bayati, D.; Saeedi, A.; Myers, M.; White, C.; Xie, Q.; Clennell, B. Insight Investigation of Miscible SCCO₂ Water Alternating Gas (WAG) Injection Performance in Heterogeneous Sandstone Reservoirs. J. CO₂ Util. 2018, 28, 255–263. [Google Scholar] [CrossRef]
Sun, X.; Liu, J.; Dai, X.; Wang, X.; Yapanto, L.M.; Zekiy, A.O. On the Application of Surfactant and Water Alternating Gas (SAG/WAG) Injection to Improve Oil Recovery in Tight Reservoirs. Energy Rep. 2021, 7, 2452–2459. [Google Scholar] [CrossRef]
Pancholi, S.; Negi, G.S.; Agarwal, J.R.; Bera, A.; Shah, M. Experimental and Simulation Studies for Optimization of Water–Alternating-Gas (CO₂) Flooding for Enhanced Oil Recovery. Pet. Res. 2020, 5, 227–234. [Google Scholar] [CrossRef]
Ren, B.; Duncan, I.J. Maximizing Oil Production from Water Alternating Gas (CO₂) Injection into Residual Oil Zones: The Impact of Oil Saturation and Heterogeneity. Energy 2021, 222, 119915. [Google Scholar] [CrossRef]
Khather, M.; Yekeen, N.; Al-Yaseri, A.; Al-Mukainah, H.; Giwelli, A.; Saeedi, A. The Impact of Wormhole Generation in Carbonate Reservoirs on CO₂-WAG Oil Recovery. J. Pet. Sci. Eng. 2022, 212, 110354. [Google Scholar] [CrossRef]
Ren, D.; Wang, X.; Kou, Z.; Wang, S.; Wang, H.; Wang, X.; Tang, Y.; Jiao, Z.; Zhou, D.; Zhang, R. Feasibility Evaluation of CO₂ EOR and Storage in Tight Oil Reservoirs: A Demonstration Project in the Ordos Basin. Fuel 2023, 331, 125652. [Google Scholar] [CrossRef]
Rodrigues, H.; Mackay, E.; Arnold, D.; Silva, D. Optimization of CO₂-WAG and Calcite Scale Management in Pre-Salt Carbonate Reservoirs; OnePetro: Richardson, TX, USA, 2019. [Google Scholar]
Sen, D.; Chen, H.; Datta-Gupta, A. Inter-Well Connectivity Detection in CO₂ WAG Projects Using Statistical Recurrent Unit Models. Fuel 2022, 311, 122600. [Google Scholar] [CrossRef]
Imani, G. Machine Learning-Assisted Prediction of Oil Production and CO₂ Storage Effect in CO₂-Water-Alternating-Gas Injection (CO₂-WAG). Appl. Sci. 2022, 12, 958. [Google Scholar] [CrossRef]
He, R.; Ma, W.; Ma, X.; Liu, Y. Modeling and Optimizing for Operation of CO₂-EOR Project Based on Machine Learning Methods and Greedy Algorithm. Energy Rep. 2021, 7, 3664–3677. [Google Scholar] [CrossRef]
Mohagheghian, E.; James, L.A.; Haynes, R.D. Optimization of Hydrocarbon Water Alternating Gas in the Norne Field: Application of Evolutionary Algorithms. Fuel 2018, 223, 86–98. [Google Scholar] [CrossRef]
You, J.; Ampomah, W.; Tu, J.; Morgan, A.; Sun, Q.; Wei, B.; Wang, D. Optimization of Water-Alternating-CO₂ Injection Field Operations Using a Machine-Learning-Assisted Workflow. SPE Reserv. Eval. Eng. 2022, 25, 214–231. [Google Scholar] [CrossRef]
Jaber, A.K. Genetic Algorithm to Optimize Miscible Water Alternate CO₂ Flooding in Heterogeneous Clastic Reservoir. Arab. J. Geosci. 2022, 15, 714. [Google Scholar] [CrossRef]
Dehghan, M.H.; Hamidi, F.; Salajegheh, M. Study of Linear Regression Based on Least Squares and Fuzzy Least Absolutes Deviations and Its Application in Geography. In Proceedings of the 2015 4th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS), Zahedan, Iran, 9–11 September 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1–6. [Google Scholar]
Kavitha, S.; Varuna, S.; Ramya, R. A Comparative Analysis on Linear Regression and Support Vector Regression. In Proceedings of the 2016 Online International Conference on Green Engineering and Technologies (IC-GET), Coimbatore, India, 19 November 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–5. [Google Scholar]
Dorugade, A.V. New Ridge Parameters for Ridge Regression. J. Assoc. Arab. Univ. Basic Appl. Sci. 2014, 15, 94–99. [Google Scholar] [CrossRef]
Wang, Y.; Xia, S.-T. Unifying Attribute Splitting Criteria of Decision Trees by Tsallis Entropy. In Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 2507–2511. [Google Scholar]
Gamal, H.; Alsaihati, A.; Elkatatny, S.; Haidary, S.; Abdulraheem, A. Rock Strength Prediction in Real-Time While Drilling Employing Random Forest and Functional Network Techniques. J. Energy Resour. Technol. 2021, 143, 093004. [Google Scholar] [CrossRef]
Brieman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Jordan, M.I.; Mitchell, T.M. Machine Learning: Trends, Perspectives, and Prospects. Science 2015, 349, 255–260. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13 August 2016; Association for Computing Machinery: New York, NY, USA, 2016; pp. 785–794. [Google Scholar]
Peterson, L. K-Nearest Neighbor. Scholarpedia 2009, 4, 1883. [Google Scholar] [CrossRef]
Abiodun, O.I.; Jantan, A.; Omolara, A.E.; Dada, K.V.; Mohamed, N.A.; Arshad, H. State-of-the-Art in Artificial Neural Network Applications: A Survey. Heliyon 2018, 4, e00938. [Google Scholar] [CrossRef]
Kennedy, J.; Eberhart, R. Particle Swarm Optimization. In Proceedings of the ICNN’95—International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; IEEE: Piscataway, NJ, USA, 1995; Volume 4, pp. 1942–1948. [Google Scholar]
Xu, Y.; Zhao, X.; Chen, Y.; Yang, Z. Research on a Mixed Gas Classification Algorithm Based on Extreme Random Tree. Appl. Sci. 2019, 9, 1728. [Google Scholar] [CrossRef]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. In Proceedings of the Advances in Neural Information Processing Systems, Stateline, NV, USA, 3–6 December 2012; Curran Associates, Inc.: Red Hook, NY, USA, 2012; Volume 25. [Google Scholar]
Song, K.; Yan, F.; Ding, T.; Gao, L.; Lu, S. A Steel Property Optimization Model Based on the XGBoost Algorithm and Improved PSO. Comput. Mater. Sci. 2020, 174, 109472. [Google Scholar] [CrossRef]
Eberhart; Shi, Y. Particle Swarm Optimization: Developments, Applications and Resources. In Proceedings of the 2001 Congress on Evolutionary Computation, Seoul, Republic of Korea, 27–30 May 2001; IEEE Cat. No.01TH8546. IEEE: Piscataway, NJ, USA, 2001; Volume 1, pp. 81–86. [Google Scholar]
Andalib Sahnehsaraei, M.; Mahmoodabadi, M.J.; Taherkhorsandi, M.; Castillo-Villar, K.K.; Mortazavi Yazdi, S.M. A Hybrid Global Optimization Algorithm: Particle Swarm Optimization in Association with a Genetic Algorithm. In Complex System Modelling and Control through Intelligent Soft Computations; Zhu, Q., Azar, A.T., Eds.; Studies in Fuzziness and Soft Computing; Springer International Publishing: Cham, Switzerland, 2015; Volume 319, pp. 45–86. ISBN 978-3-319-12882-5. [Google Scholar]
Ottah, D.G.; Ikiensikimama, S.S.; Matemilola, S.A. Aquifer Matching With Material Balance Using Particle Swarm Optimization Algorithm—PSO. In Proceedings of the SPE Nigeria Annual International Conference and Exhibition, Lagos, Nigeria, 4 August 2015; p. SPE-178319-MS. [Google Scholar]

Figure 1. The structure of the XGBoost algorithm [40].

Figure 2. Particle swarm optimization Process.

Figure 3. Workflow diagram.

Figure 4. Schematic of the reservoir model in the feature model.

Figure 5. Distribution of each parameter. The red color represents geological parameters, the orange color represents fluid parameters, the green color represents phase saturation parameters, and the blue color represents injection/production parameters.

Figure 6. Cumulative oil production curve.

Figure 7. Cumulative oil production distribution curve.

Figure 8. Correlation analysis of parameters with cumulative oil production.

Figure 9. Model performance for each model using training and test sets.

Figure 10. Optimization workflow.

Figure 11. Comparison of cumulative oil production before and after optimization.

Figure 12. Comparison of daily oil production before and after optimization.

Table 1. Basic parameters of the feature model.

Parameter Type	Parameters	Value	Unit
Geological parameters	Initial pressure	20	MPa
	Temperature	70	°C
	Porosity	0.2	/
	Permeability	30	mD
	Spacing in the I direction	10	m
	Spacing in the J direction	10	m
	Spacing in the K direction	4	m
Fluid parameters	Oil density	799.2	kg/m³
	Gas specific gravity	0.70	/
	Residual oil saturation index	86.10	/
	Oil viscosity	7.67	mPa·s
	Water saturation	0.30	/
	Oil saturation	0.70	/
	Phase mixing parameter	0.70	/
Phase saturation parameters	Residual water saturation	0.3	/
	Residual oil saturation in oil-water system	0.2	/
	Residual oil saturation in gas-liquid system	0.15	/
	Residual gas saturation	0.15	/
Injection/production parameters	Gas injection well bottom flow pressure	30	MPa
	Water injection well bottom flow pressure	21	MPa
	Production well bottom flow pressure	5	MPa
	WF ending time	3650	Day
	WAG gas injection	60	m³/day
	WAG water injection	60	m³/day

Table 2. Range of values for each parameter.

Parameters [16,17,22]	Minimum Value	Maximum Value	Unit	Symbol in Figure 5
Initial pressure	15	25	MPa	Initial pressure
Temperature	45.00	120.00	°C	Temperature
Porosity	0.15	0.25	/	Por
Permeability	3.5	240.0	mD	PERMI
Spacing in the I direction	5	20	m	Di
Spacing in the J direction	5	20	m	Dj
Spacing in the K direction	2	5	m	Dk
Oil density	700	900	kg/m³	Oil density
Gas specific gravity	0.53	0.87	/	Gas gravity
Residual oil saturation index	10.00	200.00	/	Rsi
Oil viscosity	0.15	15.00	mPa·s	Viso
Water saturation	0.20	0.40	/	Sw
Oil saturation	0.60	0.80	/	So
Residual gas saturation	0.50	0.85	/	Omegas
Residual water saturation	0.20	0.40	/	SWCON
Residual oil saturation in oil-water system	0.15	0.25	/	SOIRW
Residual oil saturation in gas-liquid system	0.10	0.20	/	SORG
Residual gas saturation	0.10	0.20	/	SGCON
Gas injection well bottom flow pressure	22	38	MPa	INJG BHP
Water injection well bottom flow pressure	20	40	MPa	INJW BHP
Production well bottom flow pressure	3	5	MPa	Prod BHP
WF ending time	2700	4600	Day	WF Ending time
WAG gas injection	45.00	180.00	m³/day	WAG INJG
WAG water injection	45.00	180.00	m³/day	WAG INJW

Table 3. Comparison of predictive performance of the models.

Machine Learning Algorithms	Train R²	Test R²
Linear Regression	0.82	0.75
Ridge Regression	0.82	0.81
Decision Tree	1.00	0.91
Random Forest	0.97	0.96
K Nearest Neighbors	0.80	0.79
Neural Network	0.96	0.96
Gradient Boosting Decision Tree	0.97	0.96
XGBoost	0.99	0.98

Table 4. Machine learning XGBoot model hyperparameters.

Model Hyperparameters	Minimum Value	Maximum Values
Number of boosting stages	10	500
Learning rate	0.01	0.2
Maximum depth of tree	3	10
Gamma	0	0.4
Minimum child weight	1	5
Subsample	0.5	1
Colsample by tree	0.5	1

Table 5. Hyperparameters of PSO algorithm and XGBoost model.

Parameters in PSO Algorithm	Value
Population number group size	15
Maximum number of iterations maximum	50
Inertia weight (ω)	0.8
Learning factor (c1)	2
Learning factor (c2)	2

Table 6. Optimization of basic parameters of CO₂-EOR.

Optimization Parameters	WF	WAG	Optimization of WAG	Unit
Gas injection well bottom flow pressure	28.50	28.50	25.33	MPa
Water injection well bottom flow pressure	21	21	33.76	MPa
Production well bottom flow pressure	4.00	4.00	4.29	MPa
WF ending time	3777	3777	3533	day
WAG gas injection	0	144	120	m³/day
WAG water injection	174	174	66.71	m³/day
Cumulative oil production	319,234	425,916	475,047	m³

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gao, M.; Liu, Z.; Qian, S.; Liu, W.; Li, W.; Yin, H.; Cao, J. Machine-Learning-Based Approach to Optimize CO₂-WAG Flooding in Low Permeability Oil Reservoirs. Energies 2023, 16, 6149. https://doi.org/10.3390/en16176149

AMA Style

Gao M, Liu Z, Qian S, Liu W, Li W, Yin H, Cao J. Machine-Learning-Based Approach to Optimize CO₂-WAG Flooding in Low Permeability Oil Reservoirs. Energies. 2023; 16(17):6149. https://doi.org/10.3390/en16176149

Chicago/Turabian Style

Gao, Ming, Zhaoxia Liu, Shihao Qian, Wanlu Liu, Weirong Li, Hengfei Yin, and Jinhong Cao. 2023. "Machine-Learning-Based Approach to Optimize CO₂-WAG Flooding in Low Permeability Oil Reservoirs" Energies 16, no. 17: 6149. https://doi.org/10.3390/en16176149

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine-Learning-Based Approach to Optimize CO₂-WAG Flooding in Low Permeability Oil Reservoirs

Abstract

1. Introduction