Next Article in Journal
Solar Energy Harnessing Technologies towards De-Carbonization: A Systematic Review of Processes and Systems
Next Article in Special Issue
Mechanism and Quantitative Characterization of Wettability on Shale Surfaces: An Experimental Study Based on Atomic Force Microscopy (AFM)
Previous Article in Journal
Retrofit Analysis of City-Scale Residential Buildings in the Hot Summer and Cold Winter Climate Zone
Previous Article in Special Issue
Effect of H2O Content on the Corrosion Behavior of X52 Steel in Supercritical CO2 Streams Containing O2, H2S, SO2 and NO2 Impurities
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine-Learning-Based Approach to Optimize CO2-WAG Flooding in Low Permeability Oil Reservoirs

1
PetroChina Research Institute of Petroleum Exploration & Development, Beijing 100083, China
2
State Key Laboratory of Enhanced Oil and Gas Recovery, Beijing 100083, China
3
Department of Petroleum Engineering, Xi’an Shiyou University, Xi’an 710065, China
*
Author to whom correspondence should be addressed.
Energies 2023, 16(17), 6149; https://doi.org/10.3390/en16176149
Submission received: 31 July 2023 / Revised: 14 August 2023 / Accepted: 19 August 2023 / Published: 24 August 2023

Abstract

:
One of the main applications of carbon capture, utilization, and storage (CCUS) technology in the industry is carbon-dioxide-enhanced oil recovery (CO2-EOR). However, accurately and rapidly assessing their application potential remains a major challenge. In this study, a numerical model of the CO2-WAG technique was developed using the reservoir numerical simulation software CMG (Version 2021), which is widely used in the field of reservoir engineering. Then, 10,000 different reservoir models were randomly generated using the Monte Carlo method for numerical simulations, with each having different formation physical parameters, fluid parameters, initial conditions, and injection and production parameters. Among them, 70% were used as the training set and 30% as the test set. A comprehensive analysis was conducted using eight different machine learning regression methods to train and evaluate the dataset. After evaluation, the XGBoost algorithm emerged as the top-performing method and was selected as the optimal approach for the prediction and optimization. By integrating the production prediction model with a particle swarm optimizer (PSO), a workflow for optimizing the CO2-EOR parameters was developed. This process enables the rapid optimization of the CO2-EOR parameters and the prediction of the production for each period based on cumulative production under different geological conditions. The proposed XGBoost-PSO proxy model accurately, reliably, and efficiently predicts production, thereby making it an important tool for optimizing CO2-EOR design.

1. Introduction

Oil resources have always been the primary source of fossil energy for global energy demand. However, extracting the remaining oil from complex reservoir formations remains a challenge, thereby making it increasingly important to enhance extraction efficiency [1]. In addition, the use of fossil fuels leads to the emission of a large amount of CO2, which is a greenhouse gas, thereby resulting in global climate change, which has become a major challenge facing the world today [2]. Carbon capture, utilization, and storage (CCUS) technology is considered to be an important approach to mitigating global climate change [3]. Based on the challenges mentioned above, CO2-enhanced oil recovery (CO2-EOR) technology is a method within CCUS that can enhance oil recovery by using the CO2 in a miscible displacement process and effectively sequestering CO2 in the lower portion of the reservoir. This technology has relatively low requirements for the purity of the CO2 and allows for the recycling of CO2, thereby reducing process costs [4]. CO2-EOR, with its potential to significantly enhance oil recovery while achieving carbon capture, utilization, and storage, represents an EOR method that offers both societal and economic benefits.
The injection of CO2 is applied to enhance the oil recovery (EOR) due to its superior capabilities in improving the fluid properties under reservoir conditions. The fundamental mechanisms of CO2-EOR include reducing interfacial tension (IFT), lowering oil viscosity, oil swelling, and light hydrocarbon extraction. These mechanisms contribute to the enhanced recovery of the oil in reservoirs [5,6,7]. Compared to other gases such as natural gas, air, and nitrogen (N2), carbon dioxide (CO2) has a lower minimum miscibility pressure (MMP) than oil. Therefore, selecting CO2 as the injection gas for displacing oil can achieve better miscibility and more effectively recover the oil [8]. In addition to continuous CO2 injection for miscible displacement, the CO2-WAG (water alternating gas) technique has been proposed to improve the flowability of CO2 in the reservoir and to prevent CO2 fingering. This technique involves alternating injections of CO2 and water, which enhance the efficiency of the CO2 propagation and oil displacement [9,10]. The CO2-WAG technique was first used by Mobil Corporation in 1957 in a sandstone reservoir in Alberta, Canada. In addition, the CO2-WAG technique alleviates the issue of rapid CO2 breakthrough and increases the resistance to gas phase flow. It also reduces the resistance to the water phase flow and increases the mobility ratio [11]. According to surveys, the WAG technique has achieved significant success and has been employed in 80% of the oilfield projects in the United States, thereby demonstrating its superiority in enhancing oilfield development and improving oil recovery [12]. Christensen et al. [13] conducted a study on 59 WAG fields and found that, in all WAG cases, the average oil recovery rate increased by 10%. This demonstrates the positive impact and effectiveness of WAG technology in enhancing oil field production. Al-Bayati et al. [14] investigated the impact of core-scale heterogeneity on the oil recovery efficiency of CO2-WAG injection. The research findings indicated that CO2-WAG injection exhibited better performance in homogeneous, layered, and composite samples. Sun et al. [15] investigated the feasibility of the CO2 phase through porous media in WAG injection scenarios and successfully increased the oil recovery factor (RF) by approximately 46%. The gas-to-water injection ratio was identified as a crucial parameter affecting the efficiency of water-gas alternating injection [16,17]. Khather et al. [18] investigated the impact of CO2–carbonate interaction on the oil and gas recovery in three heterogeneous carbonate rock core samples with different initial oil saturations (low and moderate permeability). Overall, CO2-WAG injection after water flooding resulted in an increase in the recovery factor of over 30% for the three rock cores. Ren et al. [19] conducted experiments on CO2-EOR and storage in oilfields in the Ordos Basin in China using two CO2 injection schemes: continuous injection (CI) and water alternating gas (WAG) injection. The results showed that the equal injection of CO2 and WAG significantly increased the crude oil production.
Currently, the optimization of CO2-EOR technology is a focal point of attention for many oilfield and reservoir engineers. Rodrigues et al. [20] utilized the CMG reservoir numerical simulation software (Version 2021) to optimize the application of WAG in a sub-salt offshore oilfield in Brazil. They proposed a CO2-WAG operational design method suitable for carbonate reservoirs, with a focus on economic viability, CO2 recycling efficiency, and project risks. However, traditional parameter optimization methods are time-consuming and labor-intensive. They tend to overlook complex nonlinear relationships and the underlying influencing factors. Moreover, these methods are often based on specific models and algorithms, thus lacking the flexibility to adapt to different oilfield situations and variations. They have certain limitations and lack adaptability. Therefore, the introduction of more advanced optimization techniques, such as machine learning and metaheuristic algorithms, can better address the complexity and uncertainty of oilfields, thereby enhancing the optimization efficiency and accuracy.
At the current stage, rapidly evolving intelligent algorithms, such as machine learning, have found significant applications in the field of petroleum exploration and development. Sen et al. [21] employed a Specialized RNN Unit (SRU) model, which is a type of recurrent neural network (RNN), to optimize the parameters and predict the production in actual CO2-EOR projects. The injection rate, injection pressure, cumulative injection volume of the injection wells, and bottom hole flowing pressure of the production wells were used as inputs for the SRU model, while the fluid production of the production wells served as the output. Li et al. [22] utilized the random forest (RF) regression algorithm to predict the performance of the CO2-WAG technique, including oil well production, CO2 storage volume, and CO2 storage efficiency. The CO2-WAG cycle, CO2 injection rate, and water-gas ratio were identified as the main injection parameters. The prediction results showed a close approximation between the predicted values and the actual values in the test set. The average absolute prediction deviations for cumulative oil production, CO2 storage volume, and CO2 storage efficiency were 1.10%, 3.04%, and 2.24%, respectively. He et al. [23] proposed an optimization workflow for CO2-EOR operations based on machine learning methods and heuristic optimization algorithms. Their workflow included a power consumption prediction using a Gaussian process regression (GPR) model, which combines a nonlinear autoregressive neural network with external inputs (NARX) model for oil production prediction and an operational optimization model. The optimization results were significant; the optimization parameters used included the duration of the water/gas alternating injection cycles, the bottom hole pressure of the production wells, and the injection rate of water.
Some researchers, in order to swiftly explore the solution space and find the global optimal solution, have combined metaheuristic algorithms with machine learning. By harnessing the predictive capability of machine learning to guide the search process of metaheuristic algorithms, they can quickly identify the optimal solution and achieve better results in parameter optimization. In 2018, Mohagheghia et al. [24] utilized a robust evolutionary algorithm to automatically optimize the performance of the hydrocarbon WAG technique used in the E segment of the Norne oilfield. They employed the net present value (NPV) as the objective function and two global semi-random search strategies, namely, the genetic algorithm (GA) and particle swarm optimization (PSO). Parameters such as the water injection volume, gas injection volume, bottom hole pressure of producing wells, cycle ratio, cycle duration, injected hydrocarbon gas fraction, and total WAG cycle were optimized. You et al. [25] combined Gaussian-SVR (support vector regression) with a Gaussian kernel to construct a surrogate model, and the hyperparameters of the surrogate model were optimized using Bayesian optimization. The trained surrogate model was then coupled with a multi-objective particle swarm optimization (MOPSO) protocol. This approach was used to optimize the complex CO2-WAG process, which involves many control parameters. The optimization parameters included operational variables for controlling the CO2-WAG process, such as the duration of the water/gas alternating injection cycle, the bottom hole pressure control, and the injection rates for each well. Jaber [26] utilized the genetic algorithm (GA) technique based on the surrogate model to optimize the most influential parameters in the CO2-WAG process in the Subba-Nahr Umr reservoir. Four operational variables were considered for optimizing the CO2-WAG displacement: the CO2-to-water slug size ratio (WAG), cyclic length (CL), bottom hole pressure (BHP), and CO2 slug size (SZ). The results demonstrated that the highest incremental oil recovery (∆FOE) of 9.7% in the Subba-Nahr Umr reservoir could be achieved with a WAG ratio of 1.5, a cyclic length of 3 months, a bottom hole pressure of 2221 psi, and a CO2 slug size of 0.91. Based on the above, it can be observed that, in most cases of CO2-EOR parameter optimization, the dataset is relatively small, and the optimization objective functions often only include specific time points of production, which cannot form a complete production curve. As a result, there are limitations and particularities. Due to the lack of complete production curves and large-scale time series data, machine learning and other prediction methods may not fully leverage their advantages and may struggle to achieve global optimization results.
This study proposes a comprehensive workflow for optimizing CO2-EOR (WAG) parameters by combining reservoir numerical simulation with machine learning. In Section 2, the machine learning methods used in this study are described, along with the workflow. Section 3 focuses on establishing the geological and numerical models of the reservoir. In Section 4, the study conducted a correlation analysis of geological and operational parameters. The performance of production prediction models based on different machine learning models was evaluated, and the best machine learning model was selected. In Section 5, the selected machine learning model, combined with particle swarm optimization (PSO), was used for capacity prediction and parameter optimization. Discussions and conclusions are presented in Section 6.

2. Methods

This section describes the methodological principles and workflow of the main algorithms used in this study. Eight machine learning methods were employed to build the prediction models, including linear regression [27,28], ridge regression [29], decision tree (DT) [30], random forest (RF) [31,32], gradient boosting decision tree (GBDT) [33], extreme gradient boosting (XGBoost) [34], K-nearest neighbors (KNN) [35], and neural network (NN) [36]. This study proposes a coupled model of the machine learning algorithm XGBoost and particle swarm optimization (PSO) [37] to address the optimization problem. Therefore, the focus is on introducing the XGBoost algorithm and the particle swarm optimization algorithm (PSO).

2.1. XGBoost Algorithm

XGBoost is an expandable tree boosting system proposed by Chen et al. [34]. It is an improved version of the gradient boosting decision tree (GBDT) algorithm [38] and is widely used in classification and regression tasks. The basic idea of XGBoost is similar to GBDT, but it incorporates several optimizations, which include the following:
  • Optimizing the loss function by employing a second-order Taylor expansion to enhance computational accuracy.
  • Simplifying the model using regularization terms to avoid overfitting [39].
  • Utilizing a block storage structure to enable parallel computing and improve efficiency.
The structure of the XGBoost algorithm is illustrated in Figure 1, and the model details are described below.
Given a training dataset T = x 1 , y 1 , x 2 , y 2 , , x n , y n , a loss function l y i , y ^ i , and a regularization term Ω f k , the objective function can be expressed as follows:
L ϕ = i l y i , y ^ i + k Ω f k ,
where L ϕ is the representation in the linear space, i denotes the i -th sample, k represents the k -th tree, y ^ i  hat i is the predicted value of the i -th sample x i , and k Ω f k represents the complexity of the trees.
Due to the expression of the objective function in GBDT, we can rewrite it as follows:
y ^ i = k = 1 K f k x i = y ^ i t 1 + f t x i .
In this case, the expression of L ϕ can be transformed into the following form:
L t = i = 1 n l y i , y ^ i t 1 + f t x i + k Ω f k .

2.2. Particle Swarm Optimization (PSO)

Particle swarm optimization (PSO) is an evolutionary computation technique that was first introduced by Eberhart and Kennedy in 1995 [37]. The basic concept of PSO originates from the study of the foraging behavior in bird flocks and is a simplified model of swarm intelligence algorithms. The algorithm was initially inspired by the regular patterns observed in the movements of prey bird flocks, which led to the development of a simplified model using collective intelligence. PSO utilizes collaboration and information sharing among individuals within a swarm to search for the optimal solution [41].
Figure 2 shows the flow of the PSO algorithm, where each particle individually searches for the optimal solution in the search space. The optimal solution is recorded as the current individual extremum and shared with the other particles in the entire particle population. The particles move at a certain speed in the search space, wherein they dynamically adjust their respective speed and position according to their own flight experience and the flight experience of other particles [42].
The equation to update particle velocity in the PSO algorithm is as follows:
V n e w = ω V i d + C 1 r a n d o m ( 0,1 ) ( P i d X i d ) + C 2 r a n d o m ( 0,1 ) ( P g d X i d ) ,
where V i d is the current velocity of the particle; ω is the inertia factor (with velocity there is motion inertia); r a n d o m 0,1 is the random number generation function that generates random numbers between 0 and 1; P i d is the current position of the particle; X i d is the global best position of this particle; P g d represents the current best position among all particles in the population; and C1 and C2 denote the learning factors, which learn from the best position in the history of this particle and the best position in the population, respectively.

2.3. Workflow

As shown in Figure 3, the process of prediction and the parameters optimization of CO2-EOR can be divided into three steps:
Step 1: Numerical Model and Database Establishment. Extensive literature research is conducted to gather knowledge on optimizing CO2-EOR parameters and production profiles. The reservoir numerical simulation software CMG (Version 2021) was utilized to build the CO2-EOR numerical model. By employing the Monte Carlo method, 10,000 sets of different reservoir models were randomly generated and simulated to obtain corresponding production curves for various geological parameters, fluid parameters, relative permeability parameters, and injection/production parameters.
Step 2: Machine Learning Model Selection. Firstly, a correlation analysis was conducted to assess the relationships between different CO2-EOR parameters. Then, using the dataset generated in the first step, which consisted of 10,000 sets of diverse parameters and corresponding production curves, the machine learning models were trained and evaluated. Eight different machine learning models were employed and trained with the dataset to determine their performance in predicting CO2-EOR production. Through thorough evaluation and comparison, the XGBoost algorithm was selected as the best-performing machine learning method for this study.
Step 3: CO2-EOR Production Prediction and Parameter Optimization. The XGBoost-PSO proxy model was employed to predict CO2-EOR production and optimize the CO2-EOR parameters.

3. Establishment of Numerical Model and Database

3.1. Establishment of CO2-EOR Numerical Model

First, based on the actual geological parameters of the oilfield, a characterization model was established, which took into account factors such as well spacing, fluid properties, and heterogeneity. The numerical model consisted of a grid with dimensions of 21 × 21 × 5, with a grid spacing of 10 m in the I direction, 10 m in the J direction, and 5 m in the K direction. Therefore, the feature model had dimensions of 210 m in length, 210 m in width, and 25 m in depth. The well pattern was deployed as a 1/4 five-spot pattern, with one injector well and one producer well per pattern, as shown in Figure 4. The basic parameters of the feature model are described in Table 1.

3.2. Establishing the Database

After building the geologic model, a large dataset needs to be generated to train the predictive model built using machine learning. In this study, a numerical model was used to randomly generate cumulative production data for 10,000 sets of geological and completion parameters. This study investigated a total of 24 parameters, including geological parameters, fluid parameters, initial conditions, and injection/production parameters. The parameters included in this study are as follows. The geological parameters included the following: initial pressure, porosity, permeability, temperature, and spacing in the I, J, and K directions. The fluid parameters included the following: oil density, gas specific gravity, residual oil saturation index, water saturation, oil saturation, oil viscosity, and phase mixing parameter. The phase saturation parameters included the following: residual water saturation, residual oil saturation in the oil-water system, residual oil saturation in the gas-liquid system, and residual gas saturation. The injection/production parameters included the following: gas injection well bottom flow pressure, water injection well bottom flow pressure, production well bottom flow pressure, WF ending time, WAG gas injection rate, and WAG water injection rate. The range of the values for each parameter is shown in Table 2, and the distribution of each parameter is illustrated in Figure 5. The applicable range for each parameter in the table was primarily based on the actual conditions of CO2-driven oil reservoirs in China.
The objective function used in this study was the cumulative oil production, which is the output obtained by simulating the monthly production for each combination using the numerical simulation model. Figure 6 and Figure 7 respectively illustrate the cumulative oil production curve and the distribution of the cumulative oil production. Based on the data and the accompanying figures, it can be observed that the minimum cumulative oil production was 104 m3, while the maximum cumulative oil production was 7.2 × 105 m3. The majority of the distribution fell within the range of 0–105 m3 of oil production.

4. Machine Learning Model Preference

4.1. Correlation Analysis

By observing the results of the correlation analysis in Figure 8, it can be concluded that there are strong linear correlations between cumulative oil production in CO2-EOR and geological parameters, fluid parameters, phase saturation parameters, and injection/production parameters. The discovery of these correlations is significant for gaining a deeper understanding of reservoir characteristics and for optimizing the CO2-EOR process.
From the perspective of the geological parameters (Figure 8a), there was a positive correlation between cumulative oil production in CO2-EOR and certain factors. Notably, there were strong linear correlations between the cumulative oil production and the spacing in the I, J, and K directions, which had correlation coefficients of 0.748, 0.748, and 0.327, respectively. This indicates that the spacing in these directions significantly influences the oil production during the CO2-EOR process. Additionally, the porosity and permeability showed correlations with the cumulative oil production in CO2-EOR, which yielded correlation coefficients of 0.258 and 0.211, respectively. This suggests that, as porosity and permeability increase, the cumulative oil production in CO2-EOR also increases. Porosity represents the void space in the reservoir, while permeability reflects the capacity for fluid flow within the reservoir. Higher porosity and permeability values indicate larger effective storage capacity and better fluid migration capability, thereby enabling CO2 to react more fully with crude oil, which increases the cumulative oil production.
From the perspective of the fluid parameters and phase saturation parameters (Figure 8a,b), there was a weak correlation with the cumulative oil production in CO2-EOR. For instance, in the fluid parameters, the correlation coefficients for the residual oil saturation index, gas specific gravity, and phase mixing parameter were 0.011, −0.007, and −0.008, respectively. Similarly, in the phase permeability parameters, the correlation coefficients for the residual gas saturation, residual oil saturation in the oil–water system, and residual oil saturation in the gas–liquid system were 0.006, −0.004, and −0.012, respectively. These correlation coefficients being close to zero indicate that there is a weak linear relationship between the phase permeability parameters and the cumulative oil production in CO2-EOR.
Furthermore, from the perspective of the injection–production parameters (Figure 8d), there was a strong linear correlation between the CO2-EOR cumulative oil production and CO2-WAG injection volume. The correlation coefficient for the CO2-WAG injection volume was 0.187. This indicates that increasing the CO2-WAG injection volume can effectively enhance the displacement efficiency of the CO2 and increase oil production in the reservoir. Optimizing these parameters can lead to more efficient oil recovery in the CO2-EOR process.

4.2. Machine Learning Model Building

The dataset was split into 70% for training and 30% for testing. Eight machine learning models, including linear regression, ridge regression, decision tree, random forest, gradient boosting decision tree, extreme gradient boosting, K-nearest neighbors, and the neural network, were established. By comparing their accuracies, the model with the highest accuracy was selected as the optimal model.
To evaluate the prediction accuracy of the machine learning models, the coefficient of determination (R2) was selected as the metric [43]. The R2 value ranges from 0 to 1, with a higher value indicating a better fit of the model. The specific formula to calculate R2 is as follows:
R 2 = 1 i = 1 K y i ^ y i 2 i = 1 K y i ¯ y i 2
where y i ¯ is the mean value of y i .
Figure 9 presents the scatter plots of the predicted results versus the true results for the eight predictive models investigated in this study. The corresponding coefficient of determination (R2) values for the selected predictive models are shown in Table 3. Among them, the linear regression, ridge regression, K-nearest neighbors, and decision tree models exhibited scattered predicted points and true points around the 45-degree line, thereby indicating poor predictive performance, with test R2 values of 0.95, 0.81, 0.79, and 0.91, respectively. In contrast, the extreme gradient boosting (XGBoost) model showed a concentration of predicted points and true points along the 45-degree line, with a high test R2 value of 0.98, thereby indicating a low error and good predictive performance.
Based on the results, it can be observed that, among the eight machine learning methods, the extreme gradient boosting (XGBoost) model exhibited the best predictive performance. It achieved a training R2 of 0.99 and a test R2 of 0.98. This model is suitable for use as a predictive optimization model to optimize the injection and production parameters, as well as to predict cumulative oil production. Table 4 provides the hyperparameters of the XGBoost predictive model.

5. CO2-EOR Parameter Optimization

5.1. Coupling of PSO Optimization and XGBoost Model

Particle swarm optimization (PSO) is a widely recognized metaheuristic algorithm that is known for its ability to effectively explore the solution space and find global optima by simulating the collective behavior of a swarm of particles. In this study, PSO was used to search for the optimal combination of CO2-WAG parameters that maximizes the cumulative oil production. When applying the PSO algorithm, a trained XGBoost model was used to evaluate the suitability of a large number of project design parameters. With the aid of the surrogate model, the computational burden of the optimization procedure was significantly reduced, thereby allowing for more iterations of the PSO algorithm. The parameters of the final PSO model, along with the XGBoost parameters, are provided in Table 5, and the optimization process is depicted in Figure 10.

5.2. Production Prediction and Parameter Optimization

In the process of exploiting reservoirs using CO2-EOR technology, various operational and injection/production parameters have an impact on the cumulative oil production. Therefore, the XGBoost-PSO optimization model was employed to optimize the operational parameters, thereby aiming to enhance the cumulative oil production and recovery factor of the reservoir. During the optimization process, a set of key parameters was considered, including water injection well bottom flow pressure, gas injection well bottom flow pressure, production well bottom flow pressure, WAG gas injection, and WAG water injection.
By optimizing these parameters, the final optimization results were obtained, and they are shown in Table 6. Figure 11 and Figure 12 illustrate the optimized cumulative oil production and daily oil production, respectively. From the figures, it is evident that the cumulative oil production and daily oil production under the CO2-WAG method were significantly higher than under the WF method. This finding indicates the immense potential of CO2-EOR technology in improving oil recovery. The optimized cumulative oil production successfully increased from 425,916 m3 to 475,047 m3. This implies that, by optimizing the operational parameters, the oil production potential of the reservoir can be further enhanced.
By establishing the XGBoost-PSO optimization model and optimizing the operational parameters, this process can provide reliable guidance and decision support for reservoir development. This optimization model not only improved the cumulative oil production and recovery factor, but also provides crucial support for the long-term sustainable development of the oilfield. Therefore, further research and optimization of this model are necessary to further enhance the efficiency and benefits of reservoir development.

6. Discussion and Conclusions

Machine learning is a popular research method used in data processing, and this study utilized a predictive model that has significant room for improvement. In this study, we used the XGBoost model, which allows for the customization of parameters such as the number of hidden layers, the number of neurons, and the learning rate to suit specific needs. Additionally, these hyperparameters can be adjusted to enhance the model’s predictive accuracy. Optimization techniques such as PSO or GA can also be introduced to further strengthen the model’s hyperparameters. Furthermore, the evaluation and optimization in this study only considered the cumulative production, daily production, and recovery rate as the objective functions. Future research can consider incorporating other metrics such as net present value (NPV) as objective functions.
This study utilized the XGBoost machine learning algorithm to establish a workflow for evaluating the cumulative gas production in CO2-EOR modeling. This workflow was used for capacity prediction and parameter optimization in CO2-EOR. The following conclusions were drawn:
(1)
Compared to traditional simulation and prediction methods, machine learning approaches can effectively handle reservoir data and address non-linear problems. By incorporating multiple factors such as geology and operations, they significantly improve the efficiency and accuracy of the models.
(2)
The investigation of the correlation between various factors and the cumulative oil production reveals that, from a geological perspective, there is a strong linear correlation between the porosity and permeability with the CO2-EOR cumulative oil production. From an injection/production parameter perspective, there is a strong linear correlation between the CO2-WAG gas injection rate and the CO2-EOR cumulative oil production.
(3)
Different machine learning models exhibited varying performance results in predicting production. By comparing eight different production prediction models, it can be concluded that the extreme gradient boosting (XGBoost) model outperforms other machine learning models in terms of predictive performance. The XGBoost model achieved an R2 score of 0.99 on the training set and 0.98 on the testing set.
(4)
The cumulative oil production, daily oil production, and recovery factor under the CO2-WAG method were significantly higher than those under the WF method. This finding suggests that CO2-EOR technology has great potential in improving the recovery factor of oil reservoirs.
(5)
During the optimization of the CO2-EOR parameters, PSO was coupled with the trained XGBoost model. PSO efficiently searches the parameter space to find the optimal CO2-EOR parameters that maximize the cumulative oil production, thus saving computational costs. The optimized parameters resulted in a higher cumulative oil production and recovery factor when compared to previous results.

Author Contributions

Conceptualization, M.G. and Z.L.; methodology, M.G. and W.L. (Weirong Li); software, S.Q.; validation, Z.L., W.L. (Wanlu Liu) and H.Y.; formal analysis, W.L. (Wanlu Liu); investigation, J.C.; resources, H.Y.; data curation, J.C.; writing—original draft preparation, S.Q.; writing—review and editing, M.G. and W.L. (Weirong Li); visualization, J.C.; supervision, M.G.; project administration, M.G.; funding acquisition, M.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Major Science and Technology project of the CNPC in China (grant No. 2021ZZ01-03 and No. 2021ZZ01-06).

Data Availability Statement

The raw/processed data required to reproduce these findings cannot be shared at this time, as the data also forms part of an ongoing study.

Acknowledgments

The authors are grateful for the financial support of the CNPC in China.

Conflicts of Interest

The authors declare that the publication of this paper has no conflict of interest.

References

  1. Kondori, J.; Miah, M.I.; Zendehboudi, S.; Khan, F.; Heagle, D. Hybrid Connectionist Models to Assess Recovery Performance of Low Salinity Water Injection. J. Pet. Sci. Eng. 2021, 197, 107833. [Google Scholar] [CrossRef]
  2. Guo, J.-X.; Huang, C.; Wang, J.-L.; Meng, X.-Y. Integrated Operation for the Planning of CO2 Capture Path in CCS–EOR Project. J. Pet. Sci. Eng. 2020, 186, 106720. [Google Scholar] [CrossRef]
  3. Allahyarzadeh Bidgoli, A.; Hamidishad, N.; Yanagihara, J.I. The Impact of Carbon Capture Storage and Utilization on Energy Efficiency, Sustainability, and Production of an Offshore Platform: Thermodynamic and Sensitivity Analyses. J. Energy Resour. Technol. 2022, 144, 112102. [Google Scholar] [CrossRef]
  4. Song, C. Global Challenges and Strategies for Control, Conversion and Utilization of CO2 for Sustainable Development Involving Energy, Catalysis, Adsorption and Chemical Processing. Catal. Today 2006, 115, 2–32. [Google Scholar] [CrossRef]
  5. Jiang, J.; Rui, Z.; Hazlett, R.; Lu, J. An Integrated Technical-Economic Model for Evaluating CO2 Enhanced Oil Recovery Development. Appl. Energy 2019, 247, 190–211. [Google Scholar] [CrossRef]
  6. Kong, S.; Huang, X.; Li, K.; Song, X. Adsorption/Desorption Isotherms of CH4 and C2H6 on Typical Shale Samples. Fuel 2019, 255, 115632. [Google Scholar] [CrossRef]
  7. Huang, X.; Gu, L.; Li, S.; Du, Y.; Liu, Y. Absolute Adsorption of Light Hydrocarbons on Organic-Rich Shale: An Efficient Determination Method. Fuel 2022, 308, 121998. [Google Scholar] [CrossRef]
  8. Tang, Y.; Hou, C.; He, Y.; Wang, Y.; Chen, Y.; Rui, Z. Review on Pore Structure Characterization and Microscopic Flow Mechanism of CO2 Flooding in Porous Media. Energy Technol. 2021, 9, 2000787. [Google Scholar] [CrossRef]
  9. Kulkarni, M.M.; Rao, D.N. Experimental Investigation of Miscible and Immiscible Water-Alternating-Gas (WAG) Process Performance. J. Pet. Sci. Eng. 2005, 48, 1–20. [Google Scholar] [CrossRef]
  10. Karimaie, H.; Nazarian, B.; Aurdal, T.; Nøkleby, P.H.; Hansen, O. Simulation Study of CO2 EOR and Storage Potential in a North Sea Reservoir. Energy Procedia 2017, 114, 7018–7032. [Google Scholar] [CrossRef]
  11. Tang, R.; Wang, H.; Yu, H.; Wang, W.; Chen, L. Effect of Water and Gas Alternate Injection on CO2 Flooding. Fault Block Oil Gas Field 2016, 23, 358–362. [Google Scholar] [CrossRef]
  12. Sanchez, N.L. Management of Water Alternating Gas (WAG) Injection Projects; OnePetro: Richardson, TX, USA, 1999. [Google Scholar]
  13. Christensen, J.R.; Stenby, E.H.; Skauge, A. Review of WAG Field Experience. SPE Reserv. Eval. Eng. 2001, 4, 97–106. [Google Scholar] [CrossRef]
  14. Al-Bayati, D.; Saeedi, A.; Myers, M.; White, C.; Xie, Q.; Clennell, B. Insight Investigation of Miscible SCCO2 Water Alternating Gas (WAG) Injection Performance in Heterogeneous Sandstone Reservoirs. J. CO2 Util. 2018, 28, 255–263. [Google Scholar] [CrossRef]
  15. Sun, X.; Liu, J.; Dai, X.; Wang, X.; Yapanto, L.M.; Zekiy, A.O. On the Application of Surfactant and Water Alternating Gas (SAG/WAG) Injection to Improve Oil Recovery in Tight Reservoirs. Energy Rep. 2021, 7, 2452–2459. [Google Scholar] [CrossRef]
  16. Pancholi, S.; Negi, G.S.; Agarwal, J.R.; Bera, A.; Shah, M. Experimental and Simulation Studies for Optimization of Water–Alternating-Gas (CO2) Flooding for Enhanced Oil Recovery. Pet. Res. 2020, 5, 227–234. [Google Scholar] [CrossRef]
  17. Ren, B.; Duncan, I.J. Maximizing Oil Production from Water Alternating Gas (CO2) Injection into Residual Oil Zones: The Impact of Oil Saturation and Heterogeneity. Energy 2021, 222, 119915. [Google Scholar] [CrossRef]
  18. Khather, M.; Yekeen, N.; Al-Yaseri, A.; Al-Mukainah, H.; Giwelli, A.; Saeedi, A. The Impact of Wormhole Generation in Carbonate Reservoirs on CO2-WAG Oil Recovery. J. Pet. Sci. Eng. 2022, 212, 110354. [Google Scholar] [CrossRef]
  19. Ren, D.; Wang, X.; Kou, Z.; Wang, S.; Wang, H.; Wang, X.; Tang, Y.; Jiao, Z.; Zhou, D.; Zhang, R. Feasibility Evaluation of CO2 EOR and Storage in Tight Oil Reservoirs: A Demonstration Project in the Ordos Basin. Fuel 2023, 331, 125652. [Google Scholar] [CrossRef]
  20. Rodrigues, H.; Mackay, E.; Arnold, D.; Silva, D. Optimization of CO2-WAG and Calcite Scale Management in Pre-Salt Carbonate Reservoirs; OnePetro: Richardson, TX, USA, 2019. [Google Scholar]
  21. Sen, D.; Chen, H.; Datta-Gupta, A. Inter-Well Connectivity Detection in CO2 WAG Projects Using Statistical Recurrent Unit Models. Fuel 2022, 311, 122600. [Google Scholar] [CrossRef]
  22. Imani, G. Machine Learning-Assisted Prediction of Oil Production and CO2 Storage Effect in CO2-Water-Alternating-Gas Injection (CO2-WAG). Appl. Sci. 2022, 12, 958. [Google Scholar] [CrossRef]
  23. He, R.; Ma, W.; Ma, X.; Liu, Y. Modeling and Optimizing for Operation of CO2-EOR Project Based on Machine Learning Methods and Greedy Algorithm. Energy Rep. 2021, 7, 3664–3677. [Google Scholar] [CrossRef]
  24. Mohagheghian, E.; James, L.A.; Haynes, R.D. Optimization of Hydrocarbon Water Alternating Gas in the Norne Field: Application of Evolutionary Algorithms. Fuel 2018, 223, 86–98. [Google Scholar] [CrossRef]
  25. You, J.; Ampomah, W.; Tu, J.; Morgan, A.; Sun, Q.; Wei, B.; Wang, D. Optimization of Water-Alternating-CO2 Injection Field Operations Using a Machine-Learning-Assisted Workflow. SPE Reserv. Eval. Eng. 2022, 25, 214–231. [Google Scholar] [CrossRef]
  26. Jaber, A.K. Genetic Algorithm to Optimize Miscible Water Alternate CO2 Flooding in Heterogeneous Clastic Reservoir. Arab. J. Geosci. 2022, 15, 714. [Google Scholar] [CrossRef]
  27. Dehghan, M.H.; Hamidi, F.; Salajegheh, M. Study of Linear Regression Based on Least Squares and Fuzzy Least Absolutes Deviations and Its Application in Geography. In Proceedings of the 2015 4th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS), Zahedan, Iran, 9–11 September 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 1–6. [Google Scholar]
  28. Kavitha, S.; Varuna, S.; Ramya, R. A Comparative Analysis on Linear Regression and Support Vector Regression. In Proceedings of the 2016 Online International Conference on Green Engineering and Technologies (IC-GET), Coimbatore, India, 19 November 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1–5. [Google Scholar]
  29. Dorugade, A.V. New Ridge Parameters for Ridge Regression. J. Assoc. Arab. Univ. Basic Appl. Sci. 2014, 15, 94–99. [Google Scholar] [CrossRef]
  30. Wang, Y.; Xia, S.-T. Unifying Attribute Splitting Criteria of Decision Trees by Tsallis Entropy. In Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 2507–2511. [Google Scholar]
  31. Gamal, H.; Alsaihati, A.; Elkatatny, S.; Haidary, S.; Abdulraheem, A. Rock Strength Prediction in Real-Time While Drilling Employing Random Forest and Functional Network Techniques. J. Energy Resour. Technol. 2021, 143, 093004. [Google Scholar] [CrossRef]
  32. Brieman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  33. Jordan, M.I.; Mitchell, T.M. Machine Learning: Trends, Perspectives, and Prospects. Science 2015, 349, 255–260. [Google Scholar] [CrossRef]
  34. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13 August 2016; Association for Computing Machinery: New York, NY, USA, 2016; pp. 785–794. [Google Scholar]
  35. Peterson, L. K-Nearest Neighbor. Scholarpedia 2009, 4, 1883. [Google Scholar] [CrossRef]
  36. Abiodun, O.I.; Jantan, A.; Omolara, A.E.; Dada, K.V.; Mohamed, N.A.; Arshad, H. State-of-the-Art in Artificial Neural Network Applications: A Survey. Heliyon 2018, 4, e00938. [Google Scholar] [CrossRef]
  37. Kennedy, J.; Eberhart, R. Particle Swarm Optimization. In Proceedings of the ICNN’95—International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; IEEE: Piscataway, NJ, USA, 1995; Volume 4, pp. 1942–1948. [Google Scholar]
  38. Xu, Y.; Zhao, X.; Chen, Y.; Yang, Z. Research on a Mixed Gas Classification Algorithm Based on Extreme Random Tree. Appl. Sci. 2019, 9, 1728. [Google Scholar] [CrossRef]
  39. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. In Proceedings of the Advances in Neural Information Processing Systems, Stateline, NV, USA, 3–6 December 2012; Curran Associates, Inc.: Red Hook, NY, USA, 2012; Volume 25. [Google Scholar]
  40. Song, K.; Yan, F.; Ding, T.; Gao, L.; Lu, S. A Steel Property Optimization Model Based on the XGBoost Algorithm and Improved PSO. Comput. Mater. Sci. 2020, 174, 109472. [Google Scholar] [CrossRef]
  41. Eberhart; Shi, Y. Particle Swarm Optimization: Developments, Applications and Resources. In Proceedings of the 2001 Congress on Evolutionary Computation, Seoul, Republic of Korea, 27–30 May 2001; IEEE Cat. No.01TH8546. IEEE: Piscataway, NJ, USA, 2001; Volume 1, pp. 81–86. [Google Scholar]
  42. Andalib Sahnehsaraei, M.; Mahmoodabadi, M.J.; Taherkhorsandi, M.; Castillo-Villar, K.K.; Mortazavi Yazdi, S.M. A Hybrid Global Optimization Algorithm: Particle Swarm Optimization in Association with a Genetic Algorithm. In Complex System Modelling and Control through Intelligent Soft Computations; Zhu, Q., Azar, A.T., Eds.; Studies in Fuzziness and Soft Computing; Springer International Publishing: Cham, Switzerland, 2015; Volume 319, pp. 45–86. ISBN 978-3-319-12882-5. [Google Scholar]
  43. Ottah, D.G.; Ikiensikimama, S.S.; Matemilola, S.A. Aquifer Matching With Material Balance Using Particle Swarm Optimization Algorithm—PSO. In Proceedings of the SPE Nigeria Annual International Conference and Exhibition, Lagos, Nigeria, 4 August 2015; p. SPE-178319-MS. [Google Scholar]
Figure 1. The structure of the XGBoost algorithm [40].
Figure 1. The structure of the XGBoost algorithm [40].
Energies 16 06149 g001
Figure 2. Particle swarm optimization Process.
Figure 2. Particle swarm optimization Process.
Energies 16 06149 g002
Figure 3. Workflow diagram.
Figure 3. Workflow diagram.
Energies 16 06149 g003
Figure 4. Schematic of the reservoir model in the feature model.
Figure 4. Schematic of the reservoir model in the feature model.
Energies 16 06149 g004
Figure 5. Distribution of each parameter. The red color represents geological parameters, the orange color represents fluid parameters, the green color represents phase saturation parameters, and the blue color represents injection/production parameters.
Figure 5. Distribution of each parameter. The red color represents geological parameters, the orange color represents fluid parameters, the green color represents phase saturation parameters, and the blue color represents injection/production parameters.
Energies 16 06149 g005
Figure 6. Cumulative oil production curve.
Figure 6. Cumulative oil production curve.
Energies 16 06149 g006
Figure 7. Cumulative oil production distribution curve.
Figure 7. Cumulative oil production distribution curve.
Energies 16 06149 g007
Figure 8. Correlation analysis of parameters with cumulative oil production.
Figure 8. Correlation analysis of parameters with cumulative oil production.
Energies 16 06149 g008
Figure 9. Model performance for each model using training and test sets.
Figure 9. Model performance for each model using training and test sets.
Energies 16 06149 g009aEnergies 16 06149 g009b
Figure 10. Optimization workflow.
Figure 10. Optimization workflow.
Energies 16 06149 g010
Figure 11. Comparison of cumulative oil production before and after optimization.
Figure 11. Comparison of cumulative oil production before and after optimization.
Energies 16 06149 g011
Figure 12. Comparison of daily oil production before and after optimization.
Figure 12. Comparison of daily oil production before and after optimization.
Energies 16 06149 g012
Table 1. Basic parameters of the feature model.
Table 1. Basic parameters of the feature model.
Parameter TypeParametersValueUnit
Geological parametersInitial pressure20MPa
Temperature70°C
Porosity0.2/
Permeability30mD
Spacing in the I direction10m
Spacing in the J direction10m
Spacing in the K direction4m
Fluid
parameters
Oil density799.2kg/m3
Gas specific gravity0.70/
Residual oil saturation index86.10/
Oil viscosity7.67mPa·s
Water saturation0.30/
Oil saturation0.70/
Phase mixing parameter0.70/
Phase saturation parametersResidual water saturation0.3/
Residual oil saturation in oil-water system0.2/
Residual oil saturation in gas-liquid system0.15/
Residual gas saturation0.15/
Injection/production parametersGas injection well bottom flow pressure30MPa
Water injection well bottom flow pressure21MPa
Production well bottom flow pressure5MPa
WF ending time3650Day
WAG gas injection60m3/day
WAG water injection60m3/day
Table 2. Range of values for each parameter.
Table 2. Range of values for each parameter.
Parameters [16,17,22]Minimum ValueMaximum ValueUnitSymbol in
Figure 5
Initial pressure1525MPaInitial pressure
Temperature45.00120.00°CTemperature
Porosity0.150.25/Por
Permeability3.5240.0mDPERMI
Spacing in the I direction520mDi
Spacing in the J direction520mDj
Spacing in the K direction25mDk
Oil density700900kg/m3Oil density
Gas specific gravity0.530.87/Gas gravity
Residual oil saturation index10.00200.00/Rsi
Oil viscosity0.1515.00mPa·sViso
Water saturation0.200.40/Sw
Oil saturation0.600.80/So
Residual gas saturation0.500.85/Omegas
Residual water saturation0.200.40/SWCON
Residual oil saturation in oil-water system0.150.25/SOIRW
Residual oil saturation in gas-liquid system0.100.20/SORG
Residual gas saturation0.100.20/SGCON
Gas injection well bottom flow pressure2238MPaINJG BHP
Water injection well bottom flow pressure2040MPaINJW BHP
Production well bottom flow pressure35MPaProd BHP
WF ending time27004600DayWF Ending time
WAG gas injection45.00180.00m3/dayWAG INJG
WAG water injection45.00180.00m3/dayWAG INJW
Table 3. Comparison of predictive performance of the models.
Table 3. Comparison of predictive performance of the models.
Machine Learning AlgorithmsTrain R2Test R2
Linear Regression0.820.75
Ridge Regression0.820.81
Decision Tree1.000.91
Random Forest0.970.96
K Nearest Neighbors0.800.79
Neural Network0.960.96
Gradient Boosting Decision Tree0.970.96
XGBoost0.990.98
Table 4. Machine learning XGBoot model hyperparameters.
Table 4. Machine learning XGBoot model hyperparameters.
Model HyperparametersMinimum ValueMaximum Values
Number of boosting stages10500
Learning rate0.010.2
Maximum depth of tree310
Gamma00.4
Minimum child weight15
Subsample0.51
Colsample by tree0.51
Table 5. Hyperparameters of PSO algorithm and XGBoost model.
Table 5. Hyperparameters of PSO algorithm and XGBoost model.
Parameters in PSO AlgorithmValue
Population number group size15
Maximum number of iterations maximum50
Inertia weight (ω)0.8
Learning factor (c1)2
Learning factor (c2)2
Table 6. Optimization of basic parameters of CO2-EOR.
Table 6. Optimization of basic parameters of CO2-EOR.
Optimization ParametersWFWAGOptimization of WAGUnit
Gas injection well bottom flow pressure28.5028.5025.33MPa
Water injection well bottom flow pressure212133.76MPa
Production well bottom flow pressure4.004.004.29MPa
WF ending time377737773533day
WAG gas injection0144120m3/day
WAG water injection17417466.71m3/day
Cumulative oil production319,234425,916475,047m3
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gao, M.; Liu, Z.; Qian, S.; Liu, W.; Li, W.; Yin, H.; Cao, J. Machine-Learning-Based Approach to Optimize CO2-WAG Flooding in Low Permeability Oil Reservoirs. Energies 2023, 16, 6149. https://doi.org/10.3390/en16176149

AMA Style

Gao M, Liu Z, Qian S, Liu W, Li W, Yin H, Cao J. Machine-Learning-Based Approach to Optimize CO2-WAG Flooding in Low Permeability Oil Reservoirs. Energies. 2023; 16(17):6149. https://doi.org/10.3390/en16176149

Chicago/Turabian Style

Gao, Ming, Zhaoxia Liu, Shihao Qian, Wanlu Liu, Weirong Li, Hengfei Yin, and Jinhong Cao. 2023. "Machine-Learning-Based Approach to Optimize CO2-WAG Flooding in Low Permeability Oil Reservoirs" Energies 16, no. 17: 6149. https://doi.org/10.3390/en16176149

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop