1. Introduction
The maturity of most giant oil fields motivates the utilization of unconventional recovery methods to increase oil production [1]. After the primary and secondary recovery stages, enhanced oil recovery (EOR) has the potential to enable the recovery of a significant amount of oil that remains in the reservoir. EOR involves the injection of a fluid or fluids into the reservoir to supply the energy required for oil displacement. Furthermore, the injected fluids interact with the reservoir rock and formation fluid, altering the physical properties of the rock and creating advantageous conditions for oil recovery [2,3,4]. One of the main EOR methods applied worldwide is thermal EOR, which supported more than 40% of total EOR production in 2015 [5]. The mechanism of thermal EOR is based on reducing oil viscosity by increasing the reservoir temperature, which implies that the target oil is a high-density, high-viscosity oil. Heavy oils are a typical example of such target oil, and several reports on thermal EOR implementations show that almost all of the implemented reservoirs are heavy oil reservoirs [6,7,8].
Steam huff and puff injection, also known as cyclic steam stimulation, comprises cyclical steam injection into a single well. After injection, the well is shut in to allow the steam to “soak” into the reservoir; the high temperature of the steam reduces oil viscosity near the steam–oil interface. This interval is called the soaking period. The well is then reopened and produces oil at a higher rate. Because most of the steam condenses during the soaking period, the well produces both hot oil and hot water. The improvement in well performance results from the reduction in oil viscosity and the increase in reservoir pressure near the wellbore. Over time, the heat dissipates and the oil viscosity increases again; once production declines to a pre-defined level, steam is reinjected into the well, starting a new injection/production cycle. This whole process of injection, soaking, and production is called the huff and puff method. Steam huff and puff injection is one of the most widely used thermal EOR methods owing to its economic attractiveness and effectiveness, and it is often used as a precursor to a full field-scale steam injection. Successful implementations of this method have been reported in Cold Lake Field, Canada [9,10], Midway Sunset Field, California [11], Duri Field, Indonesia [12,13,14] and Tia Juana Field, Venezuela [15,16]. In Cold Lake Field, a 100,000 cp bitumen was found and 20% of the reserves were recovered by implementing this method. In Midway Sunset Field, more than 19,000 steam cycles were performed in 1500 wells over a period of 23 years. In Duri Field, the oil production rate increased almost five-fold and steam huff and puff injection became the precursor to a steamflood pilot, which led to the largest steamflood implementation in the world. In Tia Juana Field, the method was applied to 145 wells and produced 30.7 million barrels of oil in 7 years.
Reservoir simulation is a crucial tool in reservoir engineering. It allows the simulation of the behavior of a dynamic reservoir model in response to the disturbance caused by the production operation. Moreover, it has become a standard in evaluating the performance of a reservoir and designing an optimized production scheme. However, despite the progress in computational hardware, the whole process of reservoir simulation remains time-consuming and expensive due to the high non-linearity of a reservoir system and the difficulty of building a reliable full reservoir model.
The cost and time inefficiency of reservoir simulation persists when designing a production scheme for steam huff and puff injection. Building predictive proxy models is a suitable way to deal with this issue. A proxy model is a mathematically or statistically defined function that represents a real system or its simulation [17]. It can provide results instantly, compared with the whole process of reservoir simulation, which incorporates screening, modeling, history matching, field development planning, and running the simulation. In reservoir engineering, proxy modeling has mainly been used for sensitivity analysis, risk analysis, history matching, production forecasting, and production optimization [17,18,19,20,21]. Prediction of reservoir performance is considered one of the main proxy model applications, and numerous proxy models for different reservoirs under different production methods have been developed and reported in the literature. For instance, Queipo et al. [22] developed a proxy model to optimize cumulative oil production for the steam-assisted gravity drainage method. Mohaghegh [23] built a proxy model using a fuzzy pattern algorithm to mimic a giant oil field simulation, where the proxy model outcomes are oil and water production over time. Artificial neural network (ANN) proxy models were developed by Artun et al. [24] for the CO2 and N2 cyclic injection method to predict oil production; by Sumardi and Irawan [25] to predict coalbed methane production; by Ayala et al. [26] to predict fluid production in a gas condensate reservoir; by Sun and Ertekin [27] for a polymer flooding process; and by Bansal et al. [28] to forecast well production in a tight oil reservoir. Dahaghi and Mohaghegh [29] constructed a proxy model to predict cumulative production from a fractured shale gas reservoir. Al-Mudhafar and Rao [30] built a proxy model to predict optimum oil recovery for an immiscible gas-assisted gravity drainage method. Polizel et al. [31] generated proxy models based on response surface methodology to predict fluid production, Net Present Value (NPV), and the oil recovery factor. Panja et al. [32] applied the same algorithm, comparing it to the least-squares support vector machine algorithm, to predict hydrocarbon production from shales. Box–Behnken design was used to develop statistical proxy models by Jaber et al. [33] to estimate incremental oil recovery under a CO2-water alternating gas (WAG) scheme and by Ahmadi et al. [34] to predict oil recovery in miscible CO2 injection.
Proxy models that predict reservoir performance for steam huff and puff injection are rarely found. Attempts to develop universal proxy models for this case were initiated by Arpaci [35], Sun and Ertekin [36], and Ersahin and Ertekin [37]. The models were presented as general models, and some are continuations and improvements of previous works over the years. Arpaci [35] developed an expert system consisting of ANN proxy models for steam huff and puff injection with horizontal wells in a naturally fractured reservoir. The inputs of the model are reservoir rock and fluid properties, operation design parameters, and fracture design parameters. The forward models predict outputs such as the number of cycles, cycle duration, oil flowrate, and cumulative oil production. Inverse models were developed to predict operation parameters and fracture design parameters, using performance indicators such as oil production and production period as additional inputs. Most of the reservoir parameters, such as oil density, relative permeability, anisotropy, and capillary pressure, were assumed constant. It was reported that the predicted number of cycles differed by two cycles in the majority of testing cases, but the production prediction still fell within the acceptable error margin. Sun and Ertekin [36] developed ANN proxy models for steam huff and puff injection. The study followed the same workflow, in which forward and inverse models were generated as an expert system. In addition, two case studies describing examples of potential practical applications were presented. The first case study compared the performance of the proxy models against reservoir simulation results for the same inputs, and the production data showed good agreement between the reservoir simulator and the artificial neural network model. Ersahin and Ertekin [37] improved the Sun and Ertekin model by optimizing the generation of input data, with the results focused more on the heat exchange effect around the wellbore, represented by the developed viscosity contour model.
This study aims to develop predictive models that estimate reservoir performance outputs for the steam huff and puff injection method. The developed predictive models are data-driven proxy models based on reservoir simulation results. These models improve on the previously developed data-driven general models in that the range and distribution of the input parameters are based on screening data from reported steam huff and puff injection field implementations. Furthermore, the provided range is comparatively wider than in the previous models. As a proxy model only works within the predetermined range of input parameter values, a wider range results in higher applicability of the model to reservoir conditions that are suitable for the injection method. To the authors’ knowledge, such use of screening survey data in the development of a proxy model has not been investigated before. In addition, this study predicts the performance output parameters one injection cycle at a time to achieve faster convergence when developing the model. The implemented machine learning algorithms are polynomial regression and artificial neural networks, as both algorithms have shown promising results in petroleum industry applications [38,39,40,41,42]. Cumulative oil production, maximum oil rate, and oil rate at the end of the injection cycle are the output parameters to be modeled. These outputs address the typical production profile of huff and puff, where the production rate peaks the moment the well is opened and then declines over a certain production period before the huff and puff cycle is repeated. Lastly, reservoir conditions that may change in one injection cycle, such as pressure and water saturation, are modeled to prepare the input parameters for the next cycle. The performance of multiple-cycle scenarios can be determined by re-inputting the unchanged parameters from the previous cycle together with the modeled changing parameters. As the models provide a quick estimation of reservoir performance and can also be used as a sensitivity analysis and optimization tool, it is believed that they can be of significance to engineers working in reservoir engineering and simulation.
2. Methodology
The predictive model for the steam huff and puff injection method is developed based on reservoir simulation results obtained with the commercial Computer Modelling Group (CMG)™ software. The design steps are creating a synthetic reservoir model, generating simulation experimental data, and developing the predictive model. The applied methodology is illustrated in Figure 1.
2.1. Synthetic Reservoir Model
The reservoir model was developed based on the fourth SPE comparative solution project [43], as illustrated in Figure 2. It is a cylindrical grid model with 20 radial grids, 1 angular grid, and 4 vertical grids. As the huff and puff method incorporates injection, soaking, and production in a single well, a cylindrical geometry is preferred because of its advantage in representing the radially dominated flow around a single wellbore. The radial grid in the near-well area is refined to better capture the thermal mechanism and fluid flow around the wellbore. A single well, which serves as both injector and producer, is positioned at the center and perforated in all grid layers, and the reservoir properties are homogeneous in every grid block.
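The near-well refinement can be illustrated with logarithmically spaced radial block boundaries. The sketch below is only an illustration of this kind of refinement; the wellbore radius, outer radius, and spacing scheme are assumed values, not the actual dimensions of the SPE-based model.

```python
import numpy as np

# Illustrative near-well refinement for a cylindrical (radial) grid:
# 20 radial blocks whose boundaries are logarithmically spaced, so blocks
# near the wellbore are much thinner than those toward the outer boundary.
# r_well and r_ext are assumed values, not the actual model dimensions.
r_well, r_ext, n_radial = 0.3, 500.0, 20   # ft
boundaries = np.logspace(np.log10(r_well), np.log10(r_ext), n_radial + 1)
block_widths = np.diff(boundaries)
print(block_widths[:3], block_widths[-3:])  # thin blocks near the well, thick far away
```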
Oil viscosity is considered an input parameter for this model. The oil viscosity vs. temperature table was generated using Andrade’s correlation, shown in Equation (1) [44]. Relative permeability functions were built using Corey’s equations, presented in Equations (2) to (5) [45]. The oil–water system is covered by Equations (2) and (3) and the gas–liquid system by Equations (4) and (5). The variation in the Corey exponents ($N_o$, $N_w$, $N_l$, $N_g$) covers both sandstone and carbonate rock types.
In Equation (1), $\mu_o$ is the oil viscosity, $a$ and $b$ are the Andrade viscosity constants, and $T$ is the temperature in °F. For the oil–water relative permeability system in Equations (2) and (3), $k_{ro}$ is the oil relative permeability and $k_{rw}$ is the water relative permeability. $k_{rocw}$ is the oil relative permeability at connate water saturation $S_{wc}$ and $k_{rwiro}$ is the water relative permeability at irreducible oil saturation $S_{oirw}$. $N_o$ is the Corey exponent for $k_{ro}$ and $N_w$ is the Corey exponent for $k_{rw}$. For the gas–liquid relative permeability system in Equations (4) and (5), $k_{rl}$ is the liquid relative permeability and $k_{rg}$ is the gas relative permeability. $k_{rolg}$ is the liquid relative permeability at connate gas saturation $S_{gc}$ and $k_{rgcl}$ is the gas relative permeability at connate liquid saturation $S_{lc}$. $N_l$ is the Corey exponent for $k_{rl}$ and $N_g$ is the Corey exponent for $k_{rg}$.
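Since the displayed forms of Equations (1)–(5) are not reproduced in this text, standard forms of Andrade’s correlation and Corey’s relative permeability functions that are consistent with the definitions above are sketched below for reference; the exact parameterization used in the simulator input may differ.

```latex
% Assumed standard forms for Equations (1)-(5); the simulator's exact
% parameterization may differ.
\begin{align*}
\mu_o &= a\, e^{\,b/T} \tag{1}\\
k_{ro} &= k_{rocw}\left(\frac{S_o - S_{oirw}}{1 - S_{wc} - S_{oirw}}\right)^{N_o} \tag{2}\\
k_{rw} &= k_{rwiro}\left(\frac{S_w - S_{wc}}{1 - S_{wc} - S_{oirw}}\right)^{N_w} \tag{3}\\
k_{rl} &= k_{rolg}\left(\frac{S_l - S_{lc}}{1 - S_{lc} - S_{gc}}\right)^{N_l} \tag{4}\\
k_{rg} &= k_{rgcl}\left(\frac{S_g - S_{gc}}{1 - S_{lc} - S_{gc}}\right)^{N_g} \tag{5}
\end{align*}
```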
The operating parameters are steam injection rate, steam quality, steam temperature, soaking period, injection period, and production period. The initial condition of the reservoir is adjusted to be within the range of reservoir conditions of the screening criteria for steam huff and puff. Steam, with varying quality and temperature, is the injected fluid. The model simulates an injection scheme for one cycle, consisting of injection, soaking, and production periods. This thermal reservoir model is simulated using the commercial simulator CMG STARS™.
The 3D model was then validated by comparing the oil production history from this model with actual field data, using field parameters and results from the Midway Sunset field [46]. Figure 3 shows the history match of the oil production rate from the simulation model to the field data, which shows good agreement. Furthermore, the cumulative oil production from the model is 878 m³, compared with 888 m³ from the field data, which are also in good agreement. It can be inferred that the synthetic model is valid and applicable for further simulation studies.
2.2. Generation of Simulation Experimental Data
Simulation experiment cases were generated by varying the values of 28 input parameters within a provided range. The cases were designed using the Latin Hypercube method, which has an advantage over classic experimental designs in orthogonality and space-filling, resulting in a unique input parameter value for each simulation case [47]. The range of each parameter was determined from the screening criteria of steam huff and puff applications obtained from a literature study. A study by Hama et al. [48] provided screening criteria in which a data cleaning process had been conducted to remove outliers and duplicates from the database. Furthermore, the distribution of each screening parameter was determined and used in this study to generate the experimental cases. The ranges of the input parameters of Corey’s equations were selected to satisfy practical fluid flow conditions and accommodate the different pore size distributions of sandstone rock. Combining these with the earlier screening criteria of Ali [49] and Taber et al. [50], the list of input parameters and their ranges for this study was determined and is presented in Table 1.
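A minimal sketch of Latin Hypercube sampling over parameter ranges is shown below using scipy’s quasi-Monte Carlo module. The parameter names and bounds are illustrative placeholders, not the actual 28 parameters and ranges of Table 1, and the study itself generated the design within the CMG toolchain.

```python
import numpy as np
from scipy.stats import qmc

# Illustrative Latin Hypercube design: each row is one simulation case with a
# unique combination of input values. Parameter names and bounds are examples
# only; the study used the 28 parameters and ranges listed in Table 1.
param_bounds = {
    "porosity":         (0.10, 0.38),    # fraction (illustrative)
    "permeability_md":  (10.0, 5000.0),  # mD (illustrative)
    "oil_viscosity_cp": (50.0, 1.0e5),   # cp (illustrative)
}
names = list(param_bounds)
lower = [param_bounds[p][0] for p in names]
upper = [param_bounds[p][1] for p in names]

sampler = qmc.LatinHypercube(d=len(names), seed=42)
unit_samples = sampler.random(n=6304)           # samples in the unit hypercube
cases = qmc.scale(unit_samples, lower, upper)   # scale to the parameter ranges
print(cases.shape)                              # (6304, 3)
```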
A sampling control was conducted before running the simulations to avoid unrealistic experiment cases, which would result in reservoir models that do not physically exist. Combinations of input parameters such as oil density, reservoir pressure and temperature, porosity–permeability, rock–fluid properties, and drainage area needed to be controlled. After sampling control, 6304 simulation experiment cases were ready to be run in the reservoir simulator. The reservoir performance of steam huff and puff injection is represented by cumulative oil production, maximum oil rate per cycle, and the oil rate at the end of the cycle. Multiple-cycle injection performance can be estimated by re-inputting parameters into the proxy model, together with the input parameters that may have changed during the previous huff and puff cycle; a conceptual sketch of this chaining is given below. The input parameters that may change and that were modeled in this study are reservoir pressure, reservoir temperature, and water saturation in the reservoir. All of the outputs from each simulation run are captured as response variables to build the predictive model.
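The sketch below illustrates how single-cycle proxy predictions could be chained into a multi-cycle forecast. The callables predict_performance and predict_end_of_cycle_state are hypothetical placeholders standing in for the trained proxy models; they are not the actual models developed in this study.

```python
# Conceptual sketch of chaining single-cycle proxy predictions into a
# multi-cycle forecast. predict_performance() and predict_end_of_cycle_state()
# are hypothetical stand-ins for the trained proxy models.

def run_multi_cycle_forecast(initial_state, operating_params, n_cycles,
                             predict_performance, predict_end_of_cycle_state):
    """Chain single-cycle proxies: predict performance outputs per cycle, then
    update the state variables (pressure, temperature, water saturation) that
    feed the next cycle, while the remaining inputs are re-used unchanged."""
    state = dict(initial_state)      # pressure, temperature, water saturation, ...
    history = []
    for cycle in range(1, n_cycles + 1):
        inputs = {**state, **operating_params}           # full set of proxy inputs
        perf = predict_performance(inputs)               # Np, max rate, end-of-cycle rate
        end_state = predict_end_of_cycle_state(inputs)   # pressure, temperature, Sw
        history.append({"cycle": cycle, **perf, **end_state})
        state.update(end_state)                          # changed parameters feed cycle n+1
    return history
```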
2.3. Predictive Model Development
The CMG CMOST™ module was utilized to develop the predictive models. This module was employed to run the generated simulation cases, record the output target results, and alter the hyperparameters to build the predictive models. Of the 6304 simulation cases, 80% were used as training data and the remaining 20% as testing data. In this study, polynomial regression and artificial neural network models were developed.
The polynomial regression (PR) method is a form of analysis that determines the relationship between the input variables and the response variable as a polynomial equation. The general quadratic polynomial regression is presented in Equation (6) below:

$$y = a_0 + \sum_{i=1}^{n} a_i x_i + \sum_{i=1}^{n} a_{ii} x_i^2 + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n} a_{ij} x_i x_j \quad (6)$$

where $a_i$ are the coefficients of the linear ($x_i$) terms, $a_{ii}$ are the coefficients of the quadratic ($x_i^2$) terms, and $a_{ij}$ are the coefficients of the interaction ($x_i x_j$) terms. All coefficients are determined using the least-squares method.
It is common that some of the terms in the model are not statistically significant. The model can be improved by removing the statistically insignificant terms, which can be identified by evaluating their significance probabilities. As the insignificant terms are removed, the modified model is often called a reduced polynomial regression model. In this study, the reservoir performance outputs are represented as reduced quadratic functions of the 28 input parameters.
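A sketch of fitting a quadratic polynomial surrogate and dropping statistically insignificant terms by p-value is given below, using statsmodels and scikit-learn. This mirrors the reduced polynomial regression idea under assumed settings (a 0.05 p-value threshold and synthetic data); the actual fitting in this study was performed within CMG CMOST.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from sklearn.preprocessing import PolynomialFeatures

def fit_reduced_quadratic(X, y, p_threshold=0.05):
    """Fit a full quadratic polynomial by least squares, then drop terms whose
    p-values exceed the threshold and refit (reduced polynomial regression)."""
    poly = PolynomialFeatures(degree=2, include_bias=False)
    X_poly = pd.DataFrame(poly.fit_transform(X),
                          columns=poly.get_feature_names_out(X.columns))
    X_poly = sm.add_constant(X_poly)
    full_model = sm.OLS(y, X_poly).fit()
    keep = full_model.pvalues[full_model.pvalues <= p_threshold].index
    keep = keep.union(pd.Index(["const"]))   # always keep the intercept
    return sm.OLS(y, X_poly[keep]).fit()

# Example usage with random data standing in for the simulation results:
rng = np.random.default_rng(0)
X = pd.DataFrame(rng.uniform(size=(200, 4)), columns=["x1", "x2", "x3", "x4"])
y = 2.0 * X["x1"] + 0.5 * X["x2"] ** 2 + rng.normal(scale=0.1, size=200)
reduced = fit_reduced_quadratic(X, y)
print(reduced.params.round(3))
```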
An artificial neural network (ANN) is a model that emulates the biological neural system. It consists of nodes that are analogous to neurons in the human brain: a node receives signals from adjacent nodes and processes them to produce an output. The ANN model structure is presented in Figure 4. It consists of an input layer, an output layer, and one or multiple hidden layers in between. Each neuron in a layer is connected to the neurons of the neighboring layers by a weighted connection, where the weight represents the influence of the corresponding input on the connected neuron. An activation (or transfer) function is then used to transfer the contained information; this is known as forward propagation and is conducted for every connection in the network. After the model is defined, the backpropagation algorithm is used to readjust the weights and minimize the error between the model output and the database. In this study, a multilayer neural network structure was implemented, where the input layer consists of 28 neurons corresponding to the input parameters (listed in Table 1), the output layer consists of 1 neuron as the output parameter, and there are multiple hidden layers in between, each consisting of a number of neurons. The activation function for all layers is the hyperbolic tangent (tanh) function presented in Equation (7):

$$\tanh(x) = \frac{e^{x} - e^{-x}}{e^{x} + e^{-x}} \quad (7)$$

This activation function is used as it shows good performance when employed to build a multilayer neural network.
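A minimal sketch of a multilayer network with tanh activation is shown below using scikit-learn’s MLPRegressor as an analogue; the actual models in this study were built with CMG CMOST, and the synthetic data, 8–8–6–4–4 hidden-layer configuration, and solver settings here are assumptions for illustration only.

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline

# Illustrative ANN surrogate with tanh activation on a synthetic stand-in
# dataset (28 inputs, one output), mimicking the 80/20 train/test split.
rng = np.random.default_rng(1)
X = rng.uniform(size=(6304, 28))                     # 28 input parameters
y = X @ rng.normal(size=28) + np.sin(3 * X[:, 0])    # synthetic response

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=1)

ann = make_pipeline(
    StandardScaler(),
    MLPRegressor(hidden_layer_sizes=(8, 8, 6, 4, 4), activation="tanh",
                 solver="adam", max_iter=5000, random_state=1),
)
ann.fit(X_train, y_train)
print(f"train R2 = {ann.score(X_train, y_train):.3f}, "
      f"test R2 = {ann.score(X_test, y_test):.3f}")
```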
Three evaluation metrics were employed to assess the performance of the developed models: the coefficient of determination (R-square), the mean absolute error (MAE), and the root mean square error (RMSE). The R-square is a statistical metric that measures how well the regression predictions match the actual data points, where a value of 1 indicates a perfect fit. The adjusted R-square is a modification of the R-square calculation that takes into account the number of input parameters $k$; its value is always lower than the R-square. Both are presented in Equations (8) and (9), respectively, where $\hat{y}_i$ is the predicted value, $y_i$ is the simulation data, and $N$ is the number of experimental cases. In this study, the adjusted R-square is used as the fitting metric, as it has the potential to be more accurate for a high number of input parameters. A scatter plot of simulated and predicted data is presented to visualize the performance of each model; in a good, representative model, both the training and testing data points should lie close to the 45-degree baseline ($y = x$), as the baseline represents a perfect fit ($R^2 = 1$). Additionally, the MAE and RMSE are calculated for the training and testing results of each model. The MAE is the average absolute difference between the predicted and actual data, which indicates the relative deviation between them. The RMSE is the square root of the mean of the squared errors; it gives relatively high weight to large errors and grows as the frequency of large errors increases. Both metrics range from 0 to infinity, and lower values are preferred. The MAE and RMSE are given in Equations (10) and (11), respectively.
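For reference, the standard definitions of the metrics referred to as Equations (8)–(11) are reproduced below in the notation used above; the paper’s exact typesetting of these equations is assumed rather than reproduced.

```latex
% Assumed standard definitions for Equations (8)-(11), in the notation above:
% \hat{y}_i is the predicted value, y_i the simulation data, \bar{y} their mean,
% N the number of experimental cases, and k the number of input parameters.
\begin{align*}
R^2 &= 1 - \frac{\sum_{i=1}^{N}\left(y_i - \hat{y}_i\right)^2}
               {\sum_{i=1}^{N}\left(y_i - \bar{y}\right)^2} \tag{8}\\
R^2_{adj} &= 1 - \left(1 - R^2\right)\frac{N - 1}{N - k - 1} \tag{9}\\
MAE &= \frac{1}{N}\sum_{i=1}^{N}\left|y_i - \hat{y}_i\right| \tag{10}\\
RMSE &= \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(y_i - \hat{y}_i\right)^2} \tag{11}
\end{align*}
```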
Lastly, Sobol analysis [51], a variance-based sensitivity analysis, was conducted for each developed proxy model to determine the significance of the input parameters for each model. This method quantifies the amount of variance that each input parameter contributes to the variance of the output. It identifies the total effect of an input parameter, which can be separated into a main effect and an interaction effect [52]. The main effect is the contribution to the model output variance from the variation of the input parameter itself, and the interaction effect is the contribution of particular combinations of model inputs; the sum of the two is called the total effect. This analysis was conducted for the model with the better performance after evaluating the fitting metrics of the polynomial regression and neural network models.
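A sketch of a variance-based Sobol analysis on a trained surrogate is shown below using SALib. The three parameters, their bounds, and the stand-in proxy function are assumptions for illustration; in this study the analysis was performed on the trained proxy models over the 28 inputs of Table 1 within the CMG toolchain.

```python
import numpy as np
from SALib.sample import saltelli
from SALib.analyze import sobol

# Illustrative Sobol sensitivity analysis of a surrogate model. The parameter
# names, bounds, and proxy() function are examples only.
problem = {
    "num_vars": 3,
    "names": ["Sw", "ViscB", "pressure"],
    "bounds": [[0.1, 0.6], [500.0, 5000.0], [200.0, 1500.0]],
}

def proxy(x):                       # stand-in for a trained proxy model
    sw, visc_b, p = x
    return (1.0 - sw) * p / visc_b

param_values = saltelli.sample(problem, 1024)     # N * (2D + 2) input samples
Y = np.array([proxy(x) for x in param_values])    # evaluate the surrogate
Si = sobol.analyze(problem, Y)
print(dict(zip(problem["names"], Si["S1"])))      # main (first-order) effects
print(dict(zip(problem["names"], Si["ST"])))      # total effects
```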
3. Results and Discussion
Each proxy model is represented as a function of the 28 input parameters and evaluated based on its R-square value. The predicted and simulated data points should be scattered close to the baseline, as it represents the best fit of the model. In the polynomial model, shown in Equation (6), a least-squares fitting algorithm was used to determine the coefficients of all terms; the resulting equation expresses an output parameter as a function of the input parameters. In the neural network model, the number of layers and the number of neurons in each layer are pre-assigned before training the model, and the end result is a table that contains the weight values for all node connections. Through trial and error, the hyperparameters of all the developed models were optimized to achieve high prediction performance.
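The trial-and-error tuning of the network architecture can also be automated with a simple grid search; the sketch below uses scikit-learn as an analogue to the CMOST workflow, and the candidate layer configurations are illustrative, not the full set explored in the study.

```python
from sklearn.model_selection import GridSearchCV
from sklearn.neural_network import MLPRegressor

# Illustrative search over hidden-layer configurations; the candidates below
# are examples only.
candidates = {
    "hidden_layer_sizes": [(8, 8, 6, 4, 4), (10, 8, 6), (8, 6, 4), (4, 4, 2)],
}
search = GridSearchCV(
    MLPRegressor(activation="tanh", solver="adam", max_iter=5000, random_state=1),
    param_grid=candidates,
    scoring="r2",
    cv=5,
)
# search.fit(X_train, y_train)   # X_train / y_train as in the earlier sketch
# print(search.best_params_, search.best_score_)
```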
3.1. Proxy Models for Predicting Reservoir Performance
The reservoir performance outputs are represented by cumulative oil production, maximum oil rate per cycle, and oil rate at the end of the cycle. As mentioned above, these outputs address the typical production profile of huff and puff. Predicted and simulated data points for each output are presented in Figure 5, Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10, respectively. Overall, the neural network model provides a better fit for training and testing data when compared to the polynomial regression model.
The cumulative oil production values range from 0.1 to 100 MSTB. The polynomial model expresses this output as a function of all input parameters. The corresponding neural network model is a deep neural network with the following architecture: 1 input layer with 28 input neurons, 1 output layer with 1 output neuron, and five hidden layers with an 8–8–6–4–4 neuron configuration. The input and output layer configurations are the same for all neural network models mentioned in this study. For the training data, the R-square values for the polynomial regression model and the neural network model are 0.755 and 0.981, while the R-square values for the testing data are 0.735 and 0.959. The MAE of the training and testing data for the polynomial regression model is 7.53 Mbbl and 8.28 Mbbl and, for the neural network model, 1.39 Mbbl and 1.76 Mbbl. The RMSE of the training and testing data for the polynomial regression model is 10.39 Mbbl and 11.65 Mbbl and, for the neural network model, 2.31 Mbbl and 3.36 Mbbl. The neural network model clearly presents a better result than the polynomial regression model. The predictions for the training data are mostly good, with points close to the baseline. The testing predictions fit well below 20 Mbbl but start to scatter further from the best-fit line above that value; since around 90% of the simulated cumulative production values are below 20 Mbbl, the predictions are better inside that interval. It can also be observed that the polynomial regression model predicts negative values while the neural network model does not, which shows an advantage of the neural network model, as negative production values do not exist in reality.
The maximum oil rate values range from 0 to 2000 stb/d. The polynomial model expresses this output as a function of all input parameters, and the corresponding neural network model has an architecture of five hidden layers with an 8–8–6–4–4 neuron configuration. For the training data, the R-square values for the polynomial regression model and the neural network model are 0.706 and 0.957, while the R-square values for the testing data are 0.675 and 0.931. The MAE of the training and testing data for the polynomial regression model is 221.6 bbl/d and 239.9 bbl/d and, for the neural network model, 63.4 bbl/d and 74.3 bbl/d. The RMSE of the training and testing data for the polynomial regression model is 281.5 bbl/d and 303.6 bbl/d and, for the neural network model, 97.6 bbl/d and 126.4 bbl/d. It can be seen from Figure 7 and Figure 8 that the neural network model far outperforms the polynomial regression model. The training and testing data in the neural network model are generally clustered around the best-fit line, and the predictions fit especially well below 250 bbl/d.
The oil rate values at the end of the cycle range from 0 to 400 stb/d. The polynomial model expresses this output as a function of all input parameters, and the corresponding neural network model has an architecture of five hidden layers with an 8–8–6–4–4 neuron configuration. For the training data, the R-square values for the polynomial regression model and the neural network model are 0.561 and 0.953, while the R-square values for the testing data are 0.524 and 0.847. The MAE of the training and testing data for the polynomial regression model is 29.1 bbl/d and 31.1 bbl/d and, for the neural network model, 8.6 bbl/d and 11.8 bbl/d. The RMSE of the training and testing data for the polynomial regression model is 43.0 bbl/d and 45.9 bbl/d and, for the neural network model, 14.1 bbl/d and 21.7 bbl/d. The polynomial regression model performs poorly in comparison to the neural network model for this output, as shown in Figure 9 and Figure 10. Although the deviation is noticeable, overall the training and testing data points correlate well with the best-fit line. The predictions for both the training and testing data fit well below 20 bbl/d and start to drift further from the best-fit line above that value; since around 80% of the data points are below 20 bbl/d, the predictions are better inside that interval.
Sobol analysis was implemented for the neural network models to observe the significance of the input parameters. Table 2 presents the 10 input parameters with the highest contributions towards predicting each output for the set of base input parameters described in Table 1. In general, reservoir water saturation, represented by Sw, and oil viscosity, represented by Andrade’s viscosity constant ViscB, are the two parameters with the highest impact on the reservoir production performance models of steam huff and puff injection. The contributions of Sw and ViscB are 16.5% and 11.0% for the cumulative oil production model, 53.1% and 25.5% for the maximum oil rate model, and 10.7% and 12.0% for the end-of-cycle oil rate model.
Reservoir water saturation is directly related to the volume of oil inside the reservoir; hence, a change in its value impacts the amount of oil produced. Oil viscosity is a determining factor of oil production performance, as the objective of steam huff and puff injection is to lower the oil viscosity, allowing the oil to flow more easily and increasing production. A higher initial oil viscosity results in less efficient production for a fixed amount of injected steam because more heat energy is needed to decrease the viscosity. It is worth mentioning that porosity does not appear in the top-ten lists of two of the production models and appears only in the lower part of the list of the third, despite its significance as a reservoir rock property that also directly links to the volume of oil in place. This can be explained by the distribution of porosity input values in the field implementation screening data, where a large number of porosity values may be clustered within a small range owing to a skewed distribution. In addition, the presence of other volume-related parameters with possibly wider input distributions, such as water saturation and drainage area, which do appear in the list, further increases their dominance in the contributions towards the proxy model outputs.
The other influential parameters for each proxy model can be seen in the table, although their contributions are significantly lower than those of the first two parameters. For the cumulative oil production model, reservoir pressure and the bulk volume of the reservoir are the next contributors; for the maximum oil rate model, reservoir pressure and permeability; and for the end-of-cycle oil rate model, the production period of the steam huff and puff cycle, reservoir pressure, and the drainage area of the well.
3.2. Proxy Models for Predicting Reservoir Conditions after an Injection Cycle
The reservoir condition outputs after one cycle are represented by reservoir pressure, reservoir temperature, and reservoir water saturation. These outputs address the reservoir parameters that are most likely to change in one injection cycle. The parameters are defined as the average parameter values in the reservoir model, assuming that the effect of steam injection has mostly diminished and a new injection cycle can be started in the reservoir. Predicted and simulated data points for each output are presented in Figure 11, Figure 12, Figure 13, Figure 14, Figure 15 and Figure 16.
The reservoir pressure values at the end of the injection cycle range from 0 to 1200 psi in the results dataset. The polynomial model expresses this output as a function of all input parameters, and the corresponding neural network model has an architecture of three hidden layers with a 10–8–6 neuron configuration. For the training data, the R-square values for the polynomial regression model and the neural network model are 0.878 and 0.991, while the R-square values for the testing data are 0.877 and 0.989. The MAE of the training and testing data for the polynomial regression model is 64.8 psi and 66 psi and, for the neural network model, 14.9 psi and 16.1 psi. The RMSE of the training and testing data for the polynomial regression model is 85.8 psi and 87.6 psi and, for the neural network model, 22.3 psi and 25.5 psi. It can be seen in Figure 11 and Figure 12 that the neural network model performs better than the polynomial regression model. The training and testing data in the neural network model are generally clustered around the best-fit line. It is also worth highlighting that the neural network model predicts no negative values, whereas the polynomial regression model yields some negative data points.
The reservoir temperature values at the end of the injection cycle range from 50 to 194 °F in the results dataset. The polynomial model expresses this output as a function of all input parameters, and the corresponding neural network model has an architecture of three hidden layers with an 8–6–4 neuron configuration. For the training data, the R-square values for the polynomial regression model and the neural network model are 0.962 and 0.995, while the R-square values for the testing data are 0.954 and 0.978. The MAE of the training and testing data for the polynomial regression model is 2.5 °F and 2.6 °F and, for the neural network model, 0.9 °F and 1.3 °F. The RMSE of the training and testing data for the polynomial regression model is 3.8 °F and 4.2 °F and, for the neural network model, 1.4 °F and 2.9 °F. The performance of the two models does not differ much, although some training data points drift away from the baseline in the prediction–simulation plot of the polynomial regression model. Overall, the training and testing data in the neural network model fit well around the best-fit line.
The reservoir water saturation values at the end of the injection cycle range from 0 to 0.88 in the results dataset. The polynomial model expresses this output as a function of all input parameters, and the corresponding neural network model has an architecture of three hidden layers with a 4–4–2 neuron configuration. For the training data, the R-square values for the polynomial regression model and the neural network model are 0.939 and 0.982, while the R-square values for the testing data are 0.935 and 0.980. The MAE of the training and testing data for the polynomial regression model is 0.0259 and 0.0264 and, for the neural network model, 0.0152 and 0.0156. The RMSE of the training and testing data for the polynomial regression model is 0.0331 and 0.0338 and, for the neural network model, 0.0208 and 0.0218. Similar to the reservoir temperature model, both the polynomial regression and neural network models fit well, although a slight distortion is observed in the polynomial model above a water saturation value of 0.6.
After conducting Sobol analysis for these models, it is observed that the parameter that dominates the contribution to each output model is the initial value of the corresponding parameter itself, before implementing the steam huff and puff injection cycle. Table 3 shows the five highest contributing parameters, as the remaining parameters contribute significantly less. The most important parameter for the reservoir pressure model at the end of the cycle is the initial pressure, with a 32.7% contribution. For the reservoir temperature model, the most important parameter is the initial temperature with 50.9% significance, and for the reservoir water saturation model, it is the initial water saturation with 78.5% significance. This explains the high fitting values of both the polynomial regression and neural network models, as each model is highly linear due to the dominance of one input parameter. It also suggests that, by the end of the cycle, the traces and effects of the steam injection have diminished, leaving the reservoir condition nearly similar to the initial condition that prompted the steam injection treatment. However, for the reservoir pressure model, water saturation and viscosity are additional parameters that provide a significant contribution. This result can be justified considering that reservoir pressure changes as the volume of fluid filling the pore spaces changes: production, which displaces fluid out of the reservoir, depletes the reservoir pressure. This process is the underlying mechanism that determines the reservoir production performance; hence, the two most important parameters of the production predictive models above, water saturation and oil viscosity, likely also influence the reservoir pressure model.
3.3. Evaluation of Model Fitting and Performance Metrics
A summary of the evaluation metrics for each proxy model is presented in Table 4 and Table 5. In general, the performance of the neural network models is superior to that of the polynomial regression models. This is due to the nature of the two models, where the high nonlinearity present in this modeling problem is captured better by the neural network model than by the polynomial regression model. For the training and testing data of the neural network models, almost all of the R-square values are above 0.9, which indicates a good fit; additionally, the MAE and RMSE values are notably lower than those of the polynomial models. The good agreement between the fitting indicators of the training and testing data indicates that the models are not overfitting. Overfitting is undesirable, as it implies that a model only memorizes the training data and may perform poorly when exposed to unseen test data. This was considered when designing the experiment cases, where the combination of input parameter values is unique for each case.
The prediction versus simulation plots show that deviations in the testing data points appear past a certain threshold for each neural network model, for instance above 20 Mbbl in the testing data of the cumulative oil production model. As mentioned previously, deviation past this value may occur as a result of the imbalanced proportion of data points along the output interval: around 90% of the simulated cumulative oil production values are below 20 Mbbl, so fewer data are provided to the model above that threshold, affecting its accuracy. The same can be said for the end-of-cycle oil rate model, where the testing data points drift significantly past the 20 bbl/day threshold, resulting in a relatively lower R-square value in comparison to the other neural network models. Conversely, it can be inferred that the prediction performance below these thresholds is excellent. For this specific model, a possible improvement is to create separate proxy models for low and high output values based on that threshold, which would include re-evaluating the input values and distributions that result in low and high predictions.
The neural network models presented in this study can be applied to accurately determine reservoir performance for steam huff and puff injection. These predictive models save a considerable amount of time and cost and are believed to be helpful in assisting users in designing simulation scenarios. A limitation of the proxy models is that they are only applicable as long as the input parameter values lie within the parameter intervals of the model. The workflow used in this study can be applied to develop proxy models for other reservoir configurations or injection method variations, for example, a fractured reservoir or a chemical-assisted steam huff and puff injection.