1. Introduction
The offshore oil reservoirs face challenges such as high water cut, low permeability, and uneven reservoir development. How to fully exploit the potential of existing oil reservoirs and improve economic benefits is an issue of great engineering significance. Production optimization, by adjusting the injection-production relationship, can effectively increase oil recovery while reducing costs, and it is widely applied in offshore oil fields to reduce costs and enhance efficiency [
1,
2,
3,
4,
5]. There are primarily two categories of production optimization methods, with the first being gradient methods. Two line search methods, namely steepest descent and conjugate gradient, were employed and compared by Asadollahi and Naevdal [
6], and the results demonstrated that the conjugate gradient method can rapidly search for the optimal development scheme. Forouzanfar et al. utilized a gradient-based enhanced Lagrangian optimization code for the life cycle production optimization of oil reservoirs [
7]. The objective was to identify the optimal well controls for a 15-year production period. In comparison with the reference case, the net present value increased by 8.9% over the 15-year production period. Liu and Reynolds [
8] used gradients calculated with the adjoint method to find boundary points by maximizing the augmented Lagrangian function, optimizing both the life-time net present value and short-term net present value. Bukshtynov et al. [
9] developed and applied an efficient, robust, and flexible closed-loop reservoir management computational framework using a gradient-based production optimization method. This approach aimed to enhance the net present value through the utilization of automatic differentiation. Sefat et al. [
10] provided guidance for selecting appropriate reservoir production optimization methods by employing state-of-the-art stochastic gradient approximation algorithms. Volkov and Bellout [
11] developed an analytical framework to investigate the impact of enforcing simulator-based economic constraints during the execution of gradient-based production optimization. This approach enhanced the performance of the search process.
While gradient-based methods can quickly obtain computational results, for complex oil reservoir production optimization problems, obtaining gradient information is often challenging and may lead to difficulty in searching for the global optimal scheme. To address the aforementioned challenges, heuristic-based evolutionary algorithms are widely applied in the field of production optimization, adept at resolving high-dimensional optimization problems [
12]. Evolutionary algorithms, such as differential evolution [
13], particle swarm optimization [
14], and genetic algorithm [
15], possess strong global search capabilities. They gradually approach the optimal solution by generating a large number of candidate development schemes. Reddy and Kumar proposed an efficient and effective multi-objective optimization method based on the differential evolution algorithm, applying it to a case study of reservoir system optimization [
16]. This approach can generate Pareto optimal solutions for multi-objective reservoir operation problems with convergence. Siavashi and Yazdani compared the performance of genetic algorithm (GA) and particle swarm optimization (PSO) in enhancing enhanced oil recovery (EOR) projects and combined the Newton method with PSO to improve its convergence speed [
17]. Gu et al. utilized the differential evolution algorithm to optimize a reservoir water-cut minimization control model, effectively improving water displacement efficiency [
18]. Zhang et al. employed the differential evolution algorithm to solve constrained reservoir production optimization problems, achieving higher net present values [
19]. An et al. introduced conventional particle swarm optimization into reservoir engineering methods, reducing the computational costs of production optimization in commercial oil fields [
20].
Although evolutionary algorithms can enhance global search capabilities, as mentioned earlier, they require generating a large number of candidate schemes to approximate the optimal solution. If all candidate solutions are evaluated and screened through numerical simulators, it can be extremely time-consuming. To address this time-consuming issue, in recent years, a method that utilizes surrogate models to replace reservoir numerical simulators and expedite the optimization process has gained popularity in the field of production optimization. A surrogate model is a mathematical model that takes development schemes as inputs and provides reservoir responses as outputs. In other words, by evaluating a small number of development schemes through a numerical simulator, a high-precision surrogate model is constructed. This surrogate model is then used in the subsequent optimization process to replace the numerical simulator, enabling the rapid generation of reservoir responses. Surrogate models are data-driven and come in various types, such as radial basis function network [
21], support vector machine regression [
22], and the Gaussian process [
23], among others. Based on the type and quantity of surrogate models used, optimization methods can be classified into two categories: single surrogate optimization methods and multi-surrogate optimization methods. Let us first introduce single surrogate optimization methods. Golzari et al. proposed a novel approach for constructing an adaptive surrogate model, employing a dynamic artificial neural network as the surrogate model to achieve a rapid approximation of the actual reservoir simulation model [
24]. This method demonstrates good accuracy, enhancing the overall optimization process. Aladeitan et al. utilized surrogate modeling methods to assess the optimal well designs for three different development strategies aiming to enhance the recovery factor of a marginal oil reservoir [
25]. Chen et al. proposed a new framework based on surrogate-assisted evolutionary algorithm and dimensionality reduction [
26]. This framework utilizes Sammon mapping to map high-dimensional decision variables to low-dimensional space, achieving better performance. Zhao et al. proposed a new method for solving multi-objective production optimization problems [
27]. This method employs radial basis function networks as surrogate models to predict the dominant relationship between candidate solutions and pre-screening. Ogbeiwi et al. deployed a surrogate model for total oil production to perform robust optimization of water flooding in the Niger Delta reservoir, significantly reducing computational costs [
28]. In contrast to single surrogate optimization methods, multi-surrogate optimization involves the participation of multiple surrogate models, either of the same or different types, in the optimization process. Chen et al. proposed a production optimization method that utilizes both global and local surrogate models to assist differential evolution [
29]. The collaboration between global optimization and local search contributes to making the production optimization process more efficient and effective. Du et al. established a multi-objective production optimization framework that combines Bayesian Random Forest and Particle Swarm Optimization algorithms [
30]. This framework effectively balances oil production and the gas–oil ratio. Wang et al. employed a multi-surrogate model framework using polynomial response and radial basis function networks, reducing prediction uncertainty in the production optimization process [
31]. Wang et al. [
32] employed a multi-surrogate model production optimization framework based on Pareto criteria, expediting the search for optimal solutions.
Although evolutionary algorithms and surrogate models have been widely applied in production optimization, there is currently a lack of comparative studies in the context of offshore oil reservoirs. Offshore oil reservoirs pose challenges with a multitude of optimization variables and harsh production conditions, making it essential to compare various optimization methods for informed decision-making. Therefore, this paper selects representative algorithms from evolutionary optimization, single surrogate optimization methods, and multi-surrogate optimization methods. Performance tests for production optimization problems are conducted on two real offshore oil reservoir cases. The performance comparison of these methods aims to assist field engineers in better applying advanced intelligent computing methods to the field of offshore oil reservoir production optimization. The remaining structure of the article is as follows. Chapter two introduces the production optimization issues in offshore oil reservoirs, providing the foundational context for the problems addressed in this paper. Chapters three to five, respectively, delve into the three categories of methods being compared. Chapter six presents a comparative analysis of the application effectiveness of these three methods on two oil reservoir cases. Finally, the paper concludes with the findings and conclusions drawn from the study.
2. Problem Statement
Production optimization in offshore oil reservoirs is primarily achieved through controlling the development schemes of production and injection wells to maximize cumulative oil production or economic benefits. The term “development schemes” here refers to the bottom-hole pressure or flow rate of each well. In practical production, economic benefits are typically chosen as the optimization objective, specifically the net present value (NPV). NPV can be expressed using the following formula:
where
J is the objective function for optimization, representing the NPV;
x represents the development scheme, and
v is the state variable containing reservoir characteristics.
Nt represents the total number of optimization time steps;
represents the time interval for each time step in the optimization; b is the discount rate; and
are the oil production rate, water production rate of the
j-th production well at time step
t (STB/D), respectively;
is the water injection rate of the
j-th injection well at time step
t (STB/D);
ro,
rw, and
ri are oil production revenue, produced water treatment cost, and injection cost, all in USD/STB, respectively;
NP and
NI represent the number of production and injection wells, respectively.
In addition, this paper also considers the boundary constraint of the development scheme. Therefore, the production optimization problem for offshore oil reservoir can be defined by the following formula:
where
is the dimension number of the development scheme,
A denotes the number of production and injection wells, i.e.,
;
xlow and
xup represent the upper and lower bounds of the development scheme, respectively.
3. Differential Evolutionary Algorithm for Solving Production Optimization Problems in Offshore Reservoirs
The differential evolution (DE) algorithm, as an optimization algorithm with global search capabilities, has been widely applied in the industrial field in recent years [
33]. Although many new optimization algorithms are emerging in the field of intelligent computing, due to the stability of the DE, it performs better in reservoir production optimization [
32]. Therefore, this paper chooses it as a comparative algorithm for solving offshore oil reservoir production optimization problems. The overall process of DE is shown in Algorithm 1.
Algorithm 1: Pseudocode of DE |
01: | Initialization: Generate development schemes using Latin hypercube sampling [34] in the boundary constraint space, evaluate them with the numerical simulator to obtain NPV , and then store both the schemes and NPV in a database DB; |
02: | While the stopping criterion is not met |
03: | Mutation: Generate mutation vectors by applying the mutation factor to ; |
04: | Crossover: Generate new schemes by applying the crossover factor to and mutation vectors; |
05: | Selection: Calculate the NPV for all new schemes using the numerical simulator and store the best one along with its NPV in the DB; |
06: | End while |
07: | Output: The best scheme in DB. |
The core idea of DE is to use the existing development schemes as a foundation, generate new schemes using crossover and mutation operators, and select the optimal individuals in the new schemes based on their NPV. This process is repeated until the predetermined maximum number of numerical simulation iterations is reached. Initially, the existing development schemes
are generated by applying the Latin hypercube sampling method according to the boundary constraints outlined in Equation (2). Their NPV
is calculated using a numerical simulator, and a sample library
DB is constructed, where
N is the number of development schemes. The first step of DE is to generate mutation vectors based on the existing development schemes using a mutation factor:
where
vi is the mutation vector corresponding to the
i-th development scheme;
xbest is the individual in the existing schemes with the highest NPV;
F is the pre-defined mutation factor;
m1 and
m2 are two random integers between 1 and
N. The next step is the crossover process:
The individual
xi in the existing development schemes can be represented as the following:
;
d is the dimension number of the development scheme.
represents the
j-th dimension of the new scheme obtained by updating the
i-th existing scheme through the DE algorithm;
is a random number between 0 and 1;
C is a pre-defined crossover factor. During the interaction in each dimension, a random number
is generated. If
is less than the pre-defined crossover factor, then in that dimension, the new scheme will be the
j-th dimension of the mutation vector. Otherwise, the existing scheme will be retained.
is a random integer between 1 and
d, ensuring that the new scheme cannot be identical to the existing scheme. After generating new schemes, the selection process is to select the one with the highest NPV among all the new schemes and add it to the database:
where
is the scheme after the selection process and it is placed in the database together with the corresponding NPV;
represents all the new schemes generated after the crossover phase, and their NPV is calculated entirely using a numerical simulator.
4. Canonical Surrogate Assisted Evolutionary Algorithm for Solving Production Optimization Problems in Offshore Reservoirs
As mentioned earlier, when solving the offshore oil reservoir production optimization problem using only the differential evolution algorithm, it is necessary to use a numerical simulator to calculate the generated numerous candidate solutions, significantly increasing computational costs. This issue is reflected in step 5 of Algorithm 1, where the NPV of all new schemes needs to be computed using the numerical simulator, leading to a reduced number of optimization iterations within the loop. To solve this problem, the surrogate model method is used to assist the evolutionary algorithm in the rapid determination of the optimal development scheme. This method is referred to as the canonical surrogate-assisted evolutionary algorithm (CSAEA) [
35]. CSAEA employs a training phase to train surrogate models using real samples, thereby reducing the reliance on numerical simulators during subsequent optimizations. The surrogate model is constructed by taking development schemes
as input and NPV
as output. After the surrogate model is established, it can quickly generate reservoir responses as a substitute for numerical simulation. Among these, the radial basis function network (RBFN) has been widely applied in the field of production optimization. In this paper, the RBFN is employed to assist the differential evolution as the second comparative algorithm. Let us start by introducing the RBFN. RBFN is an artificial neural network that approximates the numerical simulator by summing weighted basis functions. The expression is as follows:
where
is the predicted value of the RBFN;
represents the weight coefficients;
x is the development scheme to be predicted;
xi is a development scheme in the
DB;
N is the total number of development schemes, and
is the basis function. In this paper, the cubic basis function is chosen, and the expression is as follows:
After the establishment of RBFN, it can be used in subsequent optimization to replace numerical simulators for screening new development schemes. The overall process of CSAEA is shown in Algorithm 2. Its overall process is similar to DE, with the most crucial difference lying in step 5. In this step, the surrogate model is used to predict the NPV of all new schemes, reducing the need for the numerical simulator to evaluate only the selected solution with the highest predicted value. This significantly reduces the number of numerical simulations required. The flowchart of CSAEA is shown in
Figure 1.
Algorithm 2: Pseudocode of CSAEA |
01: | Initialization: Generate development schemes using Latin hypercube sampling in the boundary constraint space, evaluate them with the numerical simulator to obtain NPV , and then store both the schemes and NPV in a database DB; |
02: | While the stopping criterion is not met |
03: | Train an RBFN surrogate model using the development schemes in DB and NPV; |
04: | Generating new schemes using the DE algorithm. |
05: | Select the best one from the new schemes based on the predicted values of the RBFN model, calculate its NPV using the numerical simulator, and update the database; |
06: | End while |
07: | Output: The best scheme in DB. |
5. Global–Local Surrogate-Assisted Differential Evolution for Solving Production Optimization Problems in Offshore Reservoirs
CSAEA utilizes all samples in the database for the construction of the RBFN model, which results in a higher model accuracy. However, due to the uneven quality of samples in the database, it is challenging to quickly pinpoint the optimal development scheme. To address this issue, Chen et al. proposed a global–local surrogate-assisted differential evolution (GLSADE) algorithm, constructing two surrogate models to accelerate the optimization process [
29]. One of them is the global surrogate model, which, similar to the RBFN established in CSAEA, is trained using all the samples in the database
DB. It is noteworthy that the second surrogate model established is a local surrogate model, trained with outstanding schemes selected based on NPV ranking from the
DB. The advantages of establishing a local surrogate model to participate in the search can be illustrated using
Figure 2:
As shown in
Figure 2, the blue solid line represents the true function curve, i.e., the numerical simulation; the red dashed line represents the local surrogate model; the black solid dots represent all samples in the database
DB; the gray solid dots represent samples with NPV under a specific threshold, forming a subset of the database denoted as
DBtop; the red cross represents the optimal sample. From the figure, it can be observed that if a surrogate model is built for optimization using samples within the range of
xlow and
xup, the search scope is too broad, making it difficult to quickly locate the optimal solution. However, by constructing a local surrogate model with the gray excellent samples, the search can be performed within the narrowed range of
and
, leading to a rapid identification of the optimal solution.
and
can be calculated using the following expression:
and are composed of the minimum and maximum values of each dimension across all samples in the DBtop, respectively. The overall process of GLSADE is shown in Algorithm 3.
Algorithm 3: Pseudocode of GLSADE |
01: | Initialization: Generate development schemes using Latin hypercube sampling in the boundary constraint space, evaluate them with the numerical simulator to obtain NPV , and then store both the schemes and NPV in a database DB; |
02: | While the stopping criterion is not met |
03: | //Global surrogate model search phase |
04: | Construct a global RBFN model using all the samples in DB; |
05: | Generating new schemes using the DE algorithm; |
06: | Select the best one from the new schemes based on the predicted values of the global RBFN model, calculate its NPV using the numerical simulator, and update the database; |
07: | //Local surrogate model search phase |
08: | Sort the samples in the DB based on NPV and select the n top-ranked samples to generate a subset database DBtop; |
09: | Construct a local RBFN model using all the samples in DBtop; |
10: | Calculate and based on the minimum and maximum values of each dimension for all samples in the DBtop; |
11: | Generate optimized initial schemes using LHS within the range of and ; |
12: | Generating new schemes using the DE algorithm within the range of and ; |
13: | Select the best one from the new schemes based on the predicted values of the local RBFN model, calculate its NPV using the numerical simulator, and update the database; |
14: | End While |
15: | Output: The best scheme in DB. |
As illustrated in Algorithm 3, the global surrogate model search phase of GLSADE is roughly similar to CSAEA. In the local search phase, firstly, the n top-ranked samples are selected from the DB to construct a data subset DBtop. Subsequently, DBtop is used to train a local RBFN model. Then, the upper and lower limits of the DBtop are determined. In both CSAEA and DE, new schemes are generated based on all schemes in the DB. However, in the local search phase of GLSADE, optimized initial schemes are generated within the determined superior upper and lower limits using LHS. These solutions are then updated using DE, and finally, the local RBFN model is employed for filtering to expedite the optimization process.
6. Experimental Results and Discussion
To verify the performance of DE, CSAEA, and GLSADE in solving the production optimization problem of offshore oil fields, tests are conducted using two offshore actual oil fields. It is worth noting that the heterogeneity of offshore oil reservoirs is relatively strong, and generally, their well networks are sparse, resulting in higher development costs. The two reservoirs selected in this article are both offshore oil reservoirs with low permeability and high-water content, and low development potential. One of them is optimized in a moderate dimension, while the other is set as a high-dimensional optimization problem (the specific dimensions will be detailed below), which corresponds to the two main types of problems addressed in the field of reservoir development. Conducting case studies on them can effectively demonstrate the performance of the algorithms. All simulation experiments in this paper are carried out using ECLIPSE as the numerical simulator. The stopping criterion for testing on the two reservoir cases reaches 1000 evaluations of the numerical simulator. The initial sample size N for all methods is set to 200, which also serves as the training phase for the surrogate models of CSAEA and GLSADE. The mutation factor F is set to 0.5, and the crossover factor C is set to 0.5. The number of top-ranked samples n in GLSADE is set to 200.
Example 1: The offshore actual reservoir model A. The first case tested in this paper is model A, which has
grid blocks, and the grid size is
. This reservoir is developed using water flooding, with a total of three water-injection wells and nine production wells. The locations of these wells and the permeability of the model field are shown in
Figure 3. The properties of the A model are presented in
Table 1. The three water-injection wells are controlled by injection rates, with the lower limits set at 20, 200, and 100 STB/D, respectively, and upper limits set at 100, 978, and 1200 STB/D, respectively. The nine production wells are controlled by production rates, with the lower limits set at 20, 10, 60, 20, 100, 90, 15, 70, and 15 STB/D, respectively, and upper limits set at 160, 38, 330, 73, 220, 296, 260, 290, and 93.8 STB/D, respectively. The reservoir production period is 720 days, divided into four control steps, each spanning 180 days on average. Therefore, the total number of decision variables is
. The oil revenue, water injection cost, and water treatment cost are USD/STB 80, 2, and 2, respectively. The discount rate is 0%.
Figure 4 shows the optimization results obtained from independently running the three methods on A model for 10 iterations. As shown in
Figure 4, DE exhibits the poorest performance. This is because it lacks a surrogate model to assist in optimization and relies solely on evaluating all candidate development schemes using the numerical simulator, greatly increasing computational burden and resulting in fewer optimization iterations. The CSAEA methodology employs surrogate models to expedite the optimization process, thereby achieving superior results compared to DE. This observation aligns with the research conducted by Golzari et al. [
24], affirming that the utilization of surrogate models enables significantly more iterations within a finite timeframe, facilitating the exploration of optimal outcomes. However, CSAEA’s reliance on a single surrogate model for the search process constrains its capacity to rapidly pinpoint the optimum solution. In contrast, GLSADE leverages multiple surrogate models to balance exploration and exploitation, resulting in superior solutions within the same iteration period, corroborating the understanding posited by Chen et al. [
29]. GLSADE employs a global surrogate model to preliminarily screen advantageous development schemes, and a local surrogate model to conduct in-depth exploration of high-potential regions. This approach markedly augments the probability of identifying the optimum solution, thereby yielding optimal performance.
Figure 5 and
Figure 6 show the optimal control schemes for producers and injectors obtained using the three methods. Each row represents the control scheme of a particular well over the entire optimization period, with each box indicating the control scheme of a particular well for a specific control step.
Figure 7, respectively, shows the variations in cumulative oil production (FOPT), cumulative water injection (FWIT), and cumulative water production (FWPT) over the development time under optimal well control. The well control scheme of DE provides the lowest cumulative oil production, the highest cumulative water production, and cumulative water injection, thus resulting in the lowest economic benefits. The well control scheme of GLSADE can provide the highest cumulative oil production while not necessarily resulting in the highest cumulative water production or injection. Therefore, the ultimate economic benefits are the highest.
Example 2: The offshore actual reservoir model B. The second case tested in this paper is model B, which has
grid blocks, and the grid size is
. This reservoir is developed using water flooding, with a total of two water-injection wells and six production wells. The locations of these wells and the permeability field are shown in
Figure 8 of the model. The properties of the B model are presented in
Table 2. The two water-injection wells are controlled via injection rates, with lower limits set at 300 and 300 STB/D, respectively, and upper limits set at 627 and 677 STB/D, respectively. The six production wells are controlled via production rates, with lower limits set at 20, 10, 20, 20, 10, and 30 STB/D, respectively, and upper limits set at 230, 61, 240, 113, 50, and 150 STB/D, respectively. The reservoir production period is 720 days and divided into 20 control steps, each spanning 36 days on average. Therefore, the total number of decision variables is
. The oil revenue, water injection cost, and water treatment cost are USD/STB 80, 2, and 2, respectively. The discount rate is 0%.
Figure 9 shows the optimization results obtained from independently running the three methods on B model for 10 iterations. The experimental result similar to those exhibited using the A model shows that DE still performs the worst due to the absence of using a surrogate model, hence, it is unable to expedite the solving process. In the experiments of B model, it can be observed that GLSADE’s performance is significantly superior to CSAEA. The variable dimension in B model reaches 160, indicating a large-scale optimization problem. From the experimental results, it can be inferred that GLSADE is more suitable for offshore oilfield production optimization problems with a large number of wells or time steps compared to conventional surrogate optimization methods. This is in line with other research on multi-surrogate models [
31,
32], where the synergistic coupling of multiple surrogate models significantly enhances the global optimization capability for complex problems with large search spaces. This approach circumvents premature convergence to locally optimal solutions at certain times, thereby achieving more optimal outcomes over the entire development cycle.
Figure 10 and
Figure 11 show the optimal control schemes for producers and injectors obtained using the three methods.
Figure 12 illustrates the curves of FOPT, FWIT, and FWPT over time for the three methods. It can be seen from
Figure 12 that although GLSADE injects more water and produces more water, it can yield the highest cumulative oil production, thereby achieving the maximum economic benefit.