1. Introduction
In the middle and late stages of a natural gas field’s development, the reduction in formation pressure often renders the gas incapable of autonomously removing the accumulated formation water. This frequently results in a decline in the production capacity of the gas wells, potentially leading to their abandonment [
1]. Therefore, to effectively address this issue, plunger lift technology has been widely adopted as a cost-effective artificial lift method [
2]. This technique involves deploying a plunger in the wellbore to facilitate efficient gas-liquid separation and liquid discharge [
3].
Current research on gas-lift plunger performance primarily employs experimental measurements and numerical simulations, both of which have inherent limitations. Experimentally, the process involves high costs and complex equipment setup, often requiring on-site trials. Experiments are constrained by actual gas well conditions, making it difficult to replicate various operating conditions, thereby limiting the general applicability of the results. Additionally, experiments are time-consuming, especially when repeated under different conditions. Due to environmental influences on measurement, experimental results often contain significant errors. Particularly in gas-lift plunger experiments, the small gap between the plunger and the pipeline makes it challenging to measure physical quantities within the gap, which are crucial for characterizing the plunger’s flow field.
For numerical simulations, the reliance is on established models and assumptions, which may not fully reflect real-world scenarios. The flow field around a gas-lift plunger, characterized by high velocities, large gradients, small gaps, and curved streamlines, complicates the flow, necessitating the use of appropriate turbulence models to accurately describe turbulence structures and their impacts. The Reynolds-averaged Navier-Stokes (RANS) method, blending theory and empiricism, is a viable simulation technique. However, its turbulence model accuracy relies on specific model constants, usually calibrated with standard flows like flat plate boundary layers and free shear flows, which may be inadequate for predicting complex flow fields in gas-lift plungers [
4].
Traditional computational fluid dynamics (CFD) techniques, such as Direct Numerical Simulation (DNS) and Large Eddy Simulation (LES), can effectively model gas flow. However, their requirement to handle high Reynolds number turbulence necessitates substantial computational resources, which in turn limits their practical engineering applications [
5]. Currently, the Reynolds-averaged Navier-Stokes (RANS) equations remain the mainstream choice for engineering applications due to their favorable balance between computational cost and predictive accuracy.
Given these limitations, there is a clear need to develop more accurate and efficient modeling techniques for gas-lift plunger systems. Accurate modeling not only enhances the fundamental understanding of near-wall turbulence and flow resistance within narrow annular gaps but also enables the optimization of plunger geometry and operating parameters through numerical tools. Furthermore, improved modeling fidelity provides a basis for digital twin technologies and intelligent well management systems.
In geosciences, data assimilation (DA) has been developed as a technique to address potential shortcomings in experimental and numerical simulations, establishing it as a fourth central analytical approach alongside theoretical, experimental, and numerical methods [
6]. DA is a method that integrates mathematical models with observational data, considering the uncertainties of both. It combines measured data with predictive models to adjust the model’s evolution, optimize performance, and enhance accuracy. Data assimilation was initially proposed by Lewis Fry Richardson to optimize weather forecasting [
7]. Subsequently, this approach has been applied to system monitoring, hydrology, and geology, among other fields [
8].
This technique pioneers a novel approach to turbulence research, facilitating more precise simulations and predictions of turbulence behavior and offering more reliable guidance for engineering applications. Kato et al. [
9] developed a data assimilation method based on the ensemble transformed Kalman filter (ETKF) and successfully predicted the inlet angle of attack and the Mach number for the turbulent flow field around an airfoil. Deng et al. [
10] optimized the constants in four turbulence models using an EnKF-based DA technique. Analyzing Particle Image Velocimetry data from a free circular jet flow showed significant improvements in flow field predictions across all models after optimization, with the k-ε model exhibiting the best performance. Zhang et al. [
11] conducted an in-depth study on the applicability of the hybrid data assimilation method (EnVar) in turbulence modeling. From the perspective of Bayesian inference, the research optimized RANS simulations for a converging-diverging channel. Experimental results demonstrated that the ensemble-based variational method effectively inferred unknown parameters under both small-sample conditions (D = 20) and large-scale systems (D = 2400). He et al. [
12] employed a data assimilation approach, utilizing the Ensemble Kalman Filter (EnKF) combined with NASA’s PIV experimental data to optimize the constants of the Spalart-Allmaras (SA) turbulence model. The optimized model improved the accuracy of jet flow field predictions, reducing the relative error in the horizontal direction behind the nozzle from 13.04% to 4.6%. Fang et al. [
13] used the Ensemble Kalman Filter (EnKF) algorithm to recalibrate the constants of the SST k-ω turbulence model, significantly improving the accuracy of flow characteristic predictions for steam valves with filters. The recalibrated model constants are suitable for conditions with similar valve openings but exhibit higher errors when the opening differences are large. The optimized model constants better capture the turbulence characteristics of the internal flow. Liu et al. [
14] developed a data-driven framework using a two-step EnKF method to predict compressor cascade flow fields. Validating with a test function, they applied S-A and SST models to the MAN GHH cascade, achieving predictions consistent with experiments. Data assimilation reduced errors by nearly 70%, improved separation bubble predictions, and showed low dependency on turbulence models. Yu et al. [
15] utilized the EnKF DA technique to optimize the SST k-ω turbulence model constants for enhancing heat transfer predictions in plate heat exchangers. By assimilating experimental data from various operational conditions into the computational model, the study achieved significant reductions in prediction errors. The calibrated model constants enabled more accurate simulations of the complex flow and heat transfer behaviors within plate heat exchangers. This optimization, validated by experimental data, substantially enhanced the accuracy of flow predictions, offering a more precise method for analyzing the flow characteristics of the control valve. Meng et al. [
16] optimized S-A turbulence model parameters using the Ensemble Kalman Filter (EnKF) for three wind turbine airfoils (NACA63415, S809, DU97W300) under stall conditions. The optimized parameters reduced simulation errors, improved flow separation predictions, and showed that S809’s parameters could be applied to other airfoils with slightly higher errors. Key parameters C
b1, C
u1, and σ changed significantly, with C
b1 being the most critical.
Despite extensive research in related fields, studies on data assimilation for air-lift plungers remain scarce, particularly regarding the scalability of assimilated models. This paper employs the Ensemble Kalman Filter (EnKF) algorithm to recalibrate turbulence model constants through data assimilation techniques. By integrating experimental data with numerical simulations, the study evaluates the scalability of various plungers under different operational conditions. A comparative analysis is conducted to examine the flow characteristics and states of the gas-lifted plunger gap flow before and after model optimization. It is crucial to emphasize that the assimilation process requires high-fidelity experimental data. In this study, gas is used as the sole flow medium, with the flow rate during plunger hovering serving as the observational data. This approach aims to investigate the gas flow characteristics as it passes through the plunger gap, ultimately providing a more accurate and practical theoretical foundation for enhancing plunger lifting techniques.
Furthermore, the findings of this study are expected to contribute to the broader field of turbulent flow modeling by extending data assimilation techniques to gas–liquid two-phase systems, while also offering technical insights that may enhance the efficiency, reliability, and design optimization of plunger lift operations in low-pressure gas wells.
2. Mathematical Principles
2.1. Turbulence Modeling Equations
Due to the irregular rod-like external structure of the plunger, the fluid dynamics around it become highly complex. This complexity stems not only from the plunger’s geometry but also from the fluid flow behavior both around and within the plunger. To effectively simulate and analyze this intricate flow scenario, the RNG k-ε model, an enhanced version of the standard k-ε model, provides an ideal solution. By incorporating additional correction terms, this model can accurately handle strong vortex flows and flow distortions around curved surfaces, thereby improving the precision of turbulence simulations. The RNG k-ε model excels in simulating flow fields both inside and outside the plunger, effectively capturing subtle flow variations and vortex structures [
17].
RNG k-ε model equations:
In these equations,
Gk represents the turbulent kinetic energy produced by the mean velocity gradient
is the turbulent kinetic energy generated by buoyancy.
represents the contribution of pulsating expansion to the total dissipation rate in compressible turbulence.
represents the source term defined by the use.
The reciprocal of the effective Prandtl number, , is analytically determined using RNG theory, with .
In Fluent, the default constants of the RNG k-ε turbulence model are modifiable, including: .
2.2. Ensemble Kalman Filter
The Ensemble Kalman Filter (EnKF) is a DA method designed for state estimation in nonlinear systems. This method was proposed by Evensen et al. [
18] to address the limitations of the traditional Kalman Filter (KF) when applied to nonlinear systems. The EnKF uses an ensemble of samples to represent the probability distribution of the system’s state and adjusts the samples to more closely align with experimental observations by calculating their covariance matrix and the Kalman gain [
19].
The subject of study is represented by a set of state variables, denoted collectively as the vector . Experimental observations are represented as the vector . The algorithm assumes a specific relationship exists among state variables, experimental observations, and predictive models.
The state-space model describing the relationship between the experimental observations and prediction model is:
In the formula:
: prediction model function.
: initial state vector of the samples.
: prediction model error (neglected in this study).
: observation matrix mapping the state vector to observation space.
: experimental observation error.
- (1)
Forecasting process
During the prediction process, the state parameter vector within each ensemble member will be iteratively computed using the RNG k-ε model starting from the initial state until the numerical simulation of turbulence converges.
The updating of the state parameters follows the following equation:
Here,
F represents the governing equations of the RNG k-ε turbulence model, and the state parameters of each ensemble member are expressed in the following form:
Here, Q denotes the gas volumetric flow rate predicted by the RNG k-ε model, and A is the vector of turbulence model constants, expressed as: A = (Cu C1ε C2ε Prt)T. The superscript i denotes the index of the ensemble member.
Mean value of the ensemble members:
where
represents the total number of set members, and
denotes the average value of the set members.
- (2)
Analysis process
The analysis phase is a critical step in the ENKF algorithm. In this phase, the Kalman gain is determined, and the ensemble members are updated by synthesizing the uncertainty of observational data with the statistical information of the ensemble members. The specific process is as follows:
Prediction error analysis:
Updating collection members:
Mean value of the corresponding new set member:
For highly nonlinear models, a relatively accurate prediction is often not achieved by performing the analysis process only once. Consequently, this study employs a multi-iteration EnKF algorithm for both prediction and analysis. By iterating multiple times, the uncertainty of observational data and the statistical properties of the ensemble members can be more comprehensively incorporated, thereby enhancing the precision of predictions. At each iteration, the convergence of the algorithm is assessed based on the standard deviation of the ensemble members and the maximum number of iterations, which is set to 10,000. Utilizing this multi-iteration EnKF algorithm enables a more accurate approximation of the system’s actual state, improves prediction accuracy, and ensures algorithmic convergence.
2.3. Model Calibration Process
This paper uses the EnKF in the data assimilation method to calibrate the model’s constants. In this method, we define an ensemble state matrix of the following form:
Here, k denotes the number of state variables, which is 5 in this study.
The experimental flow rate observation data are as follows:
represents the experimentally measured gas volumetric flow rate under specific conditions.
The observation matrix for data assimilation is as follows:
where
W is the observation error derived from the uncertainty of the measurement instruments.
The mapping function matrix is as follows:
Here, 1m×n denotes an m × n matrix with all elements equal to 1, In represents the identity matrix of order n, and 0m×n indicates an m × n matrix with all elements equal to 0.
In this study, the observation matrix
H is employed to extract the predicted gas volumetric flow rate
Q from the state vector and project it onto the observation space, corresponding to the experimentally measured volumetric flow rate. The observation matrix is explicitly defined as follows:
This indicates that only the first variable of the state vector (the gas volumetric flow rate) is directly compared with the experimental observations.
In this study, 100 samples, corresponding to the total number of ensemble members (L = 100), are generated through Latin hypercube sampling and selected for calibrating the turbulence model constants. The predicted flow rates for these 100 samples under specific operating conditions are obtained using RNG k-ε model calculations. According to the integration method in Equation (17), these 100 sets of predicted flow rates serve as the initial state matrix for the algorithm. Subsequently, the initial state matrix is updated through several iterations of the ensemble Kalman filtering algorithm, using experimentally observed flow data to derive the optimal model constants calibrated through data assimilation. Finally, the newly calibrated model constants are applied under various operating conditions to evaluate their reliability and applicability. The entire process is illustrated in
Figure 1.
3. Experimental Procedure and Equipment
In the study of gas-lift plunger flow fields, conventional flow measurement techniques are difficult to apply due to the extremely narrow clearance between the plunger and the tubing wall (typically on the millimeter scale) and the confined, compact structure. Although hot-wire/hot-film anemometry (HWA/HFA) [
20] offers fast response, the probe is relatively large and tends to disturb the flow. Particle Image Velocimetry (PIV) and Particle Tracking Velocimetry (PTV) require transparent test sections and laser illumination, and the tracer particles are significantly affected by wall adhesion effects and plunger motion. Laser Doppler Velocimetry (LDV), while highly accurate, can only provide point measurements and also relies on optical access, making it difficult to capture the overall flow structure within the narrow, complex clearance [
21].
Therefore, in this study, the flow rate observed when the gas-lift plunger is suspended in the well is used as the measurement input, and the flow characteristics are then inferred through data assimilation.
3.1. Calibration of Turbulence Modeling Constants
The experimental apparatus consists of core components, an auxiliary system, and an observation system, primarily constructed from plexiglass and high-strength stainless steel.
The core components include a steel frame, tubing frame assembly, tubing components, and plungers with various slot geometries. The entire structure is built on a 7-m-high steel framework, with its top securely anchored to the ceiling and wall to mitigate structural vibrations, while the base is firmly fixed to the ground using four sets of anchor bolts. The tubing frame assembly comprises a blowout preventer, a well-bottom buffering device, and a plexiglass tubing section with an inner diameter of 60 mm and a wellbore length of 6 m.
The auxiliary system consists of an air compressor, a gas storage tank, and adjustable control valves. The observation system includes a gas flow meter positioned at the gas inlet and pressure transmitters installed at the gas inlet, well bottom, and wellhead.
The primary equipment parameters are presented in
Table 1, while the experimental setup for evaluating the sealing performance of the gas-lift plunger is illustrated in
Figure 2.
According to the optimal diameter ratio proposed by Zhao et al. [
22], the outer diameter of the plunger was set to 57.6 mm. Based on the findings of Wang et al. [
23], the rectangular slot, which exhibits the poorest sealing performance, and the arc-shaped slot, which demonstrates the best sealing performance, were selected as the sealing slot structures for the plunger’s exterior. The rectangular slot has a depth of 6 mm and a width of 12 mm, while the arc-shaped slot has a radius of 6 mm and is designed as a complete semicircle. The experimental plunger device, as illustrated in
Figure 3, is constructed from stainless steel, with each plunger weighing 3.5 kg.
3.2. Experimental Method
The objective of the experiment is to obtain the volumetric flow rate of gas when the plunger is in a suspended state at various positions within the wellbore, as well as the corresponding pressure readings at the flow meter, well bottom, and wellhead. The working fluid used in the experiment is air, with a temperature of 15 °C and operated under standard atmospheric pressure.
The sealing performance of the air-lifted plunger was evaluated as follows:
After verifying and confirming the integrity of the experimental apparatus, the required test plunger structure is installed. The plunger is introduced into the system through the top of the device, and the air compressor is activated to store compressed air, providing the necessary gas supply for the experiment.
A cushioning plug is installed at the top of the tubing to ensure the stability and safety of the plunger during the testing process and to prevent potential accidents.
The intake valve is opened, and the gas flow rate is gradually increased until the plunger begins to exhibit signs of activation. By precisely controlling the gas flow, the plunger is brought to a state of upward motion. The flow rate is then finely adjusted so that the plunger reaches and maintains a stable hovering position, as illustrated in
Figure 4. Once in position, the plunger is held in suspension for 10 s, during which the inlet flow rate, inlet pressure, bottom-hole pressure, and wellhead pressure are recorded. Data are collected at a sampling frequency of one measurement per second, and the average values during this steady-state period are subsequently calculated.
Based on the experimental data, the gas flow rate is corrected using the ideal gas law to calculate the volumetric flow rate under standard conditions (0 °C, 101.325 kPa):
: Gas volumetric flow rate under standard conditions (m3/h);
: Measured gas flow rate (m3/h);
: Measured absolute gas pressure (Pa);
: Standard atmospheric pressure (101,325 Pa);
: Measured absolute gas temperature (K);
: Standard temperature (273.15 K)
Adjust the opening of the outlet valve to control the hovering position of the plunger, and repeat step (3) multiple times.
- 5.
Replace the plunger and repeat steps 1–4.
3.3. Experimental Results
The measured experimental data are displayed in
Table 2, where one set is utilized for experimental data assimilation to optimize the turbulence model constants, and the remainder is used for validation.
4. Examples of Numerical Simulation Calculations
The geometric model and computational grid of the plunger lift model are shown in
Figure 4. The dimensions of the plunger and tubing are based on the experimental setup. The distance from the inlet to the plunger is 70 mm, and the distance from the outlet to the plunger is 150 mm. Considering the complexity of the flow field around the plunger, a Poly-HexCore mesh is used in this study. The Poly-HexCore mesh is an advanced mesh type designed to provide flexibility and efficiency compared to the traditional hexahedral core mesh. This meshing technique combines the advantages of polyhedral mesh and hex-core mesh, thus optimizing mesh quality and computational efficiency, especially for flow problems with complex structures or large size spans. After verifying mesh independence, a physical model with a mesh size of 6 million cells is used to improve computational efficiency while ensuring accuracy. The physical model is shown in
Figure 5.
In this study, the boundary conditions were determined based on the experimental conditions. The fourth set of experimental data was randomly selected for data assimilation, while the remaining sets were used to validate the assimilation results. The SIMPLE algorithm was adopted for pressure–velocity coupling, and the convective terms were discretized using the first-order upwind scheme. The simulation was considered converged when either the inlet flow rate remained unchanged for more than 2000 iterations or the maximum residual of all monitored variables dropped below 1 × 10
−5. The detailed settings are provided in
Table 3.
DA using the EnKF method requires the configuration of several key hyperparameters as follows: the number of ensemble samples, the range of adjustment coefficients, and the variance of specific Gaussian noise. Properly setting these hyperparameters is critical, as inappropriate values may lead to issues such as excessive computational costs, significant deviations of calibration results from experimental values, and instability in data assimilation outcomes. Therefore, hyperparameters must be carefully selected to ensure the accuracy and stability of the data assimilation process.
In this study, the optimized Latin Hypercube Sampling (OLHS) method was employed to generate random samples of four turbulence model constants in the RNG k-ε model. The sampling range for each parameter was set to ±20% of its nominal value, specifically 0.0676 ≤ Cu ≤ 0.1014, 1.136 ≤ C1ε ≤ 1.704, 1.344 ≤ C2ε ≤ 2.016, 0.68 ≤ Prt ≤ 1.02.
By evenly dividing each parameter range and optimizing the sample distribution, a total of 100 representative turbulence model sample sets were constructed. As illustrated in
Figure 6, the optimized samples were evenly distributed across the high-dimensional parameter space, effectively avoiding clustering and ensuring both the uniformity and representativeness of the sample set.
The 100 sets of turbulence model constants obtained through OLHS were individually input into Fluent for parameter modification, followed by numerical simulations to generate the forecast dataset required for the data assimilation process. In setting the boundary conditions, the bottom-hole pressure and wellhead pressure from the fourth experimental group were randomly selected and applied as the pressure inlet and pressure outlet, respectively, in the simulations. The predicted flow rates corresponding to each set of turbulence model constants were obtained through the simulations. Detailed results are provided in
Appendix A.
The observation noise, which is set with a mean of zero and follows a normal distribution with a standard deviation of 0.1, requires careful consideration when determining its variance. This is because the measurement bias includes both the uncertainty inherent in the experimental instrumentation and the biases arising from physical differences between the experimental setup and the predictive model.
6. Conclusions
This study applied the Ensemble Kalman Filter (EnKF) data assimilation technique to calibrate the constants of the RNG k-ε turbulence model, aiming to improve the predictive accuracy of gas flow through plunger lift systems. The integration of experimental observations with numerical simulations enabled significant reductions in prediction errors. Specifically, the mean square error (MSE) for the rectangular and circular arc plungers decreased from 60.67 and 61.48 to 7.12 and 7.20, respectively, demonstrating the method’s robustness and adaptability to different plunger geometries.
In addition to quantitative improvements, data assimilation revealed critical insights into the flow characteristics within the plunger gap. Notably, changes in the circumferential flow structures and turbulence intensity indicated that conventional models may overestimate sealing performance. The recalibrated model better captured these complex features, providing a more realistic representation of the underlying physics.
The findings highlight the effectiveness of data assimilation not only in enhancing model fidelity but also in improving the understanding of gas–liquid interaction mechanisms in narrow annular flows. This approach provides a new methodology and perspective for optimizing the geometric design of plunger slot structures, allowing for better evaluation of sealing performance and flow behavior under varying operational conditions.
From an application standpoint, the calibrated turbulence model offers practical guidance for the structural optimization of gas-lift plungers, particularly in the design and assessment of different sealing slot types. Future research will focus on extending the assimilation framework to incorporate multi-parameter uncertainty quantification and to explore more diverse plunger configurations in low-pressure gas wells.