1. Introduction
This paper addresses the application of machine learning (ML) methods to making accurate and relevant predictions of slurry flow behavior. Slurries are complex multi-phase systems studied actively from a physical perspective for >70 years. Flow regime prediction is inexact, generally relying on semi-empirical correlations that have been fitted to different data sets, which are expensive and non-trivial to gather. These regime predictions are used to make design decisions for pipelines and other transport applications where errors are costly. In applying ML methods in any mature industrial or scientific field one has two choices: (i) start from scratch with no prior knowledge; (ii) incorporate existing knowledge. This second approach is that used here. Thus below, we review both relevant slurry flow fundamentals and ML applications in this domain.
Pipe flows of slurries are commonly encountered in the mining industry (slurry transport) and in oil and gas well operations: hole cleaning, hydraulic fracturing and gravel packing. Here we deal with slurry applications in well drilling, where the relevance of multi-phase flow has long been recognized [
1]. Horizontal slurry flow of sand and a Newtonian/non-Newtonian carrier liquid in a pipe geometry is encountered widely in horizontal wells, and also in gathering and transition lines. In drilling engineering, a variety of models are used for cuttings transport. At the lowest level, these simply compare the pumping velocity with a typical particle settling velocity. More sophisticated models [
2,
3,
4] consider flows with a layered structure in the well cross-section, consisting of settled beds with mobile suspension flows above.
These mechanistic models are analogous to those developed earlier to predict slurry transport in the mining industry. Starting from the early work of Durand and Condolios [
5], the importance of flow regimes was immediately recognized. Turian and Yuan [
6] proposed four different semi-empirical correlations, based on more than 2800 experimental data points, to determine the slurry friction factor in the four flow regimes. This is probably the most comprehensive empirical correlation developed to date. However, an underlying criticism of [
6] is that the frictional pressure does not represent the underlying physical balance leading to a solids bed. Transition between the four flow regimes has historically formed one major axis of research on slurry transport. These are typically known as transition velocities, the most important one being the deposition or critical velocity, which defines the onset of a stationary bed at the bottom of the pipe. In applications it is crucial to determine the most efficient pipe size to handle a variety of flow conditions throughout lifetime of the field/pipeline. Accurate prediction of the key flow parameters such as critical velocity, flow regime, and pressure drop has significant impact on such design decisions [
7,
8].
There are many empirical and semi-empirical models and correlations for predicting the critical velocity, e.g., that of Oroskar and Turian [
9] and Kokpinar et al. [
10]. One of the significant features of the critical velocity is that it corresponds to the minimum frictional pressure drop in the slurry flow, which has even motivated predictions based on this feature, e.g., see [
11]. Likewise, many models and correlations have been developed for prediction of frictional pressure drops, e.g., [
12,
13] for heterogeneous flow, and [
14,
15] for the bed-load regime. Additionally, many comprehensive layered models were developed more recently to predict both critical velocity and frictional pressure drop in different regimes, e.g., the two layer model of Gillies et al. [
3] and the modified three layer model of Sarraf Shirazi and Frigaard [
16].
All the above models are to some extent a combination of phenomenological and mechanistic approaches: once the flow regime is phenomenologically defined this enables a more accurate mechanistic model. However, even targeted mechanistic models are limited by the physical complexity of the actual flows. This suggests that a more data-driven (ML) approach might be effective. In this context, Osman and Aggour [
17] developed an artificial neural network for prediction of the frictional pressure drop of a slurry flow in horizontal and near horizontal pipes. The accuracy of their model outperformed that of the existing correlations compared. Ulker and Sorgun [
18] used four different machine learning algorithms including k-nearest neighbor (kNN), support vector regression (SVR), linear regression, and ANN to estimate the sedimentation bed height inside a wellbore with and without drill pipe rotation. They found that ANN provided slightly better performance compared to other models. Azamathulla et al. [
19] used adaptive neuro-fuzzy interference system (ANFIS) and gene-expression programming (GEP) for prediction of the pressure drop. Their results showed that the ANFIS model led to better performance compared to GEP and existing correlations. Lahiri and Ghanta [
20] developed a hybrid SVR and genetic algorithm (GA) technique for prediction of the slurry frictional pressure drop, where GA was used for efficient tuning of SVR hyper-parameters. Their developed model accuracy outperformed that of all the existing correlations.
While the above ML methods have produced positive results for slurry transport in the past 2 decades, the picture is incomplete. First, the estimation of pressure drops only cover the heterogeneous regime. This is a practical drawback: not only are the methods limited to prediction in one regime, but one needs prior knowledge of the flow regime, which is not always the case in practice. Secondly, no dimensional analysis was performed before feeding the parameters as inputs to the algorithm. This necessarily means that there is significant redundancy methodologically. In this study, we address both issues and give a complete model. The key novelty of our approach is that we work with the known physical structure of slurry flows. First we use dimensional analysis to eliminate redundancy in variables. Second we integrate 2 models to mimic the physical studies: (a) a model to predict the regimes and transition; (b) knowing the regime, we predict pressure drop. This improves the accuracy in a physically consistent way.
An outline of the paper is as follows. Below in
Section 2 we outline the dimensional analysis and the development of the features as inputs to our models for critical velocity and frictional pressure drop.
Section 3 provides a brief background on the ANN and SVR models, and discuss the corresponding important hyper-parameters for each model that need to be tuned for the propose of training. In
Section 4 we introduce our modeling and training approach in detail, specially for developing the integrated model for prediction of the slurry friction factor using our knowledge of the flow regime.
Section 5 provides the acquired experimental data from the literature, and the detailed results produced by our model with the comparison against the well-known correlations in the literature.
4. Modeling Approach
The purpose of this study is to develop learning models using ANN and SVR algorithms for prediction of the critical velocity and frictional pressure drop of slurry flow in pipe geometry. For critical velocity, we use the four dimensionless features,
,
s,
, and
developed in
Section 2.1 as inputs to develop the above mentioned learning algorithms with satisfactory generalizability. However, for the prediction of frictional pressure drop we also need to understand the effect of the slurry flow regime on the friction factor.
Figure 3 shows a schematic of the frictional pressure drop as a function of the mean slurry velocity for different flow regimes. The slurry flow regime is governed by the competition between the turbulent eddies and the particle settling tendency due to gravity. The former tends to suspend the solid particles in the carrier liquid while the latter drives the particles to settle at the bottom of the pipe. The frictional pressure drop of a slurry flow depends on the different existing stresses and forces whose nature and strength strongly depend on the flow regime [
6,
16].
At low flow rates, the turbulent eddies are not strong enough to suspend the solid phase. As a result, a considerable portion of the pipe is occupied by stationary sedimentation bed above which there is a heterogeneous layer with a recognizable solids concentration gradients. This regime of the slurry flow is also referred to as the bed-load regime. As observed in
Figure 3 the frictional pressure drop decreases with the mean velocity in this regime. This is explained by the fact that at low velocities, the stresses and forces are dominated by the solids phase, that are weakened as the slurry velocity increases.
As the mean superficial velocity increases, the turbulent eddies become more capable of suspending the solids until all the static bed layer is eroded, and there is a moving bed layer at the bottom of the pipe whose concentration is close to maximal packing. As the flow rate is further increased, we reach the heterogeneous or fully suspended regime where there is a solid concentration gradient in the direction of gravity. At extremely high flow rates, turbulent eddies become significantly more dominant and the solid phase becomes progressively more homogeneously distributed in the carrier liquid. As shown in
Figure 3 the frictional pressure drop increases with the mean velocity through saltation flow, heterogeneous, and homogeneous regimes. Furthermore, the pressure drop increase rate is also increasing at higher velocities, as the liquid phase role becomes more dominant in the suspension stresses.
As noted above, the frictional pressure drop behavior is noticeably affected when the regime changes from the bed-load to saltation flow, i.e., at the critical velocity. Therefore, we can introduce this prior knowledge to our predictive modeling approach.
Figure 4a,b shows the work flow chart for developing our predictive models. We develop two separate learning models with satisfactory accuracy and generalization capability for the bed-load and heterogeneous flow regimes according to the work-flow chart illustrated in
Figure 4a. For this task, we also need to train the two mentioned models with separate datasets representing the corresponding regimes. For examining the generalizability of the developed predictive model for frictional pressure drop, we primarily check what the flow regime is, based on the developed model for critical velocity. Subsequently, we feed the six dimensionless parameters (see
Section 2.2) as features to the corresponding predictive learning model for frictional pressure drop prediction. This procedure is illustrated in
Figure 4b for further clarification of our integrated method scheme for prediction of the slurry friction factor. Consequently we have a dataset for critical velocity, and two distinct datasets for frictional pressure drop: in bed-load regime and the rest of the regimes.
We develop the most suitable ANN and SVR predictive models for each of the three datasets via grid search among their corresponding hyperparameters. The chosen hyperparameters of ANN for tuning include the architecture of the network, i.e., the number of hidden layer(s) and neurons in each hidden layer, activation function, number of training epochs, and learning rate, and the ones for SVR include C, , kernel type, and kernel parameter (polynomial degree for polynomial function, and for the radial basis function). Then we pick the one with the best validation score as our ultimate proposed model. For the purpose of model development we take 80% of each dataset randomly as the training set and the remaining 20% as the test set. We perform 5-fold cross validation on the training set to examine the generalization capacity of the model on the data that it did not get trained on. The best model with specific sets of hyperparameters is chosen based on this validation score.
5. Results and Discussion
As the magnitude of the input features are significantly different, the data should be normalized before being fed to the training algorithms. If the inputs are of different scales, the weights connected to the inputs with larger scales will be updated much faster compared to others, which can considerably hurt the learning process. On the other hand, there are also a variety of practical reasons why normalizing the inputs can make training faster and reduce the chances of getting stuck in local optima. We use the standard normalization as follows:
where
is the normalized input of the
sample, and
and
are the mean and standard deviation of the data points in training set. The output is also normalized in similar way as in (
19).
Table 2 shows parameters of 100 experimental data points collected from the literature, measuring critical velocity, which we use to train and test our proposed models. Additionally,
Figure 5a–e demonstrate the estimation of the probability density function and box plot of all the input features along with the output, which provides insightful information about the distribution and statistical parameters of dataset. Each data point is the result of an experimental test by the listed authors, performed in different flow loop facilities. As can be observed, these experiments cover a wide range of particle sizes
, pipe diameters
, mean solids concentrations
, and also different density ratios
. Most of the data are taken from the measurements conducted by Kokpinar et al. [
10] who used coarse particles, with different materials to also see the effect of
s on the critical velocity. They used sand, coarse sand, coal, blue plastic, black plastic, fine tuff, and coarse tuff with specific densities of
respectively.
We train and obtain a validation score (loss) on the randomly chosen 80 data points (training set) and report the out of sample results on the remaining 20 data points.
Table 3 shows the the optimum hyperparameters for SVR algorithm,
Table 4 shows parameters of experimental data points collected from the literature, and
Table 5 shows the optimum hyperparameters for ANN algorithms.
Table 3 and
Table 5 show their corresponding validation loss respectively. It should be noted that the validation loss refers to the average mean squared error obtained by the 5-fold cross validation. As observed, the optimum SVR model outperforms ANN in terms of the validation score and hence the generalization capability. Therefore, the SVR model is chosen as the ultimate prediction model for critical velocity.
Table 6 shows the performance of the chosen model on training and test sets, in terms of the average absolute relative error (AARE), the cross-correlation coefficient (R), and the standard deviation of error (
). We can compare the proposed model performance against the most widely used predictive correlations in literature brought in
Table 1. The out of sample average absolute relative error of these models are 0.099, 0.153, 0.308, 0.322, 0.412, and 0.447 corresponding to the prediction of the proposed SVR model, Kokpinar et al. [
10], Oroskar and Turian [
9], Durand [
11], Yufin [
22], and Zandi et al. [
13] respectively. It is evident that the prediction error of critical velocity has reduced considerably in the present work.
Figure 6 shows the parity plot of the experimentally measured and predicted results of the dimensionless critical velocity for the training and test sets with the AARE of 0.073 and 0.099 respectively.
We have also directly compared the performance of our model with that of Kokpinar et. al. [
10] with their own 42 experimental data points.
Figure 7 shows the parity plot of the corresponding predictions versus the measured dimensionless critical velocities. The AARE of estimations are 0.142 and 0.062 for Kokpinar et al. [
10] and present models respectively. As observed in
Figure 7 the present model performs better in particular where
.
Figure 8 illustrates the effect of hyperparameter
C on the loss function (mean squared error) of the training, validation, and test sets. As was mentioned in
Section 3.2,
C determines the trade-off between the flatness of the hypothesis function, and the degree up to which deviations larger than
are tolerated in SVR algorithm. In practice it also has regularization effect such that the lower the value of
C, the more the objective function is regularized. As seen in
Figure 8 there is an optimal
C where the loss function is minimized in validation and test sets, lower than which the hypothesis function suffers from high bias (under-fitting) and higher than which it suffers from high variance (over-fitting). Obviously, the values of all hyperparameters including
C are chosen based on the validation score.
Table 4 shows parameters of experimental data points collected from the literature, measuring frictional pressure drop for heterogeneous and bed-load regimes, which we use to train and test our proposed models. The total number of experimental data points are 365 and 125 for the heterogeneous and bed-load regimes respectively. As can be observed, the experiments mostly used fine particles except for the Doron et al.’s data [
38] and part of Durand’s measurements in bed-load regime [
5], where particle sizes of
mm and
mm were used respectively. Pipe diameters of the range
were used in the experiments with flow velocity range of
, and mean delivered solids concentration of
. Most of the experiments were conducted using sand particles with the density ratio of
except for Doron et al.’s work where General Electric “Black Acetal” with the density ratio of
was used [
38].
Figure 9a–g show the kernel density estimation and box plot of all the input features along with the output for both heterogeneous and bed-load regime datasets. An interesting observation is that the distribution of
and
are considerably different comparing the two regimes. The reason is that according to (
4) and (
5) both of these dimensionless variables include the term
in their equations, and we know that the mean slurry velocity in the bed-load regime is less than that of the heterogeneous regime. Therefore,
is considerably lower while
is larger in bed-load regime compared to the heterogeneous regime.
Similar to the critical velocity case, we randomly take 80% of the dataset for the purpose of training and validation, and the rest 20% as the test set for evaluating the out of sample performance. As can be observed from
Table 3 and
Table 5 the most efficient developed ANN models are outperforming SVR for both heterogeneous and bed-load regimes.
Figure 10a,b show the parity plot comparing the measured and predicted slurry friction factor for both regimes. The corresponding out of sample results are illustrated in
Table 6.
For a fair comparison against the existing correlations and models from literature, we also need to investigate the integrated method performance in terms of predicting the frictional pressure drop. In other words, we would like to determine the out of sample error where we ignore the prior knowledge of the flow regime, which can be the case in real-life scenarios specifically for industrial applications. To serve this purpose, we feed each data point to the developed SVR algorithm for critical velocity prediction, and compare the predicted result with the mean slurry velocity as a means to identify the regime. For this process the key assumption is
at the critical velocity which is a reasonable assumption to make. After the regime identification, we feed the data point to the corresponding model for predicting the frictional pressure drop. The out of sample results for integrated method prediction is shown in
Table 6.
Once again, we can compare the out of sample AARE against that of some recognized correlations and models available in literature for predicting the pressure drop. For slurry friction factor prediction in heterogeneous regime, the AARE of the correlations developed by Zandi and Govatos [
13], Durand and Condolios [
5], and Turian and Yuan [
6] are 0.643, 0.449, and 0.348 respectively, whereas for the bed-load regime, the AARE of the proposed models by Gruesbeck et al. [
14], Penberthy et al. [
15], and Turian and Yuan [
6] are 0.837, 0.769, and 0.529 respectively. It is clear that the prediction performance of the current study with AARE of 0.084 significantly outperforms that of the mentioned models.
Figure 11 illustrates the effect of the epoch number, a key hyperparameter for ANNs, on the loss function of the training, validation, and test sets for the heterogeneous regime ANN model. As can be observed, there is an optimal epoch number for training, after which the validation loss starts to increase. In other words, after around 400 training epochs the model is over-fitting on the training dataset.
To investigate whether the proposed integrated method is indeed required for the purpose of a satisfactory prediction for frictional pressure drop, we have also performed a batch train using all of the 490 frictional pressure drop data points, without any supervised or unsupervised classification based on the flow regime. We trained and tested another learning model under the mentioned condition with the similar procedure as other developed models.
Table 3 and
Table 5 show that the SVR model performance is more satisfactory compared to ANN in terms of generalization capacity.
Figure 12 illustrates the corresponding parity plot for the measured slurry friction factor against the predicted values.
For comprehending and comparing the performance of the batch-trained model with the integrated method, the corresponding parity plots are shown in
Figure 13a,b.
Figure 13a shows the measured and predicted slurry friction factor for the integrated method. As could be observed, there are four heterogeneous data points whose regime was incorrectly classified as bed-load (blue squares), and three bed-load data points with false classification. As shown, the predicted slurry friction factor for misclassified heterogeneous data points tend to be higher than the measured value, whereas the reverse is true for the misclassified bed-load data points. The reason is that generally, the value of slurry friction factor in bed-load regime is more than that of the heterogeneous regime. However, the out of sample results of the integrated method is more satisfactory compared to the batch-trained model, with the AARE of 0.084 for the former and 0.155 for the latter, as shown in
Table 6. Consequently, it can be comprehended that although the integrated method prediction highly relies on the performance of regime classification, i.e., the SVR model for critical velocity prediction, it is considered to be more efficient in practice to predefine a regime classification method, such as the one accomplished in this work, prior to feeding it to the model for satisfactory prediction of the friction factor.