Two-phase oil and water flow pattern identification in vertical pipes applying long short-term memory networks

Ruiz-Díaz, Carlos Mauricio; Quispe-Suarez, Brayan; González-Estrada, Octavio Andrés

doi:10.1007/s42247-024-00631-2

Two-phase oil and water flow pattern identification in vertical pipes applying long short-term memory networks

Original Article
Open access
Published: 12 February 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Emergent Materials Aims and scope Submit manuscript

Two-phase oil and water flow pattern identification in vertical pipes applying long short-term memory networks

Download PDF

302 Accesses
1 Altmetric
Explore all metrics

Abstract

Accurate biphasic flow pattern recognition is essential in the design of coatings for the oil and gas sector because it enables engineers to create materials that are tailored to specific flow conditions. This results in enhanced corrosion protection, erosion resistance, flow efficiency, and overall performance of equipment and infrastructure in the challenging environments of the oil and gas industry. The development of flow maps has been based on empirical correlations that incorporate characteristics such as superficial velocities, volume fractions, and physical properties such as the density and viscosity of the analyzed substances. In addition, geometric parameters such as the inclination and the internal diameter of the pipes are considered. However, due to the difficult working conditions on offshore platforms and the limitations in monitoring internal flow patterns, technological advances have been implemented to improve this process using artificial intelligence techniques. In this context, this study proposes using a long-term memory (LSTM) recurrent neural network to predict the flow patterns generated in vertical pipes. This LSTM network was trained and validated using data obtained from a literature database. The results obtained showed that the model has a prediction error of less than 1%. These technological advances represent an important step towards optimizing the flow pattern identification process in the hydrocarbons industry. By leveraging the capabilities of artificial intelligence, more accurate and reliable forecasts can be obtained, enabling informed decisions and improving the efficiency and safety of operations.

Dynamic prediction of moving trajectory in pipe jacking: GRU-based deep learning framework

Article 01 July 2023

Utilization of machine learning for the estimation of production rates in wells operated by electrical submersible pumps

Article Open access 22 February 2024

Prediction of the Incrustating Trend in Oil Extraction Pipelines: An Approach Based on Neural Decision Trees

1 Introduction

Multiphase flow transport must be considered in designing and optimizing many engineering systems, such as food processing equipment, fuel cells, and the petrochemical industry [1]. The latter has focused its current research on crude oil transport [2]. The optimization of production in oil fields occurs with the continuous monitoring of flow rates, temperature, and pressure of the fluid extracted from wells. Performance can be measured against extraction costs, which is possible with reliable data that provide a clear perspective on fluid transport. Based on the abovementioned concepts, studies have been developed to model multiphase flows in transport lines worldwide [3]–[4].

Predicting flow patterns in the oil and gas sector is essential to optimize operational efficiency, maximize production, ensure safety, comply with regulations, prevent equipment wear, and minimize environmental impact. It also facilitates the implementation of predictive maintenance strategies and contributes to continuous and reliable operation while reducing costs and improving risk management in a critical industry [5]. Coatings and thin films are often applied to surfaces in the oil and gas industry to protect equipment and infrastructure from corrosion caused by contact with corrosive fluids [6]. Understanding the biphasic flow patterns of these fluids can help design coatings that are optimized for specific flow conditions, ensuring effective corrosion protection. Moreover, solid particles suspended in the fluid can cause erosion of surfaces over time. Recognizing flow patterns helps design coatings that are resistant to erosive wear, extending the lifespan of equipment and reducing the need for frequent maintenance [7]. Certain flow patterns can lead to the accumulation of hydrates and waxes, which can obstruct pipelines and impede fluid flow. Coatings and thin films can be engineered to inhibit the adherence of these substances to surfaces, maintaining efficient flow even under challenging conditions.

The studies developed by [8] demonstrate the predictive ability of artificial neural networks fed by a class of data-driven virtual flow meters (VFM). This work showed that neural networks could be a promising tool for predicting multiphase flow rates using pressure and temperature data. Conversely, as indicated by the research presented in reference [9], the update frequency of a steady-state VFM model is critical for sustaining optimal performance over time, particularly in dynamic, non-stationary conditions.

The study of multiphase flows integrates concepts directly related to transport phenomena in modeling and flow characterization, using the principles of conservation of mass and conservation of momentum. These principles allow for a mathematical approximation of flow characteristics inside pipelines [10]. The description of multiphase flow behavior requires the use of experimental data or correlations due to the complexity involved in their study. Although advanced computational methods are being developed, data and correlations are generally still needed to implement multiphase flow modeling effectively [11].

Multiphase flows can be found in multiple engineering practices, especially in the oil industry, where two-phase and tri-phase flows can be found [12]. Two-phase flow refers to the interactive flow of two distinct phases (each phase representing a mass or volume of matter) with common interfaces in a channel. Possible combinations are solid–liquid, where solid particles are dispersed primarily in liquid; solid–gas, where a gas stream transports solid particles; and liquid–vapor (gas), where the volume fraction of one phase relative to the other results in different flow regimes and combination of the above [13].

At present, there is a large number of studies on the behavior of multiphase flows. One approach being explored is data-driven modeling, a technique that analyzes system data and looks for relationships between input and output variables without relying on knowledge of the underlying physical equations [14, 15]. This promising approach makes it possible to take advantage of the information contained in the data and discover patterns and correlations that can be used to understand and predict the behavior of multiphase flows efficiently and accurately. This poses challenges when supplying input data; thus, it is essential to give adequate statistical treatment and avoid a poor fit in model parameters [16]. This process is achieved by implementing artificial neural networks, which start by calculating a function with the inputs given, propagating the calculated values from the input neurons to the output neurons and using the weights as intermediate parameters. Learning occurs by changing the weights that connect the neurons [17] to have predictions that fit reality.

The study in [18] used daily data on the production of oil, water, and gas generated from an oil field as the primary input variable to model a neural network, resulting in predictions of multiphase production in the short term. The long short-term memory (LSTM) algorithm was implemented to generate the model. This approach showed that, compared to the model based on numerical simulation physics, this neural network approach was less dependent on engineering calculations and offered faster and more robust solutions. In [19], an investigation was conducted using artificial intelligence systems with actual data from an oil platform in the South China Sea to develop a virtual flow meter model. This work demonstrated how LSTM considers the influence of previous time steps on the evaluated flow rate, capturing the relationship between parameters such as temperature and pressure on the predicted flow rate. LSTM is also highlighted in [20], where it was shown that precision is improved if the input and output sequence lengths are equal and how to remove this limitation without losing precision in the neural network.

This work presents the development of a recurrent neural network model implementing the LSTM algorithm to predict flow patterns generated within vertical two-phase liquid–liquid flow pipes of lubricating oil and water. First, two-phase flow data is collected and processed. The following section discusses structuring the LSTM neural network, followed by the evaluation and optimization approaches. Finally, results are presented for different network configurations, followed by concluding remarks.

2 Methodology

2.1 Data processing

The data used in this study was collected by [21], who compiled relevant information on the two-phase flow of oil and water in a vertical pipe. For this, the works of 18 authors previously published in the literature were analyzed. These collected data provide a solid and representative basis for the development of this work, allowing an exhaustive and well-founded analysis of the behavior of the biphasic flow in this specific configuration. Table 1 presents the information in the database, summarizing the quantitative characteristics of each analyzed work.

Table 1 Summary of the collected database from reference authors

Full size table

Different types of two-phase flows were considered in the study, including dispersed oil-in-water (D O/W), dispersed water-in-oil (D W/O), churn oil-in-water (churn O/W), churn water-in-oil (churn W/O), core flow, stratified oil-in-water (S O/W), stratified water-in-oil (S W/O), flow in transition (TF), very fine dispersed oil-in-water (VFD O/W), and very fine dispersed water-in-oil (VFD W/O); see Fig. 1. These different types of two-phase flows represent various configurations and flow patterns that can occur in oil and water systems and were considered in order to better understand the characteristics and behavior of multiphase flows.

The variables used to build the database are the superficial velocities of the water phase (${j}_{w}$) and oil phase (${j}_{o}$). These velocities are function of injection rates and the cross-sectional area of the pipe, as well as the mixture velocity ($J$) and the volumetric fractions of oil–water fluids (${\varepsilon }_{o}$ and ${\varepsilon }_{w}$) that characterize the flow. Other important parameters included in the data collection are the oil viscosity (${\mu }_{o}$), oil density (${\rho }_{o})$, and the internal diameter of the pipe ($D$). Specific geometrical configurations are generated for each combination of parameters, grouped for this work into the ten flow patterns, which are the output data of the intelligent model to be developed. Table 2 presents the specific amount of data available for each flow pattern.

Table 2 Quantification of data for each flow pattern

Full size table

2.1.1 Normalization

To train the neural network, it is crucial to normalize the information that will be used as inputs to the intelligent model, providing precise data that allows the network to relate the supplied variables with the predicted pattern. Normalization is defined as a scale change of the original data such that it falls within a specific range [38]. The input vector of the network is normalized, which contains $x=\left[{j}_{o}, {j}_{w}, j, {\varepsilon }_{o}, {\varepsilon }_{w}, D, {\mu }_{o}\right]$. Equation (1) presents the mathematical definition used to normalize the numerical information:

$${X}_{{\text{normalized}}}=\frac{x-{X}_{{\text{min}}}}{{X}_{{\text{max}}}-{X}_{{\text{min}}}}$$

(1)

The variable ${X}_{{\text{max}}}$ represents the maximum value of the series under study, while ${X}_{{\text{min}}}$ represents the minimum value of the series, and $x$ corresponds to the point value to be normalized.

2.1.2 Codification

Given that the flow pattern is defined in the database as a categorical variable, it is necessary to encode the label that is assigned to the ten different configurations. To carry out this encoding, a number from 0 to 9 is assigned to each of the defined flow patterns. Since there are different data labels, simple encoding is insufficient. In this case, encoding is achieved using the OneHotEncoder function, a well-established technique commonly employed in the fields of data science and machine learning. The fundamental concept behind one-hot encoding is to represent categorical variables as binary vectors. Each category is transformed into a separate column with binary values (1 or 0) to indicate the presence or absence of that category in a data row. This process eliminates the encoded variable as a whole and introduces a binary variable for each unique integer value. Table 3 shows the result of encoding the ten flow patterns with their respective binary representation.

Table 3 OneHotEncoder codification for the flow patterns

Full size table

2.2 LSTM model structuring

The identification of two-phase flow patterns in vertical oil pipelines is an inherently sequential and time-dependent problem. Flow patterns change in response to previous events and evolve continuously. In this context, the use of long short-term memory networks (LSTM) is justified due to its ability to capture sequential dependencies and handle time series data. While support vector machines (SVM), artificial neural networks (ANN), K-means, K-nearest neighbors (KNN), and similar techniques are highly effective in classification and clustering problems, they do not possess the inherent capacity of LSTM to effectively model and predict data sequences. With all the processed information from the reference database, we proceed with the definition of the internal structure of the LSTM. A general scheme of the structure of this type of neural network is presented in Fig. 2, as well as the vector with the input parameters that will be used to generate the predictions. Figure 2 shows how the hidden layers are connected sequentially to process the data from inputs layers. In LSTM, the combination of the current input and the previous hidden state, along with the LSTM’s various gates (including the forget gate, input gate, and output gate), determines how the cell state is updated and what information is retained or forgotten in the memory. The input vector could be interpreted as a three-dimensional array, composed of samples, time series, and features. Once the input vector with the parameters is defined, the categorical representation is generated, which will be the output of the predictive model. In this case, the output has been assigned the flow pattern developed inside the vertical pipes.

The operation of an LSTM is based on the internal process that occurs in its state cells. These cells integrate activation functions to process both new information and information from previous state cells, thus allowing sequential processing and retention of relevant information over time. This mechanism of integration and processing of information in the state cells is essential for the effective operation of the LSTM. Figure 3 presents a diagram of the logical sequence that a state cell follows internally. Several steps occur before obtaining a hidden state or an output from the cell, which involve multiple transformations of the data.

The logical sequence developed within the LSTM cells allows for a detailed explanation of the order in which information is processed within them. Thus, Fig. 3a presents a filter for which information is retained and which is discarded in the cell state. This operation is carried out thanks to the sigmoid layer ($\sigma$) known as the forget gate:

$${f}_{t}=\sigma \left({W}_{f}\left[{h}_{t-1},{x}_{t}\right]+{b}_{f}\right)$$

(2)

This gate analyzes ${h}_{t-1}$ and ${X}_{t}$, which correspond to the current input and the output of the previous state, and generates a number between 0 and 1 for each number in the cell state ${C}_{t-1}$. $W$ is the weighting function adjusted inside the algorithm during the training and $b$ is the bias. Thus, when the function generates a 1, it saves this information, while when it generates a value of 0, it skips the processed information.

Then, the neural network selects the information that will be stored in the cell state (Fig. 3b). This processing is done in two parts: in the first part, a sigmoid layer evaluates the values to be updated in the input gate ${i}_{t}$, and in the second part, a layer with the hyperbolic tangent function ($tanh$) creates a new vector with candidate values ${C}_{t}$, some of which will be added to the cell state. This sequence is synthesized in Eqs. (3) and (4):

$${i}_{t}=\sigma \left({W}_{i}\left[{h}_{t-1},{x}_{t}\right]+{b}_{i}\right)$$

(3)

$${\widetilde{C}}_{t}=tanh\left({W}_{c}\left[{h}_{t-1},{x}_{t}\right]+{b}_{c}\right)$$

(4)

After selecting the information to be included in the cell state, the cell state is updated (Fig. 3c). The previous state, ${C}_{t-1}$, is converted to ${\widetilde{C}}_{t}$ by multiplying it with ${f}_{t}$, and then adding ${i}_{t} {C}_{t}$. This results in Eq. (5):

$${C}_{t}={f}_{t}{C}_{t-1}+{i}_{t}{C}_{t}$$

(5)

Finally, in Fig. 3d, the value that will continue the process to the upper cells is selected. The output gate ${o}_{t}$ is based on the state of the cell, and a sigmoid layer is executed to decide which parts of the cell state are generated. Then, the cell state is processed with a hyperbolic tangent function that converts the values to a range between − 1 and 1, which is multiplied by the output of the sigmoid gate such that only the selected parts are endorsed, generating Eqs. (6) and (7):

$${o}_{t}=\sigma \left({W}_{o}\bullet \left[{h}_{t-1},{x}_{t}\right]+{b}_{o}\right)$$

(6)

$${h}_{t}={o}_{t}tanh({C}_{t})$$

(7)

In problem-solving using neural networks, objective functions tend to be nonlinear. The nature of artificial neurons is logistic regression, which results in a linear function:

$$P=\frac{1}{1+{e}^{-y}}$$

(8)

$$y={a}_{1}{x}_{1}+\dots +{a}_{n}{x}_{n}$$

(9)

Equation (8) shows the logistic regression function, which is a discrete binary function, while Eq. (9) represents a linear function. When multiple neurons are grouped into multiple layers, the network becomes a super neuron, whose goal is a linear function. In this way, activation functions are implemented for neural networks, which generate non-linearity in the objective function. Neural networks use activation functions that are nonlinear to develop the learning process [39].

2.3 Model evaluation and optimization

The loss function is essential to evaluate the performance of an intelligent model. In this context, key statistical parameters are used to determine the performance and accuracy of the proposed models. The mean square error (MSE) and the cross-entropy are two of the statistical parameters used for this purpose. These parameters allow us to quantify and measure the discrepancy between the model predictions and the actual values, providing a quantitative assessment of the accuracy and quality of the model predictions. The mean square error reads

$$MSE=\frac{{\sum }_{i=1}^{n}{\left({y}_{i}- \lambda \left({x}_{i}\right)\right)}^{2} }{n}$$

(10)

where ${y}_{i}$ is the target value for the test instance ${x}_{i}$, $\lambda \left({x}_{i}\right)$ is the expected target value for the test instance ${x}_{i}$, and $n$ is the number of test instances.

For cross-entropy, $H$ is the cross-entropy function, $P$ is the target distribution, and $Q$ is the approximation of the target distribution. Cross-entropy is calculated using probabilities of events from $P$ and $Q$ as

$$H\left(P,Q\right)=-{\sum }_{x\in X}P\left(x\right)\times log(Q(x))$$

(11)

In order to optimize the final model, taking into account its evaluation, different configurations are defined by varying parameters of the neural network structure, such as the number of cells per layer, the number of stacked hidden layers, the activation function of the model, and the loss function. All these adjustments are made using open-source code in Python v3.9. The parameters for each type of LSTM configuration are presented in Table 4.

Table 4 Parameter definition for the structured LSTM configurations

Full size table

During the structuring and development phases of the predictive model, it is crucial to pay attention to the compilation stage, where all the parameters selected to train the network are carefully considered. In the particular case of the LSTM 3 network, the model is compiled specifying the Adam optimizer and the categorical cross-entropy loss function. This configuration is implemented with the aim of achieving better performance in the classification of the series, taking into account the type of information that is being used.

The model is adjusted by updating the weights for each parameter included in the training set, considering that previous cell states are integrated into each of them. All LSTM structures were trained with 80% of the collected data. In the evaluation, the remaining 20% of the data is used to verify the results in each of the metrics specified at the time of compiling the model. A graphical analysis is carried out in Fig. 4 with the aim of selecting the model with the best performance. The values of the loss function for each model are examined both in the training phase and in the validation phase, considering a total of 300 epochs. Through this analysis, we seek to identify the model that presents a lower loss and, therefore, a better fit to the data and a greater predictive capacity.

After evaluating the performance of the different configurations, notice that during training, LSTM 1 tends to decrease the error faster than the other models, making it the optimal configuration for predicting flow patterns. Figure 4a shows the performance of each of the four models on the training data, with LSTM 1 being selected over the others due to its performance in the validation phase, having the lowest error of all models. On the other hand, LSTM 4 implements a different loss equation from the others, starting its training with a considerable error and taking longer to decrease it. This is due to the characteristics of the loss function, which does not binary classify the expected flow patterns, but treats them as an integer encoding. When compared to the expected output, it creates a discrepancy in the output of the network. Figure 4b shows the performance of the networks on the validation data, detailing the behavior of the networks when faced with unknown data. The error fluctuates compared to that obtained in the training data. Values tend to converge if the loss function is the same. However, when testing with the MSE loss function, it presents an error greater than 2%, demonstrating that it is the model with the lowest performance.

Figure 5a presents the accuracy values obtained for each model during the training. LSTM models 1, 2, and 3 tend to increase their accuracy. Only LSTM model 4, which has a different loss function, presents a linear behavior stagnating at an accuracy of less than 0.1. This behavior is due to the complexity of the prediction, and this complexity is given by the need to predict one flow pattern within ten different types. With OneHotEncoder encoding, better performance is obtained than with normal encoding, due to OneHotEncoder encoding the data in a binary format. More data is needed to increase the accuracy of the network. On the other hand, Table 2 shows how a certain flow pattern represents less than 1% of the total data. This may incur in overfitting of the network, causing the model to remember the supplied data, but be inaccurate with model evaluation data.

The adjustment of the number of epochs tends to be irrelevant if it is prolonged over time, due to its convergent trend. The LSTM 1 model was selected to test the generation of flow maps, because the error obtained in this model is less than 1%, and its accuracy grows exponentially with the epochs, reaching a value close to 0.9 in the training. In this model, the supplied data was parameterized to observe the behavior of the model.

3 Results

In this section, the performance of the LSTM 1 model in predicting flow patterns is evaluated, and the error seen in the previous section is verified. Different flow maps generated by the LSTM 1 network using reference data are presented. After selecting the model with the best performance, in this case, LSTM 1, a flow map generation test was carried out. For this, 300 data sets were provided as input to the network. Figure 6 shows the number of times the model correctly predicted the flow pattern. Yellow marks represent the flow pattern of the provided data, while blue marks indicate the predictions generated by the network. An error of only 0.3% is observed, which agrees with the results presented in Fig. 4a. The loss function used to evaluate the model demonstrates excellent classification performance, indicating that the model accurately fits the actual data. In this case, there was a mismatch in a single prediction, and as the amount of data increases, the error is expected to increase to 0.8% when all data is considered. Although the data provided to generate this graph only includes 8 of the 10 possible flow patterns, it is still a significant sample due to the amount of data used. The model, however, incorrectly predicted a pattern for that specific data.

The model did not generate additional patterns beyond the eight present in the range. For the specific data points in the input vector $x=\left[{j}_{o}, {j}_{w}, j, {\varepsilon }_{o}, {\varepsilon }_{w}, D, {\mu }_{o}\right]$, the model predicted a pattern of D W/O, while the expected pattern was VFD W/O. However, this is within the expected error margins.

The flow map generated with the complete dataset from reference data is presented in Fig. 7, which shows the flow pattern depending on ${J}_{w}$ and ${J}_{o}$. in logarithmic scale. The flow map indicates that the decrease in the magnitudes of water and oil velocities induces the formation of the flow pattern of S W/O and D W/O. Figure 7 also shows that the increase in the magnitude of ${J}_{w}$ and ${J}_{o}$ generates the formation of the other types of flow patterns. Notice that the S O/W flow pattern presents overlapping points, which represent the influence of other variables in the formation of a flow pattern, such as internal diameter and oil viscosity.

Figure 8 shows the flow patterns obtained by implementing LSTM 1 network. The differences between the maps presented in Fig. 7 and Fig. 8 can be seen in the region of the flow map where the transition between flow patterns occurs, generated when the oil velocity is high and the water velocity is low. The magnitude ranges of the superficial velocities of the substances are 0.01 to 1 m/s for ${J}_{w}$ and 0.1 to 1 m/s for ${J}_{o}$.

In this study, two flow maps were developed considering the information provided by each corresponding author. Figure 9 presents the flow map structured with the information presented by [24], in which five flow patterns (CoreFlow, D O/W, D W/O, TF, VFD O/W) and a total of 125 data points are identified. Additionally, the separation zone between the D W/O and D O/W flow patterns is presented. Considering the magnitudes of the velocities developed by the substances, 60% of the data points are in the D O/W flow pattern. On the other hand, the CoreFlow pattern is presented in the zone of high superficial velocities for water and low velocities for oil.

The implementation process of the LSTM 1 network resulted in the flow map presented in Fig. 10. The performance of the network is low in the fluid transition zone, showing multiple prediction errors and generating patterns that are not defined in that zone. In this case, there are nine correct predictions out of 18, which is due to the behavior of the fluid in the transition area. The transition involves the change of the dominant fluid, and different authors have limited information about this zone due to its chaotic behavior, which cannot be fully controlled. However, it is worth noting that the LSTM 1 network accurately predicts the flow patterns developed in the other zones of the flow map.

Additionally, Fig. 11 presents the flow map developed using the information from [33], which shows a greater number of points in the transition zone, and also presents fewer flow patterns (D O/W, D W/O, S O/W, TF). Figure 12 shows the results obtained by processing the data with the LSTM 1 network, in which 31 correct flow patterns out of the 38 expected are identified, showing better performance compared to the flow chart generated in Fig. 10. The model generates VFD O/W, which corresponds to an intermediate pattern delimited by TF and D W/O, demonstrating correct behavior of the neural network. Moreover, it shows that only two patterns are generated outside the range delimited by the reference data, which are the VFD O/W and VFD W/O patterns. Furthermore, in the velocity range of 0.01 to 0.1 m/s for ${J}_{w}$ and 0.09 to 0.1 m/s for ${J}_{o}$, the flow pattern S O/W is presented, for which the predictions developed by the implemented network show high precision when the injection velocities of water and oil are low, exhibiting only one error out of 52 predictions in this area. From this analysis, it is concluded that an increase in the oil injection velocity results in a displacement of the zone towards the vicinity of the transition zone. As a result, a greater imprecision is observed in the flow patterns obtained when using the LSTM 1 model.

4 Conclusions

In this study, various LSTM models were built for the prediction of data series containing liquid–liquid biphasic fluid characteristics of oil and water. These characteristics include oil and water injection rates, oil and water volume fractions, oil viscosity, and internal diameter of the pipe through which they flow. The objective was to determine the flow pattern present in a determined value of each characteristic. Four different configurations for the LSTM network were proposed, and once compiled and trained, the error was evaluated against the expected predictions. The results showed that LSTM 1 had the smallest error, with 0.98%, followed by LSTM 2 with 1.2%, LSTM 3 with 1.4%, and LSTM 4 with 3.5%. The LSTM 1 setup consisted of two hidden layers with 80 memory cells in each, using categorical cross-entropy as the loss function. This allowed flow pattern prediction accuracy in a vertical pipe with less than 1% error.

The low performance of the LSTM 1 model in the transition zone is due to the scarcity of data available in these areas. In addition, it was shown that the performance of the model improves significantly when a large amount of data is available, increasing the accuracy from 50 to 83%. This highlights the importance of providing a larger amount of data for network training.

This study has shown that it is possible to develop models based on the prediction of time series using injection velocities, volume fractions, diameter, and viscosity, to determine liquid–liquid biphasic flow patterns of oil and water in a vertical pipeline. LSTM networks are particularly well-suited for this task due to their ability to model sequential data, including time-dependent behavior in flow patterns. By structuring the data into temporal sequences, LSTM enables the discernment of patterns that other techniques might overlook, offering a detailed and insightful analysis of flow behavior. This represents a significant advantage in addressing the dynamic nature of flow patterns in complex systems. In the future, it is planned to investigate the behavior of the model with data from offshore platforms and use physically informed networks to obtain samples, with the aim of improving the quality and representativeness of the data used in network training.

Data availability

The data presented in this study are available on request from the corresponding author.

References

F. Amir, Y. Zhang, in Perspectives on statistical thermodynamics. Introduction to transport phenomena (Cambridge University Press, 2017), pp. 81–93. https://doi.org/10.1017/9781316650394.010
A. Archibong-Eso, N.E. Okeke, Y. Baba, A.M. Aliyu, L. Lao, H. Yeung, “Estimating slug liquid holdup in high viscosity oil-gas two-phase flow”, Flow Meas. Instrum. 65, 22–32 (2018). https://doi.org/10.1016/j.flowmeasinst.2018.10.027. (Mar. 2019)
Article Google Scholar
A.C. Bannwart, O.M.H. Rodriguez, C.H.M. de Carvalho, I.S. Wang, R.M.O. Vara, Flow patterns in heavy crude oil-water flow. J. Energy Resour. Technol. 126(3), 184–189 (2004). https://doi.org/10.1115/1.1789520
Article CAS Google Scholar
C.M. Ruiz-Diaz, M.M. Hernández-Cely, O.A. González-Estrada, Analysis of liquid-liquid (water and oil) two-phase flow in vertical pipes, applying artificial intelligence techniques. J. Phys. Conf. Ser. 2046(1), 012016 (2021). https://doi.org/10.1088/1742-6596/2046/1/012016
Article Google Scholar
G. Mask, X. Wu, K. Ling, An improved model for gas-liquid flow pattern prediction based on machine learning. J. Pet. Sci. Eng. 183, 106370 (2019). https://doi.org/10.1016/j.petrol.2019.106370
Article CAS Google Scholar
E.M. Fayyad, A.M. Abdullah, M.K. Hassan, A.M. Mohamed, G. Jarjoura, Z. Farhat, Recent advances in electroless-plated Ni-P and its composites for erosion and corrosion applications: a review. Emergent Mater. 1(1–2), 3–24 (2018). https://doi.org/10.1007/s42247-018-0010-4
Article CAS Google Scholar
M. Parsi, K. Najmi, F. Najafifard, S. Hassani, B.S. McLaury, S.A. Shirazi, A comprehensive review of solid particle erosion modeling for oil and gas wells and pipelines applications. J. Nat. Gas Sci. Eng. 21, 850–873 (2014). https://doi.org/10.1016/j.jngse.2014.10.001
Article Google Scholar
A.R. Hasan, C.S. Kabir, A simplified model for oil/water flow in vertical and deviated wellbores. SPE Prod. Facil. 14(1), 56–62 (1999). https://doi.org/10.2118/54131-PA
Article CAS Google Scholar
M. Hotvedt, B.A. Grimstad, L.S. Imsland, Passive learning to address nonstationarity in virtual flow metering applications. Expert Syst Appl 210, 118382 (2022). https://doi.org/10.1016/j.eswa.2022.118382
Article Google Scholar
Y.A. Çengel, J.M. Cimbala, Fluid mechanics: Fundamentals and applications, 3rd edn. (McGraw Hill, New York, NY, 2014)
Google Scholar
Q.B. Yong Bai, Subsea Engineering Handbook, 1st edn. (Elsevier, Houston, 2010). https://doi.org/10.1016/B978-1-85617-689-7.10028-7
Book Google Scholar
X. Chen, L. Guo, Flow patterns and pressure drop in oil–air–water three-phase flow through helically coiled tubes. Int. J. Multiph. Flow 25(6–7), 1053–1072 (1999). https://doi.org/10.1016/S0301-9322(99)00065-8
Article CAS Google Scholar
A. Faghri, Y. Zhang, Fundamentals of Multiphase Heat Transfer and Flow (Springer International Publishing, Cham, 2020)
Book Google Scholar
A. Fallis, Practical Hydroinformatics: Computational Intelligence and Technological Developments in Water Applications 53, 9 (2013)
Google Scholar
C.M. Ruiz-Diaz, M.M. Hernández-Cely, O.A. González-Estrada, Modelo predictivo para la identificación de la fracción volumétrica en flujo bifásico. Cienc en Desarro 12(2), 49–55 (2021). https://doi.org/10.19053/01217488.v12.n2.2021.13417
Article Google Scholar
G. M. P. Andrade et al. Virtual flow metering of production flow rates of individual wells in oil and gas platforms through data reconciliation,” J. Pet. Sci. Eng 208. 2022, https://doi.org/10.1016/j.petrol.2021.109772.
C.C. Aggarwal, Neural Networks and Deep Learning, vol. 1 (Springer International Publishing, Cham, 2018)
Book Google Scholar
J. Sun, X. Ma, and M. Kazi, “Comparison of decline curve analysis DCA with recursive neural networks RNN for production forecast of multiple wells,” SPE West. Reg. Meet. Proc 2018. https://doi.org/10.2118/190104-ms.
S. Song et al., An intelligent data-driven model for virtual flow meters in oil and gas development. Chem. Eng. Res. Des. 186, 398–406 (2022). https://doi.org/10.1016/j.cherd.2022.08.016
Article CAS Google Scholar
N. Andrianov, A machine learning approach for virtual flow metering and forecasting. IFAC-PapersOnLine 51(8), 191–196 (2018). https://doi.org/10.1016/j.ifacol.2018.06.376
Article Google Scholar
C. M. Ruiz-Diaz, “Caracterización hidrodinámica de flujos multifase empleando técnicas de inteligencia artificial,” Master Thesis, Universidad Industrial de Santander, Bucaramanga, 2021.
P. Abduvayt, R. Manabe, T. Watanabe, and N. Arihara, “Analysis of oil-water flow tests in horizontal, hilly-terrain, and vertical pipes,” in Proceedings of SPE Annual Technical Conference and Exhibition, 2004. 1335–1347. https://doi.org/10.2523/90096-MS.
M. Du, N.-D. Jin, Z.-K. Gao, Z.-Y. Wang, L.-S. Zhai, Flow pattern and water holdup measurements of vertical upward oil–water two-phase flow in small diameter pipes. Int. J. Multiph. Flow 41, 91–105 (2012). https://doi.org/10.1016/j.ijmultiphaseflow.2012.01.007
Article CAS Google Scholar
J. Flores, X. Chen, J. Brill, Characterization of oil-water flow patterns in vertical and deviated wells. Proceedings of SPE Annual Technical Conference and Exhibition 1, 55–64 (1997). https://doi.org/10.2523/38810-MS
Article Google Scholar
T. Ganat, S. Ridha, M. Hairir, J. Arisa, R. Gholami, Experimental investigation of high-viscosity oil–water flow in vertical pipes: flow patterns and pressure gradient. J. Pet. Explor. Prod. Technol. 9(4), 2911–2918 (2019). https://doi.org/10.1007/s13202-019-0677-y
Article CAS Google Scholar
G.W. Govier, G.A. Sullivan, R.K. Wood, The upward vertical flow of oil-water mixtures. Can. J. Chem. Eng. 39(2), 67–75 (1961). https://doi.org/10.1002/cjce.5450390204
Article CAS Google Scholar
Y.F. Han, N.D. Jin, L.S. Zhai, H.X. Zhang, Y.Y. Ren, Flow pattern and holdup phenomena of low velocity oil-water flows in a vertical upward small diameter pipe. J. Pet. Sci. Eng. 159(May), 387–408 (2017). https://doi.org/10.1016/j.petrol.2017.09.052
Article CAS Google Scholar
A. R. Hasan and C. S. Kabir, “New model for two-phase oil/water flow: production log interpretation and tubular calculations,” Soc. Pet. Eng. AIME, SPE, PI, 369–382 18216 1988, https://doi.org/10.2118/18216-pa.
A.K. Jana, G. Das, P.K. Das, Flow regime identification of two-phase liquid–liquid upflow through vertical pipe. Chem. Eng. Sci. 61(5), 1500–1515 (2006). https://doi.org/10.1016/j.ces.2005.09.001
Article CAS Google Scholar
A.K. Jana, P. Ghoshal, G. Das, P.K. Das, An analysis of pressure drop and holdup for liquid-liquid upflow through vertical pipes. Chem. Eng. Technol. 30(7), 920–925 (2007). https://doi.org/10.1002/ceat.200700033
Article CAS Google Scholar
J. Guo et al., Heavy oil-water flow patterns in a small diameter vertical pipe under high temperature/pressure conditions. J. Pet. Sci. Eng. 171, 1350–1365 (2018). https://doi.org/10.1016/j.petrol.2018.08.021
Article CAS Google Scholar
R.A. Mazza, F.K. Suguimoto, Experimental investigations of kerosene-water two-phase flow in vertical pipe. J. Pet. Sci. Eng. 184, 106580 (2020). https://doi.org/10.1016/j.petrol.2019.106580
Article CAS Google Scholar
K. Mydlarz-Gabryk, M. Pietrzak, L. Troniewski, Study on oil-water two-phase upflow in vertical pipes. J. Pet. Sci. Eng. 117, 28–36 (2014). https://doi.org/10.1016/j.petrol.2014.03.007
Article CAS Google Scholar
O.M.H. Rodriguez, A.C. Bannwart, Experimental study on interfacial waves in vertical core flow. J. Pet. Sci. Eng. 54(3–4), 140–148 (2006). https://doi.org/10.1016/j.petrol.2006.07.007
Article CAS Google Scholar
J. Xu, D. Li, J. Guo, Y. Wu, Investigations of phase inversion and frictional pressure gradients in upward and downward oil–water flow in vertical pipes. Int. J. Multiph. Flow 36(11–12), 930–939 (2010). https://doi.org/10.1016/j.ijmultiphaseflow.2010.08.007
Article CAS Google Scholar
Y. Yang et al., Oil-water flow patterns, holdups and frictional pressure gradients in a vertical pipe under high temperature/pressure conditions. Exp. Therm. Fluid Sci. 100, 271–291 (2018). https://doi.org/10.1016/j.expthermflusci.2018.09.013. (Jan. 2019)
Article ADS Google Scholar
D. Zhao, L. Guo, X. Hu, X. Zhang, X. Wang, Experimental study on local characteristics of oil-water dispersed flow in a vertical pipe. Int. J. Multiph. Flow 32(10–11), 1254–1268 (2006). https://doi.org/10.1016/j.ijmultiphaseflow.2006.06.004
Article CAS Google Scholar
J. Brownlee, Long short-term memory networks with python develop sequence prediction models with deep learning (Machine Learning Mastery, Vermont, 2017)
Google Scholar
K.K. Chandriah, R.V. Naraganahalli, RNN/LSTM with modified Adam optimizer in deep learning approach for automobile spare parts demand forecasting. Multimed. Tools Appl. 80(17), 26145–26159 (2021). https://doi.org/10.1007/s11042-021-10913-0
Article Google Scholar
C. Ruiz-Díaz, M.M. Hernández-Cely, O.A. González-Estrada, Modelo predictivo para el cálculo de la fracción volumétrica de un flujo bifásico agua- aceite en la horizontal utilizando una red neuronal artificial. Rev UIS Ing 21(2), 155–164 (2022). https://doi.org/10.18273/revuin.v21n2-2022013
Article Google Scholar

Download references

Funding

Open Access funding provided by Colombia Consortium This work was supported by Universidad Industrial de Santander (Grant numbers VIE 3714).

Author information

Authors and Affiliations

Industrial Multiphase Flow Laboratory (LEMI), São Carlos School of Engineering, University of São Paulo, São Carlos, 13566-590, Brazil
Carlos Mauricio Ruiz-Díaz
Research Group in Energy and Environment (GIEMA), Universidad Industrial de Santander, Cr 27 C 9, Bucaramanga, 680002, Colombia
Brayan Quispe-Suarez & Octavio Andrés González-Estrada

Authors

Carlos Mauricio Ruiz-Díaz
View author publications
You can also search for this author in PubMed Google Scholar
Brayan Quispe-Suarez
View author publications
You can also search for this author in PubMed Google Scholar
Octavio Andrés González-Estrada
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the study conception and design. Material preparation, data collection, and analysis were performed by Carlos Mauricio Ruiz-Diaz and Brayan Quispe-Suarez. The first draft of the manuscript was written by Carlos Mauricio Ruiz-Diaz and Octavio Andrés González-Estrada, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Octavio Andrés González-Estrada.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ruiz-Díaz, C.M., Quispe-Suarez, B. & González-Estrada, O.A. Two-phase oil and water flow pattern identification in vertical pipes applying long short-term memory networks. emergent mater. (2024). https://doi.org/10.1007/s42247-024-00631-2

Download citation

Received: 09 August 2023
Accepted: 13 January 2024
Published: 12 February 2024
DOI: https://doi.org/10.1007/s42247-024-00631-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Two-phase oil and water flow pattern identification in vertical pipes applying long short-term memory networks

Abstract

Similar content being viewed by others

Dynamic prediction of moving trajectory in pipe jacking: GRU-based deep learning framework

Utilization of machine learning for the estimation of production rates in wells operated by electrical submersible pumps

Prediction of the Incrustating Trend in Oil Extraction Pipelines: An Approach Based on Neural Decision Trees

1 Introduction