Well data preprocessing using machine learning algorithms
https://doi.org/10.51890/2587-7399-2023-8-3-163-166
Abstract
The task of interpolating missing data is important for integrated asset modeling, since the accuracy and reliability of modeling directly depend on the quality of the input data. If the model is missing some data, then this may lead to the impossibility of modeling for this time step. Thus, the problem of interpolation of missing data is important and its solution improves the accuracy and reliability of forecasts.
Aim. The aim of the work is to improve the quality of data for integrated asset modeling (IMA).
Materials and methods. The data from the technological regime of wells, necessary for well modeling within the framework of IMA, are considered as initial data. The table view data contains gaps (missed values), which reduces the amount of data available for modeling.
Results. The paper shows a method for filling gaps with machine learning algorithms on the example of a real field. Missing GOR readings for many dates were predicted by machine learning models.
Conclusions. As a result of the work, an algorithm for interpolating data was proposed to identify one case of infection. This step can increase the amount of data without gaps and significantly improve the quality of the models. In this problem, we were able to increase the amount of data for training models by 98 % in this way, which meets the requirements for the maximum work model by an average of 41 %.
About the Authors
K. A. PechkoRussian Federation
Konstantin A. Pechko — Chief specialist
71, Moika River emb., 191186, Saint Petersburg
A. A. Chuprov
Russian Federation
Artem A. Chuprov — Trainee
75–79, liter D, Moika River emb., 190000, Saint Petersburg
A. A. Afanasiev
Russian Federation
Aleksandr A. Afanasiev — Chief specialist
75–79, liter D, Moika River emb., 190000, Saint Petersburg
M. V. Simonov
Russian Federation
Maksim V. Simonov — Head of the center
75–79, liter D, Moika River emb., 190000, Saint Petersburg
References
1. Pechko K.A., Senkin I.S., Belonogov E.V. Well modeling using machine learning methods for integrated modeling // PRONEFT. Professionally about oil, 2022, no. 7(2), pp. 114–120. https://doi.org/10.51890/2587-7399-2022-7-2-114-120
2. Robert Kosova, Evgjeni Xhafaj, Altin Karriqi. Missing Data In The Oil Industry And Methods Of Imputations Using Spss: The Impact On Reserve Estimation. https://www.researchgate.net/publication/358922022_Missing_Data_In_The_Oil_Industry_And_Methods_Of_Imputations_Using_Spss_The_Impact_On_Reserve_Estimation
3. Quinlan J. R. Induction of decision trees // Machine learning, 1986, vol. 1, pp. 81−106.
4. Douglas H. Johnson. The Insignifi cance of Statistical Signifi cance Testing // Journal of Wildlife Management, Wiley-VCH, 1999, vol. 3, no. 63, pp. 763–772.
Review
For citations:
Pechko K.A., Chuprov A.A., Afanasiev A.A., Simonov M.V. Well data preprocessing using machine learning algorithms. PROneft. Professionally about Oil. 2023;8(3):163-166. (In Russ.) https://doi.org/10.51890/2587-7399-2023-8-3-163-166