Data-Augmented Deep Learning for Downhole Depth Sensing and Field Validation
Siyu Xiao, Xindi Zhao, Tianhao Mao, Yiwei Wang, Yuqiao Chen, Hongyun Zhang, Jian Wang, Junjie Wang, Shuang Liu, Tupei Chen, Yang Liu
TL;DR
This work tackles the challenge of accurate downhole depth calibration from casing collar locator (CCL) signals in data-scarce environments. It presents the Signal Collection Vehicle (SCV) for downhole data acquisition and a comprehensive data-augmentation framework (normalization, label distribution smoothing (LDS), label smoothing regularization (LSR), geometric transformations, and multiple sampling) for training collar-recognition models. Two 1D CNN architectures, TAN and MAN, serve as benchmarks for boundary-moment detection, trained on probability maps rather than sparse one-hot labels. Systematic experiments show that standardization, LDS, and random cropping are fundamental, while LSR, time scaling, and multiple sampling significantly boost generalization; validation on real wells confirms accurate collar localization. The findings offer a practical pathway to improved downhole depth measurement in CCL-limited settings and highlight the value of tailored preprocessing in deep learning for geological signals.
Abstract
Accurate downhole depth measurement is essential for oil and gas well operations, directly influencing reservoir contact, production efficiency, and operational safety. Collar correlation using a casing collar locator (CCL) is fundamental for precise depth calibration. While neural network-based CCL signal recognition has made significant progress in collar identification, preprocessing methods for such applications remain underdeveloped. Moreover, the limited availability of real well data makes it difficult to train neural network models, which require extensive datasets. This paper presents a system integrated into downhole tools for CCL signal acquisition to facilitate dataset construction. We propose comprehensive preprocessing methods for data augmentation and evaluate their effectiveness using our neural network models. Through systematic experiments across combinations of augmentation configurations, we analyze the contribution of each method. Results demonstrate that standardization, label distribution smoothing (LDS), and random cropping are fundamental requirements for model training, while label smoothing regularization (LSR), time scaling, and multiple sampling significantly enhance model generalization. With the proposed augmentation methods, the F1 scores of our two benchmark models improve from 0.937 and 0.952 to as high as 1.0 and 1.0, respectively. Validation on real CCL waveforms confirms the effectiveness and practical applicability of our approach. This work addresses gaps in data augmentation methodology for training casing collar recognition models in environments with limited CCL data.
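As a rough illustration of how the augmentation steps named above might fit together, the following NumPy sketch chains standardization, label smoothing into a probability map, LSR, time scaling, random cropping, and multiple sampling. It is not the authors' implementation: the function names, the Gaussian kernel used for label distribution smoothing, the binary form of LSR, the scaling range of 0.8 to 1.25, and all default parameters are assumptions chosen for illustration only.

```python
import numpy as np

def standardize(x):
    """Scale one waveform to zero mean and (approximately) unit variance."""
    return (x - x.mean()) / (x.std() + 1e-8)

def smooth_labels(onehot, sigma=3.0):
    """Assumed LDS form: spread each collar mark into a Gaussian bump with peak 1."""
    radius = int(3 * sigma)
    t = np.arange(-radius, radius + 1)
    kernel = np.exp(-0.5 * (t / sigma) ** 2)
    return np.clip(np.convolve(onehot, kernel, mode="same"), 0.0, 1.0)

def label_smoothing(y, eps=0.1):
    """Assumed binary LSR: pull targets toward 0.5 by a factor eps."""
    return y * (1.0 - eps) + 0.5 * eps

def time_scale(x, y, factor):
    """Stretch or compress signal and label together by linear interpolation."""
    n = len(x)
    m = max(2, int(round(n * factor)))
    src = np.linspace(0, n - 1, m)
    idx = np.arange(n)
    return np.interp(src, idx, x), np.interp(src, idx, y)

def random_crop(x, y, length, rng):
    """Cut the same random window out of signal and label."""
    start = rng.integers(0, len(x) - length + 1)
    return x[start:start + length], y[start:start + length]

def augment(x, onehot, length, n_samples, rng):
    """Multiple sampling: draw several augmented windows from one record."""
    y = smooth_labels(onehot)
    samples = []
    for _ in range(n_samples):
        xs, ys = time_scale(x, y, rng.uniform(0.8, 1.25))  # assumed range
        xs, ys = random_crop(xs, ys, length, rng)
        samples.append((standardize(xs), label_smoothing(ys)))
    return samples

# Synthetic stand-in for a CCL record with two hypothetical collar positions.
rng = np.random.default_rng(0)
signal = rng.normal(size=2000)
marks = np.zeros(2000)
marks[[500, 1500]] = 1.0
windows = augment(signal, marks, length=512, n_samples=4, rng=rng)
```

Each call to `augment` yields several differently scaled and cropped views of the same record, which is one plausible way to stretch a small set of real-well waveforms into a larger training set.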
