High-Precision Geosteering via Reinforcement Learning and Particle Filters
Ressi Bonti Muhammad, Apoorv Srivastava, Sergey Alyaev, Reidar Brumer Bratvold, Daniel M. Tartakovsky
TL;DR
This paper tackles the challenge of real-time, robust geosteering by integrating reinforcement learning (RL) with particle filtering (PF) to handle uncertainty in subsurface boundaries. It proposes three decision-making pathways—RL alone, PF alone, and a synergistic RL+PF approach (RL-Estimation)—plus a rule-based PF-informed method for comparison. Through a realistic geosteering scenario with gamma-ray logs and multiple layers and faults, RL-Estimation achieves the best performance, yielding reservoir contact near 89% and high stability, albeit with substantial computational cost due to PF. Benchmarking against theoretical optima shows PF-based RL approaches can closely match the best possible outcomes when state estimates are accurate, while the look-ahead information offers the greatest gains. Overall, the study demonstrates a meaningful synergy between RL and PF for high-precision geosteering and highlights directions for reducing computational burden while preserving accuracy.
Abstract
Geosteering, a key component of drilling operations, traditionally involves manual interpretation of various data sources such as well-log data. This introduces subjective biases and inconsistent procedures. Academic attempts to solve geosteering decision optimization with greedy optimization and Approximate Dynamic Programming (ADP) showed promise but lacked adaptivity to realistic diverse scenarios. Reinforcement learning (RL) offers a solution to these challenges, facilitating optimal decision-making through reward-based iterative learning. State estimation methods, e.g., particle filter (PF), provide a complementary strategy for geosteering decision-making based on online information. We integrate an RL-based geosteering with PF to address realistic geosteering scenarios. Our framework deploys PF to process real-time well-log data to estimate the location of the well relative to the stratigraphic layers, which then informs the RL-based decision-making process. We compare our method's performance with that of using solely either RL or PF. Our findings indicate a synergy between RL and PF in yielding optimized geosteering decisions.
