TractOracle: towards an anatomically-informed reward function for RL-based tractography
Antoine Théberge, Maxime Descoteaux, Pierre-Marc Jodoin
TL;DR
Tractography remains ill-posed due to reliance on local diffusion cues, which can yield spurious pathways. The authors introduce TractOracle, an RL-based framework that couples streamline tracking with an anatomically informed reward derived from a transformer-based TractOracle-Net that scores streamlines for plausibility. The approach yields improved true positive rates and reduced false positives across synthetic and in-vivo data, achieving state-of-the-art tractogram quality in both in-silico and in-vivo evaluations. By enabling simultaneous tracking and filtering with an anatomical prior, this method has the potential to enhance clinical applicability of tractography by reducing spurious connections while preserving genuine white-matter pathways.
Abstract
Reinforcement learning (RL)-based tractography is a competitive alternative to machine learning and classical tractography algorithms due to its high anatomical accuracy obtained without the need for any annotated data. However, the reward functions so far used to train RL agents do not encapsulate anatomical knowledge which causes agents to generate spurious false positives tracts. In this paper, we propose a new RL tractography system, TractOracle, which relies on a reward network trained for streamline classification. This network is used both as a reward function during training as well as a mean for stopping the tracking process early and thus reduce the number of false positive streamlines. This makes our system a unique method that evaluates and reconstructs WM streamlines at the same time. We report an improvement of true positive ratios by almost 20\% and a reduction of 3x of false positive ratios on one dataset and an increase between 2x and 7x in the number true positive streamlines on another dataset.
