Rapid patient-specific neural networks for intraoperative X-ray to volume registration
Vivek Gopalakrishnan, Neel Dey, David-Dimitris Chlorogiannis, Andrew Abumoussa, Anna M. Larson, Darren B. Orbach, Sarah Frisken, Polina Golland
TL;DR
This work tackles the challenge of robust, fast 2D/3D registration between intraoperative X-ray images and preoperative volumes, a key bottleneck in image-guided interventions. The authors propose xvr, a self-supervised framework that trains patient-specific pose regression networks using synthetic X-rays generated from a patient’s own preoperative imaging via a differentiable X-ray renderer, followed by rapid gradient-based pose refinement. A major contribution is the amortized training strategy: pretrain a patient-agnostic model on diverse datasets and then finetune per patient in about five minutes, enabling practical use in emergencies and routine procedures. Extensive evaluation across pelvic, neurovascular, and skull cases from multiple hospitals demonstrates submillimeter accuracy and robust performance, with open-source release to facilitate broad adoption and further development.
Abstract
The integration of artificial intelligence in image-guided interventions holds transformative potential, promising to extract 3D geometric and quantitative information from conventional 2D imaging modalities during complex procedures. Achieving this requires the rapid and precise alignment of 2D intraoperative images (e.g., X-ray) with 3D preoperative volumes (e.g., CT, MRI). However, current 2D/3D registration methods fail across the broad spectrum of procedures dependent on X-ray guidance: traditional optimization techniques require custom parameter tuning for each subject, whereas neural networks trained on small datasets do not generalize to new patients or require labor-intensive manual annotations, increasing clinical burden and precluding application to new anatomical targets. To address these challenges, we present xvr, a fully automated framework for training patient-specific neural networks for 2D/3D registration. xvr uses physics-based simulation to generate abundant high-quality training data from a patient's own preoperative volumetric imaging, thereby overcoming the inherently limited ability of supervised models to generalize to new patients and procedures. Furthermore, xvr requires only 5 minutes of training per patient, making it suitable for emergency interventions as well as planned procedures. We perform the largest evaluation of a 2D/3D registration algorithm on real X-ray data to date and find that xvr robustly generalizes across a diverse dataset comprising multiple anatomical structures, imaging modalities, and hospitals. Across surgical tasks, xvr achieves submillimeter-accurate registration at intraoperative speeds, improving upon existing methods by an order of magnitude. xvr is released as open-source software freely available at https://github.com/eigenvivek/xvr.
