X2BR: High-Fidelity 3D Bone Reconstruction from a Planar X-Ray Image with Hybrid Neural Implicit Methods
Gokce Guven, H. Fatih Ugurdag, Hasan F. Ates
TL;DR
This work tackles single-view 3D bone reconstruction from planar X-rays by introducing X2B, a ConvNeXt-based occupancy network that reconstructs high-fidelity skeletal volumes without priors, and X2BR, a template-guided refinement that enforces anatomical plausibility via a biomechanical template and GBCPD++ non-rigid registration. X2B achieves state-of-the-art numerical accuracy (IoU around 0.95; Chamfer-L1 around 0.005), while X2BR improves anatomical realism, yielding better rib curvature and vertebral alignment through template alignment despite slightly lower IoU. The approach leverages a large real-patient dataset of 3D bone meshes paired with digitally reconstructed radiographs, uses DRR inputs for training, and integrates a full pipeline from segmentation (TotalSegmentator) to occupancy calculation, MISE-based mesh extraction, and biomechanically informed registration. Overall, X2B/X2BR demonstrate a robust hybrid framework that balances quantitative accuracy with clinical interpretability, enabling applications in surgical planning and patient-specific biomechanical simulations.
Abstract
Accurate 3D bone reconstruction from a single planar X-ray remains a challenge due to anatomical complexity and limited input data. We propose X2BR, a hybrid neural implicit framework that combines continuous volumetric reconstruction with template-guided non-rigid registration. The core network, X2B, employs a ConvNeXt-based encoder to extract spatial features from X-rays and predict high-fidelity 3D bone occupancy fields without relying on statistical shape models. To further refine anatomical accuracy, X2BR integrates a patient-specific template mesh, constructed using YOLOv9-based detection and the SKEL biomechanical skeleton model. The coarse reconstruction is aligned to the template using geodesic-based coherent point drift, enabling anatomically consistent 3D bone volumes. Experimental results on a clinical dataset show that X2B achieves the highest numerical accuracy, with an IoU of 0.952 and Chamfer-L1 distance of 0.005, outperforming recent baselines including X2V and D2IM-Net. Building on this, X2BR incorporates anatomical priors via YOLOv9-based bone detection and biomechanical template alignment, leading to reconstructions that, while slightly lower in IoU (0.875), offer superior anatomical realism, especially in rib curvature and vertebral alignment. This numerical accuracy vs. visual consistency trade-off between X2B and X2BR highlights the value of hybrid frameworks for clinically relevant 3D reconstructions.
