LOOC: Localizing Organs using Occupancy Networks and Body Surface Depth Images
Pit Henrich, Franziska Mathis-Ullrich
TL;DR
This work tackles the challenge of non-invasively localizing 67 anatomical structures from a single depth image. It introduces a multi-class occupancy-network model conditioned on the sensor point cloud, augmented by a revised SortSample to handle densely packed internal structures, and it generates patient-specific 3D anatomical atlases. Trained on augmented CT-derived masks from the Atlas Dataset, the method outperforms a template-matching baseline on 50 held-out masks and yields qualitative real-world reconstructions for 12 clothed individuals, with an average inference time of about 3.2 seconds on a high-end GPU. The approach promises practical impact for automated medical imaging and diagnostic workflows by enabling accurate, non-invasive localization of critical structures from simple depth sensing, while acknowledging limitations related to pose variation, clothing, and complex anatomy.
Abstract
We introduce a novel approach for the precise localization of 67 anatomical structures from single depth images captured from the exterior of the human body. Our method uses a multi-class occupancy network, trained using segmented CT scans augmented with body-pose changes, and incorporates a specialized sampling strategy to handle densely packed internal organs. Our contributions include the application of occupancy networks for occluded structure localization, a robust method for estimating anatomical positions from depth images, and the creation of detailed, individualized 3D anatomical atlases. We outperform localization using template matching and provide qualitative real-world reconstructions. This method promises improvements in automated medical imaging and diagnostic procedures by offering accurate, non-invasive localization of critical anatomical structures.
