NUDF: Neural Unsigned Distance Fields for high resolution 3D medical image segmentation
Kristine Sørensen, Oscar Camara, Ole de Backer, Klaus Kofoed, Rasmus Paulsen
TL;DR
This work introduces Neural Unsigned Distance Fields (NUDF) to perform high-resolution 3D medical image segmentation by learning a continuous distance function directly from CT images, enabling open-surface mesh reconstructions. The method uses a CNN encoder to extract image features and a pointwise FCNN decoder to predict the unsigned distance at queried points, guided by a Shape Diameter–based sampling strategy. NUDF demonstrates superior surface fidelity over a traditional 3D U-net on Left Atrial Appendage segmentation, achieving mesh-quality metrics close to voxel spacing while handling complex, open geometries with lower memory demands. The approach offers a practical path to high-resolution, visualization- and simulation-ready meshes in clinical imaging, with potential applicability to other anatomies requiring detailed surface models.
Abstract
Medical image segmentation is often considered as the task of labelling each pixel or voxel as being inside or outside a given anatomy. Processing the images at their original size and resolution often result in insuperable memory requirements, but downsampling the images leads to a loss of important details. Instead of aiming to represent a smooth and continuous surface in a binary voxel-grid, we propose to learn a Neural Unsigned Distance Field (NUDF) directly from the image. The small memory requirements of NUDF allow for high resolution processing, while the continuous nature of the distance field allows us to create high resolution 3D mesh models of shapes of any topology (i.e. open surfaces). We evaluate our method on the task of left atrial appendage (LAA) segmentation from Computed Tomography (CT) images. The LAA is a complex and highly variable shape, being thus difficult to represent with traditional segmentation methods using discrete labelmaps. With our proposed method, we are able to predict 3D mesh models that capture the details of the LAA and achieve accuracy in the order of the voxel spacing in the CT images.
