NAISR: A 3D Neural Additive Model for Interpretable Shape Representation
Yining Jiao, Carlton Zdanski, Julia Kimbell, Andrew Prince, Cameron Worden, Samuel Kirse, Christopher Rutter, Benjamin Shields, William Dunn, Jisan Mahmud, Marc Niethammer
TL;DR
NAISR tackles the lack of covariate-aware, interpretable 3D shape representations by introducing a Neural Additive Interpretable Shape Representation that deforms a learned atlas via covariate-driven displacement fields. It fuses deep implicit shape representations with an additive, disentangled deformation model to enable shape reconstruction, evolution, disentanglement, and transfer, while preserving interpretability. Evaluations on Starman, ADNI hippocampus, and pediatric airway demonstrate competitive reconstruction and superior real-data transfer and covariate analysis, outperforming baselines in key real-world scenarios. This framework supports scientific discovery and personalized predictions by making geometric changes attributable to individual covariates and enabling covariate-guided shape synthesis across longitudinal data.
Abstract
Deep implicit functions (DIFs) have emerged as a powerful paradigm for many computer vision tasks such as 3D shape reconstruction, generation, registration, completion, editing, and understanding. However, given a set of 3D shapes with associated covariates there is at present no shape representation method which allows to precisely represent the shapes while capturing the individual dependencies on each covariate. Such a method would be of high utility to researchers to discover knowledge hidden in a population of shapes. For scientific shape discovery, we propose a 3D Neural Additive Model for Interpretable Shape Representation ($\texttt{NAISR}$) which describes individual shapes by deforming a shape atlas in accordance to the effect of disentangled covariates. Our approach captures shape population trends and allows for patient-specific predictions through shape transfer. $\texttt{NAISR}$ is the first approach to combine the benefits of deep implicit shape representations with an atlas deforming according to specified covariates. We evaluate $\texttt{NAISR}$ with respect to shape reconstruction, shape disentanglement, shape evolution, and shape transfer on three datasets: 1) $\textit{Starman}$, a simulated 2D shape dataset; 2) the ADNI hippocampus 3D shape dataset; and 3) a pediatric airway 3D shape dataset. Our experiments demonstrate that $\textit{Starman}$ achieves excellent shape reconstruction performance while retaining interpretability. Our code is available at $\href{https://github.com/uncbiag/NAISR}{https://github.com/uncbiag/NAISR}$.
