InfoCons: Identifying Interpretable Critical Concepts in Point Clouds via Information Theory
Feifei Li, Mi Zhang, Zhaoxiang Wang, Min Yang
TL;DR
The paper tackles the challenge of interpreting point-cloud models by identifying interpretable, causally influential 3D concepts. It introduces InfoCons, an information-bottleneck–based framework that learns a soft, learnable attention bottleneck to decompose a point cloud into concepts and derive a faithful critical subset by maximizing $I(\mathcal{C},Y)$ while constraining $I(X,\mathcal{C})$. A Gaussian prior and noise are incorporated to decouple neighbor information and promote semantic coherence, producing informative score maps $\hat{m}$ with $\hat{z}=\hat{m}\odot z(x)$. Empirical results across ModelNet40, ScanObjectNN, and KITTI show InfoCons yields more interpretable, less redundant explanations compared to baselines, and its applications to data augmentation and adversarial attacks demonstrate practical benefits in safety-critical settings. The framework is scalable across diverse PC models and tasks, though it incurs computational overhead from the attention bottleneck and requires careful hyperparameter tuning.
Abstract
Interpretability of point cloud (PC) models becomes imperative given their deployment in safety-critical scenarios such as autonomous vehicles. We focus on attributing PC model outputs to interpretable critical concepts, defined as meaningful subsets of the input point cloud. To enable human-understandable diagnostics of model failures, an ideal critical subset should be *faithful* (preserving points that causally influence predictions) and *conceptually coherent* (forming semantically meaningful structures that align with human perception). We propose InfoCons, an explanation framework that applies information-theoretic principles to decompose the point cloud into 3D concepts, enabling the examination of their causal effect on model predictions with learnable priors. We evaluate InfoCons on synthetic datasets for classification, comparing it qualitatively and quantitatively with four baselines. We further demonstrate its scalability and flexibility on two real-world datasets and in two applications that utilize critical scores of PC.
