Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity
Guangzhi Xiong, Sanchit Sinha, Aidong Zhang
TL;DR
Neural Additive Experts (NAEs) introduce a per-feature mixture-of-experts architecture with context-gated routing to interpolate between strictly additive GAMs and interacting models. NAEs maintain feature-level interpretability through additive aggregation while enabling flexible, context-dependent feature effects via multiple experts and a softmax gating mechanism, regulated by an expert-variation penalty. Theoretical results show GAM containment, GA2M approximation, and monotone control of additivity as the penalty grows, while experiments demonstrate competitive accuracy with transparent explanations on synthetic and real-world datasets. The framework provides a principled, tunable trade-off between predictive performance and interpretability, with scalable per-feature explanations and explicit bounds on feature contributions.
Abstract
The trade-off between interpretability and accuracy remains a core challenge in machine learning. Standard Generalized Additive Models (GAMs) offer clear feature attributions but are often constrained by their strictly additive nature, which can limit predictive performance. Introducing feature interactions can boost accuracy yet may obscure individual feature contributions. To address these issues, we propose Neural Additive Experts (NAEs), a novel framework that seamlessly balances interpretability and accuracy. NAEs employ a mixture of experts framework, learning multiple specialized networks per feature, while a dynamic gating mechanism integrates information across features, thereby relaxing rigid additive constraints. Furthermore, we propose targeted regularization techniques to mitigate variance among expert predictions, facilitating a smooth transition from an exclusively additive model to one that captures intricate feature interactions while maintaining clarity in feature attributions. Our theoretical analysis and experiments on synthetic data illustrate the model's flexibility, and extensive evaluations on real-world datasets confirm that NAEs achieve an optimal balance between predictive accuracy and transparent, feature-level explanations. The code is available at https://github.com/Teddy-XiongGZ/NAE.
