Table of Contents
Fetching ...

Multiplicative-Additive Constrained Models:Toward Joint Visualization of Interactive and Independent Effects

Fumin Wang

TL;DR

This work tackles interpretability in high stakes settings by addressing GAMs limited to independent effects and CESR's entangled multiplicative terms. It introduces Multiplicative-Additive Constrained Models (MACMs), a framework that combines a multiplicative component with an additive component to decouple coefficients and expand hypothesis space, using visualizable univariate shape functions. Neural network based MACMs (MACMs(NNs)) outperform CESR and state of the art NN based GAMs across regression and classification tasks, while also preserving interpretability through shape function visualizations and dynamic influence curves. Overall, MACMs provide a practical bridge between multiplicative interaction modeling and additive interpretable structure, delivering improved predictive performance without sacrificing transparent decision reasoning in domains such as healthcare.

Abstract

Interpretability is one of the considerations when applying machine learning to high-stakes fields such as healthcare that involve matters of life safety. Generalized Additive Models (GAMs) enhance interpretability by visualizing shape functions. Nevertheless, to preserve interpretability, GAMs omit higher-order interaction effects (beyond pairwise interactions), which imposes significant constraints on their predictive performance. We observe that Curve Ergodic Set Regression (CESR), a multiplicative model, naturally enables the visualization of its shape functions and simultaneously incorporates both interactions among all features and individual feature effects. Nevertheless, CESR fails to demonstrate superior performance compared to GAMs. We introduce Multiplicative-Additive Constrained Models (MACMs), which augment CESR with an additive part to disentangle the intertwined coefficients of its interactive and independent terms, thus effectively broadening the hypothesis space. The model is composed of a multiplicative part and an additive part, whose shape functions can both be naturally visualized, thereby assisting users in interpreting how features participate in the decision-making process. Consequently, MACMs constitute an improvement over both CESR and GAMs. The experimental results indicate that neural network-based MACMs significantly outperform both CESR and the current state-of-the-art GAMs in terms of predictive performance.

Multiplicative-Additive Constrained Models:Toward Joint Visualization of Interactive and Independent Effects

TL;DR

This work tackles interpretability in high stakes settings by addressing GAMs limited to independent effects and CESR's entangled multiplicative terms. It introduces Multiplicative-Additive Constrained Models (MACMs), a framework that combines a multiplicative component with an additive component to decouple coefficients and expand hypothesis space, using visualizable univariate shape functions. Neural network based MACMs (MACMs(NNs)) outperform CESR and state of the art NN based GAMs across regression and classification tasks, while also preserving interpretability through shape function visualizations and dynamic influence curves. Overall, MACMs provide a practical bridge between multiplicative interaction modeling and additive interpretable structure, delivering improved predictive performance without sacrificing transparent decision reasoning in domains such as healthcare.

Abstract

Interpretability is one of the considerations when applying machine learning to high-stakes fields such as healthcare that involve matters of life safety. Generalized Additive Models (GAMs) enhance interpretability by visualizing shape functions. Nevertheless, to preserve interpretability, GAMs omit higher-order interaction effects (beyond pairwise interactions), which imposes significant constraints on their predictive performance. We observe that Curve Ergodic Set Regression (CESR), a multiplicative model, naturally enables the visualization of its shape functions and simultaneously incorporates both interactions among all features and individual feature effects. Nevertheless, CESR fails to demonstrate superior performance compared to GAMs. We introduce Multiplicative-Additive Constrained Models (MACMs), which augment CESR with an additive part to disentangle the intertwined coefficients of its interactive and independent terms, thus effectively broadening the hypothesis space. The model is composed of a multiplicative part and an additive part, whose shape functions can both be naturally visualized, thereby assisting users in interpreting how features participate in the decision-making process. Consequently, MACMs constitute an improvement over both CESR and GAMs. The experimental results indicate that neural network-based MACMs significantly outperform both CESR and the current state-of-the-art GAMs in terms of predictive performance.

Paper Structure

This paper contains 19 sections, 26 equations, 7 figures, 2 tables.

Figures (7)

  • Figure 1: Architecture of MACMs(NNs), consisting of a multiplicative and an additive part. The multiplicative part is constructed as the product of multiple subnetworks, while the additive part is the sum of multiple subnetworks.
  • Figure 2: Multiplicative Part and Additive Part Shape Functions of MACMs(NNs) on CA Hosuing(modified).
  • Figure 3: Multiplicative Part and Additive Part Shape Functions of MACMs(NNs) on CA Hosuing(modified)
  • Figure 4: Multiplicative Part Shape Functions of MACMs(NNs) on Stroke Prediction.
  • Figure 5: Additive Part Shape Functions of MACMs(NNs) on Stroke Prediction.
  • ...and 2 more figures