Interpretable Machine Learning for Kronecker Coefficients

Giorgi Butbaia; Kyu-Hwan Lee; Fabian Ruehle

TL;DR

The paper addresses the problem of predicting whether the Kronecker coefficients $g_{\lambda,\mu}^\nu$ of the symmetric group vanish; deciding this is known to be NP-hard. It develops and compares several interpretable ML approaches (gradient saliency, Kolmogorov-Arnold Networks, small neural networks, and symbolic regression), using as input either $3n$-dimensional partition triples or 1-dimensional $b$-loadings. The interpretable models reach about 83% accuracy and yield simple, explicit decision functions in terms of $b$-loadings, while transformer architectures give the strongest predictive performance at over 99%. The work shows how saliency analysis and interpretable models can give quick insight into a deep combinatorial problem, and it points to future work on mechanistic interpretability and richer feature representations.

Abstract

We analyze the saliency of neural networks and employ interpretable machine learning models to predict whether the Kronecker coefficients of the symmetric group are zero or not. Our models use triples of partitions as input features, as well as b-loadings derived from the principal component of an embedding that captures the differences between partitions. Across all approaches, we achieve an accuracy of approximately 83% and derive explicit formulas for a decision function in terms of b-loadings. Additionally, we develop transformer-based models for prediction, achieving the highest reported accuracy of over 99%.
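
To make the prediction target concrete, the sketch below computes a Kronecker coefficient exactly for small $n$ and reports whether it is zero. It is not the paper's method: it is a brute-force use of the standard character formula $g_{\lambda,\mu}^\nu = \sum_{\rho \vdash n} z_\rho^{-1}\, \chi^\lambda(\rho)\, \chi^\mu(\rho)\, \chi^\nu(\rho)$, with the characters obtained from the Murnaghan-Nakayama rule. The function names (`partitions`, `character`, `kronecker`) are illustrative choices, and partitions are assumed to be passed as weakly decreasing tuples of positive integers.

```python
from collections import Counter
from fractions import Fraction
from functools import lru_cache
from math import factorial


def partitions(n, max_part=None):
    """Yield all partitions of n as weakly decreasing tuples."""
    if max_part is None:
        max_part = n
    if n == 0:
        yield ()
        return
    for k in range(min(n, max_part), 0, -1):
        for rest in partitions(n - k, k):
            yield (k,) + rest


@lru_cache(maxsize=None)
def character(lam, rho):
    """Character chi^lam evaluated on cycle type rho (Murnaghan-Nakayama).

    Rim hooks are removed via beta-numbers: removing a hook of length r
    replaces one beta-number b by b - r, and the sign is determined by how
    many beta-numbers lie strictly between b - r and b.
    """
    if not rho:
        return 1 if not lam else 0
    r, rest = rho[0], rho[1:]
    k = len(lam)
    beta = [lam[i] + (k - 1 - i) for i in range(k)]
    beta_set = set(beta)
    total = 0
    for b in beta:
        c = b - r
        if c < 0 or c in beta_set:
            continue  # no rim hook of length r can be removed here
        height = sum(1 for x in beta if c < x < b)
        new_beta = sorted((beta_set - {b}) | {c}, reverse=True)
        new_lam = tuple(new_beta[i] - (k - 1 - i) for i in range(k))
        new_lam = tuple(p for p in new_lam if p > 0)
        total += (-1) ** height * character(new_lam, rest)
    return total


def centralizer_order(rho):
    """z_rho = prod_i i^{m_i} * m_i!, where m_i is the multiplicity of i."""
    out = 1
    for part, mult in Counter(rho).items():
        out *= part ** mult * factorial(mult)
    return out


def kronecker(lam, mu, nu):
    """Exact Kronecker coefficient for three partitions of the same n."""
    lam, mu, nu = tuple(lam), tuple(mu), tuple(nu)
    n = sum(lam)
    if sum(mu) != n or sum(nu) != n:
        raise ValueError("all three partitions must have the same size")
    total = Fraction(0)
    for rho in partitions(n):
        product = character(lam, rho) * character(mu, rho) * character(nu, rho)
        total += Fraction(product, centralizer_order(rho))
    return int(total)


if __name__ == "__main__":
    triple = ((2, 1), (2, 1), (2, 1))
    g = kronecker(*triple)
    print(triple, "coefficient:", g, "nonzero:", g != 0)
```

The cost of this direct computation grows quickly with $n$, since it sums over all partitions of $n$ and evaluates three characters for each; that cost is part of why a fast learned predictor for the zero/nonzero label is of interest.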
