Causal Invariance and Counterfactual Learning Driven Cooperative Game for Multi-Label Classification
Yijia Fan, Jusheng Zhang, Kaitong Cai, Jing Yang, Keze Wang
TL;DR
The paper tackles the difficulty of rare-label prediction and robustness to distribution shifts in multi-label classification by introducing the Causal Cooperative Game (CCG) framework. CCG models MLC as a multi-player cooperative game in which players learn true label dependencies via Neural Structural Equation Models, guided by a counterfactual curiosity reward and a causal invariance loss, with targeted rare-label enhancement. Extensive experiments on four text benchmarks show consistent improvements in Rare-F1 and robustness to distribution shifts, supported by ablation studies and qualitative analyses of learned causal graphs that demonstrate interpretability. The work highlights the value of integrating causal inference with cooperative game theory to build more robust, generalizable, and explainable MLC systems.
Abstract
Multi-label classification (MLC) remains vulnerable to label imbalance, spurious correlations, and distribution shifts, challenges that are particularly detrimental to rare label prediction. To address these limitations, we introduce the Causal Cooperative Game (CCG) framework, which conceptualizes MLC as a cooperative multi-player interaction. CCG unifies explicit causal discovery via Neural Structural Equation Models with a counterfactual curiosity reward to drive robust feature learning. Furthermore, it incorporates a causal invariance loss to ensure generalization across diverse environments, complemented by a specialized enhancement strategy for rare labels. Extensive benchmarking demonstrates that CCG substantially outperforms strong baselines in both rare label prediction and overall robustness. Through rigorous ablation studies and qualitative analysis, we validate the efficacy and interpretability of our components, underscoring the potential of synergizing causal inference with cooperative game theory for advancing multi-label learning.
