Refining the Information Bottleneck via Adversarial Information Separation
Shuai Ning, Zhenpeng Wang, Lin Wang, Bing Chen, Shuangrong Liu, Xu Wu, Jin Zhou, Bo Yang
TL;DR
This paper tackles generalization under data scarcity in scientific domains by separating task-relevant information from confounding noise without explicit supervision. It introduces AdverISF, a dual-branch framework with adversarial information separation and a multi-layer architecture that recycles noise to recover subtle, predictive features otherwise lost under uniform compression. The approach uses a self-supervised adversarial mechanism based on joint vs marginal distributions and WGAN-GP training, plus a cascaded design to progressively refine representations. Across synthetic benchmarks and a real-world composite cement design task, AdverISF achieves superior data-scarce performance and better out-of-distribution generalization than strong baselines, highlighting its practical impact for material science and related domains with limited data.
Abstract
Generalizing from limited data is particularly critical for models in domains such as material science, where task-relevant features in experimental datasets are often heavily confounded by measurement noise and experimental artifacts. Standard regularization techniques fail to precisely separate meaningful features from noise, while existing adversarial adaptation methods are limited by their reliance on explicit separation labels. To address this challenge, we propose the Adversarial Information Separation Framework (AdverISF), which isolates task-relevant features from noise without requiring explicit supervision. AdverISF introduces a self-supervised adversarial mechanism to enforce statistical independence between task-relevant features and noise representations. It further employs a multi-layer separation architecture that progressively recycles noise information across feature hierarchies to recover features inadvertently discarded as noise, thereby enabling finer-grained feature extraction. Extensive experiments demonstrate that AdverISF outperforms state-of-the-art methods in data-scarce scenarios. In addition, evaluations on real-world material design tasks show that it achieves superior generalization performance.
