A Language Anchor-Guided Method for Robust Noisy Domain Generalization

Zilin Dai; Lehong Wang; Fangzhou Lin; Yidong Wang; Zhigang Li; Kazunori D Yamada; Ziming Zhang; Wang Lu

A Language Anchor-Guided Method for Robust Noisy Domain Generalization

Zilin Dai, Lehong Wang, Fangzhou Lin, Yidong Wang, Zhigang Li, Kazunori D Yamada, Ziming Zhang, Wang Lu

TL;DR

Domain generalization under distribution shift and label noise remains challenging due to spurious correlations. The authors propose $A^3W$, an NLP-anchor guided framework that aligns image features with class-specific semantic anchors derived from CLIP and employs a softmax-weighted loss to down-weight noisy samples, thereby improving robustness. Empirical results across multiple DG benchmarks show consistent improvements over state-of-the-art methods, with notable gains under higher noise and in semantically rich settings. This knowledge-guided approach demonstrates the practical value of integrating external semantic cues into domain generalization and opens avenues for dynamic anchors and multi-modal extensions.

Abstract

Real-world machine learning applications often struggle with two major challenges: distribution shift and label noise. Models tend to overfit by focusing on redundant and uninformative features in the training data, which makes it hard for them to generalize to the target domain. Noisy data worsens this problem by causing further overfitting to the noise, meaning that existing methods often fail to tell the difference between true, invariant features and misleading, spurious ones. To tackle these issues, we introduce Anchor Alignment and Adaptive Weighting (A3W). This new algorithm uses sample reweighting guided by natural language processing (NLP) anchors to extract more representative features. In simple terms, A3W leverages semantic representations from natural language models as a source of domain-invariant prior knowledge. Additionally, it employs a weighted loss function that adjusts each sample's contribution based on its similarity to the corresponding NLP anchor. This adjustment makes the model more robust to noisy labels. Extensive experiments on standard benchmark datasets show that A3W consistently outperforms state-of-the-art domain generalization methods, offering significant improvements in both accuracy and robustness across different datasets and noise levels.

A Language Anchor-Guided Method for Robust Noisy Domain Generalization

TL;DR

Abstract

A Language Anchor-Guided Method for Robust Noisy Domain Generalization

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (4)