Revisiting Few-Shot Learning from a Causal Perspective
Guoliang Lin, Yongheng Xu, Hanjiang Lai, Jian Yin
TL;DR
This work reframes metric-based few-shot learning through the lens of causal inference, specifically front-door adjustment, to identify true causal effects from inputs to labels while mitigating unobserved confounders. It shows that canonical methods like Matching Networks, Prototypical Networks, and CLIP/Tip-Adapter fit certain front-door forms, and it proposes two practical causal methods—an ensemble approach and a stochastic mapping strategy—to incorporate diverse representations and strengthen causal cues. Empirical results across 10 datasets demonstrate consistent improvements over strong baselines and existing prompt-based methods, with significant gains on ImageNet and robust performance across multiple CLIP/BLIP backbones. The work advances a principled integration of causality and representation diversity in FSL, offering a clear path toward more generalizable few-shot models with practical impact in vision-language tasks.
Abstract
Few-shot learning with $N$-way $K$-shot scheme is an open challenge in machine learning. Many metric-based approaches have been proposed to tackle this problem, e.g., the Matching Networks and CLIP-Adapter. Despite that these approaches have shown significant progress, the mechanism of why these methods succeed has not been well explored. In this paper, we try to interpret these metric-based few-shot learning methods via causal mechanism. We show that the existing approaches can be viewed as specific forms of front-door adjustment, which can alleviate the effect of spurious correlations and thus learn the causality. This causal interpretation could provide us a new perspective to better understand these existing metric-based methods. Further, based on this causal interpretation, we simply introduce two causal methods for metric-based few-shot learning, which considers not only the relationship between examples but also the diversity of representations. Experimental results demonstrate the superiority of our proposed methods in few-shot classification on various benchmark datasets. Code is available in https://github.com/lingl1024/causalFewShot.
