Let the Void Be Void: Robust Open-Set Semi-Supervised Learning via Selective Non-Alignment
You Rim Choi, Subeom Park, Seojun Heo, Eunchung Noh, Hyung-Sin Kim
TL;DR
SkipAlign tackles open-set semi-supervised learning by introducing Selective Non-Alignment (SNA), a third contrastive operator that pulls only confident ID samples toward their prototypes while applying angular repulsion to uncertain or OOD samples, creating dense ID clusters within a diffuse OOD void. A dual-gate confidence mechanism and a projection head enable SNA to refine representations and prototypes, balancing ID discrimination and OOD detection. The method combines three losses—L_CC, L_OD, and L_SNA—with adaptive prototype refinement, achieving strong generalization to unseen OOD across CIFAR-10/100, ImageNet-30, and TinyImageNet, with notable improvements in overall OOD-AUC and competitive closed-set accuracy. Theoretical analysis links SNA-driven angular dynamics to a larger geometric margin γ_SNA, which reduces Rademacher complexity and yields robust performance for OOD detectors in OSS scenarios, while qualitative analyses illustrate the resulting ID galaxies and interstellar OOD void.
Abstract
Open-set semi-supervised learning (OSSL) leverages unlabeled data containing both in-distribution (ID) and unknown out-of-distribution (OOD) samples, aiming simultaneously to improve closed-set accuracy and detect novel OOD instances. Existing methods either discard valuable information from uncertain samples or force-align every unlabeled sample into one or a few synthetic "catch-all" representations, resulting in geometric collapse and overconfidence on only seen OODs. To address the limitations, we introduce selective non-alignment, adding a novel "skip" operator into conventional pull and push operations of contrastive learning. Our framework, SkipAlign, selectively skips alignment (pulling) for low-confidence unlabeled samples, retaining only gentle repulsion against ID prototypes. This approach transforms uncertain samples into a pure repulsion signal, resulting in tighter ID clusters and naturally dispersed OOD features. Extensive experiments demonstrate that SkipAlign significantly outperforms state-of-the-art methods in detecting unseen OOD data without sacrificing ID classification accuracy.
