SynerMix: Synergistic Mixup Solution for Enhanced Intra-Class Cohesion and Inter-Class Separability in Image Classification
Ye Xu, Ya Gao, Xiaorong Qiu, Yang Chen, Ying Ji
TL;DR
SynerMix tackles two key gaps in MixUp-based data augmentation for image classification: the neglect of intra-class mixup and the lack of explicit intra-class cohesion in learned feature spaces. It introduces SynerMix-Intra, which synthesizes a single per-class feature from unaugmented originals to strengthen intra-class cohesion, and then integrates this with an inter-class mixup component (e.g., MixUp or Manifold MixUp) to form SynerMix, with total loss $ ext{L}_{ ext{total}} = eta ext{L}_{ ext{intra}} + (1-eta) ext{L}_{ ext{inter}}$. Empirical results across six datasets show that intra-class mixup yields 0.22–3.09% gains on its own (avg ≈1.25%), while the full SynerMix improves performance over the best of either component by 0.1–3.43% (avg 1.16%) and over the top of the individual intra/inter methods by 0.12–5.16% (avg 1.11%). The method is model-agnostic and extends to other domains (e.g., speech, text), with findings also highlighting accelerated convergence and a nuanced role for gradient stochasticity due to synthesized features. Code is publicly available, enabling adoption and further exploration of synergistic intra- and inter-class mixup strategies.
Abstract
To address the issues of MixUp and its variants (e.g., Manifold MixUp) in image classification tasks-namely, their neglect of mixing within the same class (intra-class mixup) and their inadequacy in enhancing intra-class cohesion through their mixing operations-we propose a novel mixup method named SynerMix-Intra and, building upon this, introduce a synergistic mixup solution named SynerMix. SynerMix-Intra specifically targets intra-class mixup to bolster intra-class cohesion, a feature not addressed by current mixup methods. For each mini-batch, it leverages feature representations of unaugmented original images from each class to generate a synthesized feature representation through random linear interpolation. All synthesized representations are then fed into the classification and loss layers to calculate an average classification loss that significantly enhances intra-class cohesion. Furthermore, SynerMix combines SynerMix-Intra with an existing mixup approach (e.g., MixUp, Manifold MixUp), which primarily focuses on inter-class mixup and has the benefit of enhancing inter-class separability. In doing so, it integrates both inter- and intra-class mixup in a balanced way while concurrently improving intra-class cohesion and inter-class separability. Experimental results on six datasets show that SynerMix achieves a 0.1% to 3.43% higher accuracy than the best of either MixUp or SynerMix-Intra alone, averaging a 1.16% gain. It also surpasses the top-performer of either Manifold MixUp or SynerMix-Intra by 0.12% to 5.16%, with an average gain of 1.11%. Given that SynerMix is model-agnostic, it holds significant potential for application in other domains where mixup methods have shown promise, such as speech and text classification. Our code is publicly available at: https://github.com/wxitxy/synermix.git.
