Learn2Mix: Training Neural Networks Using Adaptive Data Integration
Shyam Venkatasubramanian, Vahid Tarokh
TL;DR
learn2mix introduces a dynamic batch composition strategy that adaptively shifts class proportions toward harder classes based on instantaneous class-wise losses, enabling faster convergence under resource constraints. The method is formalized as a bilevel optimization where network parameters are updated with the current mixing, followed by updating the mixing proportions toward the normalized losses. Theoretical results show convergence of both the network parameters and the mixing vector to optimal targets under standard assumptions, while empirical evaluations across classification, regression, and reconstruction tasks demonstrate consistent convergence acceleration and improved generalization compared with classical training and several baselines. The work highlights the practical impact of adaptive data integration for efficient, robust neural network training in imbalanced and resource-limited environments.
Abstract
Accelerating model convergence in resource-constrained environments is essential for fast and efficient neural network training. This work presents learn2mix, a new training strategy that adaptively adjusts class proportions within batches, focusing on classes with higher error rates. Unlike classical training methods that use static class proportions, learn2mix continually adapts class proportions during training, leading to faster convergence. Empirical evaluations on benchmark datasets show that neural networks trained with learn2mix converge faster than those trained with existing approaches, achieving improved results for classification, regression, and reconstruction tasks under limited training resources and with imbalanced classes. Our empirical findings are supported by theoretical analysis.
