Optimal Layer Selection for Latent Data Augmentation
Tomoumi Takase, Ryo Karakida
TL;DR
This work tackles the challenge of selecting latent-layer positions for data augmentation in neural networks. It introduces AdaLASE, a gradient-based method that assigns and dynamically updates per-layer acceptance ratios $q_i$ to determine where latent-DA should be applied, using a proxy loss $L_{DA}$ and a pseudo-validation objective to guide optimization. Across diverse datasets and models, AdaLASE achieves accuracy comparable to or better than Uniform DA, while revealing that optimal layers for augmentation depend on data regime and task. The proposed framework reduces heuristic layer selection and computational costs, with potential extensions to optimize augmentation types and multiple methods jointly, enhancing automated latent-DA policy search in transfer learning and beyond.
Abstract
While data augmentation (DA) is generally applied to input data, several studies have reported that applying DA to hidden layers in neural networks, i.e., feature augmentation, can improve performance. However, in previous studies, the layers to which DA is applied have not been carefully considered, often being applied randomly and uniformly or only to a specific layer, leaving room for arbitrariness. Thus, in this study, we investigated the trends of suitable layers for applying DA in various experimental configurations, e.g., training from scratch, transfer learning, various dataset settings, and different models. In addition, to adjust the suitable layers for DA automatically, we propose the adaptive layer selection (AdaLASE) method, which updates the ratio to perform DA for each layer based on the gradient descent method during training. The experimental results obtained on several image classification datasets indicate that the proposed AdaLASE method altered the ratio as expected and achieved high overall test accuracy.
