On the Relationship Between Double Descent of CNNs and Shape/Texture Bias Under Learning Process
Shun Iwase, Shuya Takahashi, Nakamasa Inoue, Rio Yokota, Ryo Nakamura, Hirokatsu Kataoka
TL;DR
This work investigates the relationship between epoch-wise double descent in CNNs and learning of shape versus texture features in natural images. It combines a phase-based framework for tracking test error with Islam's neuron-based bias quantification to examine how shape/texture bias evolves during training under double-descent conditions, including Nakkiran's CIFAR-10 setup and various ablations. The study finds that shape bias often correlates positively with test error during Phases 1-2 while texture bias correlates negatively, with inflection points aligning closely to error changes; double descent/ascent of bias can occur even without label noise, and deeper layers show stronger synchronization with test performance. These results provide mechanistic insight into the learning dynamics underlying double descent in CNNs and suggest bias trajectories could inform training strategies and bias-aware modeling for image recognition tasks.
Abstract
The double descent phenomenon, which deviates from the traditional bias-variance trade-off theory, attracts considerable research attention; however, the mechanism of its occurrence is not fully understood. On the other hand, in the study of convolutional neural networks (CNNs) for image recognition, methods are proposed to quantify the bias on shape features versus texture features in images, determining which features the CNN focuses on more. In this work, we hypothesize that there is a relationship between the shape/texture bias in the learning process of CNNs and epoch-wise double descent, and we conduct verification. As a result, we discover double descent/ascent of shape/texture bias synchronized with double descent of test error under conditions where epoch-wise double descent is observed. Quantitative evaluations confirm this correlation between the test errors and the bias values from the initial decrease to the full increase in test error. Interestingly, double descent/ascent of shape/texture bias is observed in some cases even in conditions without label noise, where double descent is thought not to occur. These experimental results are considered to contribute to the understanding of the mechanisms behind the double descent phenomenon and the learning process of CNNs in image recognition.
