Revisiting Long-Tailed Learning: Insights from an Architectural Perspective

Yuhan Pan; Yanan Sun; Wei Gong

TL;DR

This work addresses long-tailed (LT) recognition by shifting the focus from data and losses to neural architecture design. By systematically analyzing architectural components, the authors identify bottleneck topology, aggregated/hierarchical convolutions, activation placement, and BatchNorm as LT-friendly factors, and propose two LT-specific convolutions, LT-AggConv and LT-HierConv. They then introduce LT-DARTS, an LT-aware neural architecture search method featuring an LT-friendly search space and a Balanced Fixed Classifier to mitigate bias during search. Across CIFAR-LT, Places-LT, ImageNet-LT, and iNaturalist-LT, LT-DARTS delivers consistent architectural gains, achieving state-of-the-art results when combined with existing LT techniques and reducing tail-class error without sacrificing head-class performance. The findings demonstrate that architecture design is a powerful, orthogonal lever for improving LT performance and can be readily integrated with prevailing LT strategies.

Abstract

Long-Tailed (LT) recognition has been widely studied to tackle the challenge of imbalanced data distributions in real-world applications. However, the design of neural architectures for LT settings has received limited attention, despite evidence that architecture choices can substantially affect performance. This paper aims to bridge the gap between LT challenges and neural network design by providing an in-depth analysis of how various architectures influence LT performance. Specifically, we systematically examine how key network components, such as topology, convolutions, and activation functions, affect LT handling. Based on these observations, we propose two convolutional operations tailored to LT data. Recognizing that operation interactions are also crucial to network effectiveness, we apply Neural Architecture Search (NAS) to facilitate efficient exploration. We propose LT-DARTS, a NAS method with a novel search space and search strategy specifically designed for LT data. Experimental results demonstrate that our approach consistently outperforms existing architectures across multiple LT datasets, achieving parameter-efficient, state-of-the-art results when integrated with current LT methods.
