PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition
Xiao Li, Yining Liu, Na Dong, Sitian Qin, Xiaolin Hu
TL;DR
The paper addresses robustness gaps in deep object recognition by introducing PIN++, a large-scale dataset with high-quality part annotations for all IN-1K categories, and a novel Multi-scale Part-supervised Model (MPM) that leverages these annotations. PIN++ enables generation of pseudo part labels for unlabeled IN-1K images, while MPM injects lightweight, multi-resolution part supervision into a standard backbone without increasing inference cost. Across adversarial attacks, common corruptions, and several OOD datasets, MPM improves both robustness and alignment with human vision, and it boosts downstream detection performance when used for backbone initialization. The work highlights the practical value of explicit part-based inductive bias for robust recognition and broader visual understanding tasks. Key contributions include the creation of PIN++, the pseudo-label driven training paradigm, and the MPM architecture that utilizes high-resolution part annotations to achieve robust recognition on a large-scale benchmark.
Abstract
Deep learning-based object recognition systems can be easily fooled by various adversarial perturbations. One reason for the weak robustness may be that they do not have part-based inductive bias like the human recognition process. Motivated by this, several part-based recognition models have been proposed to improve the adversarial robustness of recognition. However, due to the lack of part annotations, the effectiveness of these methods is only validated on small-scale nonstandard datasets. In this work, we propose PIN++, short for PartImageNet++, a dataset providing high-quality part segmentation annotations for all categories of ImageNet-1K (IN-1K). With these annotations, we build part-based methods directly on the standard IN-1K dataset for robust recognition. Different from previous two-stage part-based models, we propose a Multi-scale Part-supervised Model (MPM), to learn a robust representation with part annotations. Experiments show that MPM yielded better adversarial robustness on the large-scale IN-1K over strong baselines across various attack settings. Furthermore, MPM achieved improved robustness on common corruptions and several out-of-distribution datasets. The dataset, together with these results, enables and encourages researchers to explore the potential of part-based models in more real applications.
