VIPriors 4: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
Robert-Jan Bruintjes, Attila Lengyel, Marcos Baptista Rios, Osman Semih Kayhan, Davide Zambrano, Nergis Tomen, Jan van Gemert
TL;DR
VIPriors 4 investigates visual inductive priors for data-efficient deep learning under scratch-training constraints, focusing on object detection and instance segmentation. Top solutions leverage extensive data augmentation, model ensembling, and model-soup techniques, with notable prior-based contributions such as Orthogonal Uncertainty Representation and Image Uncertainty Weighted enhancing performance. The results illustrate that, despite some gains from priors, practical data-efficient performance largely hinges on engineering strategies, pointing to a path where scalable priors and efficient training become more prominent. Overall, the edition highlights both persistent effectiveness of augmentation/ensembling and the emerging potential of principled priors to improve data efficiency in vision tasks.
Abstract
The fourth edition of the "VIPriors: Visual Inductive Priors for Data-Efficient Deep Learning" workshop features two data-impaired challenges. These challenges address the problem of training deep learning models for computer vision tasks with limited data. Participants are limited to training models from scratch using a low number of training samples and are not allowed to use any form of transfer learning. We aim to stimulate the development of novel approaches that incorporate inductive biases to improve the data efficiency of deep learning models. Significant advancements are made compared to the provided baselines, where winning solutions surpass the baselines by a considerable margin in both tasks. As in previous editions, these achievements are primarily attributed to heavy use of data augmentation policies and large model ensembles, though novel prior-based methods seem to contribute more to successful solutions compared to last year. This report highlights the key aspects of the challenges and their outcomes.
