Table of Contents
Fetching ...

Few-Shot Object Detection: Research Advances and Challenges

Zhimeng Xin, Shiming Chen, Tianxu Wu, Yuanjie Shao, Weiping Ding, Xinge You

TL;DR

This paper surveys few-shot object detection (FSOD), addressing how to detect novel objects with limited annotations by leveraging two-stage transfer learning from a richly labeled base set. It introduces a taxonomy dividing FSOD methods into episode-task-based and single-task-based approaches, and surveys representative algorithms that span meta-learning, RPN/Transformer fusion, interclass relations, knowledge distillation, context reasoning, and decoupled detection networks. The work highlights core datasets (VOC, COCO, LVIS, FSOD, ImageNet) and evaluation protocols, analyzes theoretical and empirical trade-offs, and discusses challenges such as data scarcity, domain shift, and localization quality, while pointing to future directions like hybrid learning, improved localization, and practical deployment scenarios. Overall, the survey clarifies how different transfer-learning strategies enable rapid adaptation to novel categories and frames FSOD as a practical tool for real-world detection under data constraints.

Abstract

Object detection as a subfield within computer vision has achieved remarkable progress, which aims to accurately identify and locate a specific object from images or videos. Such methods rely on large-scale labeled training samples for each object category to ensure accurate detection, but obtaining extensive annotated data is a labor-intensive and expensive process in many real-world scenarios. To tackle this challenge, researchers have explored few-shot object detection (FSOD) that combines few-shot learning and object detection techniques to rapidly adapt to novel objects with limited annotated samples. This paper presents a comprehensive survey to review the significant advancements in the field of FSOD in recent years and summarize the existing challenges and solutions. Specifically, we first introduce the background and definition of FSOD to emphasize potential value in advancing the field of computer vision. We then propose a novel FSOD taxonomy method and survey the plentifully remarkable FSOD algorithms based on this fact to report a comprehensive overview that facilitates a deeper understanding of the FSOD problem and the development of innovative solutions. Finally, we discuss the advantages and limitations of these algorithms to summarize the challenges, potential research direction, and development trend of object detection in the data scarcity scenario.

Few-Shot Object Detection: Research Advances and Challenges

TL;DR

This paper surveys few-shot object detection (FSOD), addressing how to detect novel objects with limited annotations by leveraging two-stage transfer learning from a richly labeled base set. It introduces a taxonomy dividing FSOD methods into episode-task-based and single-task-based approaches, and surveys representative algorithms that span meta-learning, RPN/Transformer fusion, interclass relations, knowledge distillation, context reasoning, and decoupled detection networks. The work highlights core datasets (VOC, COCO, LVIS, FSOD, ImageNet) and evaluation protocols, analyzes theoretical and empirical trade-offs, and discusses challenges such as data scarcity, domain shift, and localization quality, while pointing to future directions like hybrid learning, improved localization, and practical deployment scenarios. Overall, the survey clarifies how different transfer-learning strategies enable rapid adaptation to novel categories and frames FSOD as a practical tool for real-world detection under data constraints.

Abstract

Object detection as a subfield within computer vision has achieved remarkable progress, which aims to accurately identify and locate a specific object from images or videos. Such methods rely on large-scale labeled training samples for each object category to ensure accurate detection, but obtaining extensive annotated data is a labor-intensive and expensive process in many real-world scenarios. To tackle this challenge, researchers have explored few-shot object detection (FSOD) that combines few-shot learning and object detection techniques to rapidly adapt to novel objects with limited annotated samples. This paper presents a comprehensive survey to review the significant advancements in the field of FSOD in recent years and summarize the existing challenges and solutions. Specifically, we first introduce the background and definition of FSOD to emphasize potential value in advancing the field of computer vision. We then propose a novel FSOD taxonomy method and survey the plentifully remarkable FSOD algorithms based on this fact to report a comprehensive overview that facilitates a deeper understanding of the FSOD problem and the development of innovative solutions. Finally, we discuss the advantages and limitations of these algorithms to summarize the challenges, potential research direction, and development trend of object detection in the data scarcity scenario.
Paper Structure (35 sections, 12 equations, 10 figures, 7 tables)

This paper contains 35 sections, 12 equations, 10 figures, 7 tables.

Figures (10)

  • Figure 1: Processing of mainstream FSOD.
  • Figure 2: The processing of transfer learning.
  • Figure 3: The organizational structure of FSOD taxonomy in this survey.
  • Figure 4: The processing of episode-task-based approach.
  • Figure 5: The processing of single-task-based method.
  • ...and 5 more figures