Sparse Generation: Making Pseudo Labels Sparse for Point Weakly Supervised Object Detection on Low Data Volume
Chuyang Shang, Tian Ma, Wanzhu Ren, Yuancheng Li, Jiayi Yang
TL;DR
This work addresses the density and inefficiency of pseudo-label generation in point weakly supervised object detection under low data volume. It introduces Sparse Generation, a non-networked three-stage pipeline (Mapping, Mask, Regression) that converts dense pseudo labels into sparse representations and refines them with a regression step, aided by perspective-based matching (PADM) to recover missed instances. The approach relies on a lightweight mapping into 2D tensors, a mask-based aggregation to reduce localization bias, and a constrained regression trained with limited supervision, achieving substantial improvements over state-of-the-art on four datasets, including a newly introduced Bullet-Hole dataset for dense objects. Overall, Sparse Generation delivers higher-quality, sparser pseudo labels with lower computational cost, enabling effective one-epoch training in low-data regimes and offering practical gains for dense-object detection tasks.
Abstract
Existing pseudo label generation methods for point weakly supervised object detection are inadequate in low data volume and dense object detection tasks. We consider the generation of weakly supervised pseudo labels as the model's sparse output, and propose Sparse Generation as a solution to make pseudo labels sparse. The method employs three processing stages (Mapping, Mask, Regression), constructs dense tensors through the relationship between data and detector model, optimizes three of its parameters, and obtains a sparse tensor, thereby indirectly obtaining higher quality pseudo labels, and addresses the model's density problem on low data volume. Additionally, we propose perspective-based matching, which provides more rational pseudo boxes for prediction missed on instances. In comparison to the SOTA method, on four datasets (MS COCO-val, RSOD, SIMD, Bullet-Hole), the experimental results demonstrated a significant advantage.
