BPJDet: Extended Object Representation for Generic Body-Part Joint Detection
Huayi Zhou, Fei Jiang, Jiaxin Si, Yue Ding, Hongtao Lu
TL;DR
BPJDet tackles the joint detection and association of human bodies and their parts by introducing an extended object representation that integrates the center offsets of body parts. It supports both anchor-based and anchor-free detectors, uses a multi-task loss to train detection and association end-to-end, and employs an association decoding scheme to link parts to bodies without error-prone post-matching. Across CityPersons, CrowdHuman, BodyHands, COCOHumanParts, and Animals5C, BPJDet achieves state-of-the-art body-part association performance while keeping high detection accuracy. Its association capability also improves two downstream applications: accurate crowd head detection and hand contact estimation. The approach generalizes to quadruped animals, and the project is publicly available.
Abstract
Detection of the human body and its parts has been intensively studied. However, most CNN-based detectors are trained independently, making it difficult to associate detected parts with the body they belong to. In this paper, we focus on the joint detection of the human body and its parts. Specifically, we propose a novel extended object representation that integrates the center offsets of body parts, and we construct an end-to-end generic Body-Part Joint Detector (BPJDet). In this way, body-part associations are neatly embedded in a unified representation containing both semantic and geometric content. We can therefore optimize a multi-task loss to tackle multiple tasks synergistically. Moreover, this representation is suitable for both anchor-based and anchor-free detectors. BPJDet does not suffer from error-prone post-matching and achieves a better trade-off between speed and accuracy. Furthermore, BPJDet can be generalized to detect a single body part (body-part) or multiple body parts (body-parts) of either humans or quadruped animals. To verify the superiority of BPJDet, we conduct experiments on body-part datasets (CityPersons, CrowdHuman, and BodyHands) and body-parts datasets (COCOHumanParts and Animals5C). While keeping high detection accuracy, BPJDet achieves state-of-the-art association performance on all datasets. In addition, we show the benefits of this advanced body-part association capability by improving the performance of two representative downstream applications: accurate crowd head detection and hand contact estimation. The project is available at https://hnuzhy.github.io/projects/BPJDet.
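To make the idea of an extended representation concrete, the following is a minimal illustrative sketch, not the released BPJDet code. It assumes that each body detection carries a predicted offset from the body center to the center of its part, and that association is decoded by picking the detected part whose center lies nearest to that predicted location. The names `Box`, `BodyDet`, `associate`, and the threshold `max_dist` are hypothetical and chosen only for this example; the actual decoding rule in BPJDet may differ.

```python
from dataclasses import dataclass
from math import hypot
from typing import List, Optional, Tuple


@dataclass
class Box:
    """Axis-aligned box with a confidence score (hypothetical container)."""
    x1: float
    y1: float
    x2: float
    y2: float
    score: float

    @property
    def center(self) -> Tuple[float, float]:
        return ((self.x1 + self.x2) / 2.0, (self.y1 + self.y2) / 2.0)


@dataclass
class BodyDet:
    """Body box extended with an offset from body center to part center.

    Assumption: the offset is expressed in the same pixel units as the box.
    """
    box: Box
    part_offset: Tuple[float, float]

    def predicted_part_center(self) -> Tuple[float, float]:
        cx, cy = self.box.center
        dx, dy = self.part_offset
        return (cx + dx, cy + dy)


def associate(bodies: List[BodyDet], parts: List[Box],
              max_dist: float) -> List[Optional[int]]:
    """For each body, return the index of the nearest part, or None.

    A part is accepted only if its center is within max_dist of the
    center predicted by the body. One-to-one matching is not enforced
    in this sketch.
    """
    matches: List[Optional[int]] = []
    for body in bodies:
        px, py = body.predicted_part_center()
        best_idx: Optional[int] = None
        best_dist = max_dist
        for idx, part in enumerate(parts):
            cx, cy = part.center
            dist = hypot(cx - px, cy - py)
            if dist <= best_dist:
                best_idx, best_dist = idx, dist
        matches.append(best_idx)
    return matches
```

Because the offset is predicted together with the body box, the lookup above needs no separate matching model: a body whose predicted part location has no nearby part detection is simply left unmatched.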
