Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume

Reef Alturki; Adrian Hilton; Jean-Yves Guillemaut

Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume

Reef Alturki, Adrian Hilton, Jean-Yves Guillemaut

TL;DR

This work tackles occlusion in multi-view pedestrian detection by fusing a unified 3D feature volume with a probabilistic occupancy volume derived from a visual hull. The encoder extracts multi-view features, which are lifted into a 3D volume via 3D feature-pulling, while silhouettes from Mask R-CNN inform a PVH that highlights pedestrian-occupied voxels; the two representations are fused and decoded into a ground-plane detection map using CenterNet-inspired heads. The proposed PVH integration yields state-of-the-art MODA on MultiviewX (97.3%) and competitive performance on Wildtrack, with ablations showing the benefit of PVH over traditional visual hulls and the superiority of the concatenation-based integration. Overall, the method achieves improved localization under occlusion with modest computational overhead, making it suitable for real-time multi-view pedestrian detection and potential tracking applications.

Abstract

Occlusion poses a significant challenge in pedestrian detection from a single view. To address this, multi-view detection systems have been utilized to aggregate information from multiple perspectives. Recent advances in multi-view detection utilized an early-fusion strategy that strategically projects the features onto the ground plane, where detection analysis is performed. A promising approach in this context is the use of 3D feature-pulling technique, which constructs a 3D feature volume of the scene by sampling the corresponding 2D features for each voxel. However, it creates a 3D feature volume of the whole scene without considering the potential locations of pedestrians. In this paper, we introduce a novel model that efficiently leverages traditional 3D reconstruction techniques to enhance deep multi-view pedestrian detection. This is accomplished by complementing the 3D feature volume with probabilistic occupancy volume, which is constructed using the visual hull technique. The probabilistic occupancy volume focuses the model's attention on regions occupied by pedestrians and improves detection accuracy. Our model outperforms state-of-the-art models on the MultiviewX dataset, with an MODA of 97.3%, while achieving competitive performance on the Wildtrack dataset.

Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume

TL;DR

Abstract

Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)