Perspective-Invariant 3D Object Detection

Ao Liang; Lingdong Kong; Dongyue Lu; Youquan Liu; Jian Fang; Huaici Zhao; Wei Tsang Ooi

Perspective-Invariant 3D Object Detection

Ao Liang, Lingdong Kong, Dongyue Lu, Youquan Liu, Jian Fang, Huaici Zhao, Wei Tsang Ooi

TL;DR

This work presents Pi3DET, the first multi-platform LiDAR 3D detection benchmark spanning vehicle, drone, and quadruped platforms, paired with Pi3DET-Net, a two-stage cross-platform adaptation framework that jointly learns geometry robustness and feature alignment to achieve perspective-invariant 3D detection. The approach introduces Random Platform Jitter and Virtual Platform Pose for geometric alignment, plus Geometry-Aware Transformation Descriptor and KL Probabilistic Feature Alignment for semantic feature alignment, enabling effective knowledge transfer from vehicle data to non-vehicle platforms. Extensive experiments on cross-platform and cross-dataset tasks, including cross-platform benchmarks across 18 detectors, demonstrate substantial improvements over baselines and strong generalization capabilities. The dataset, toolkit, and benchmark are publicly released to foster development of generalizable 3D perception systems across diverse autonomous platforms.

Abstract

With the rise of robotics, LiDAR-based 3D object detection has garnered significant attention in both academia and industry. However, existing datasets and methods predominantly focus on vehicle-mounted platforms, leaving other autonomous platforms underexplored. To bridge this gap, we introduce Pi3DET, the first benchmark featuring LiDAR data and 3D bounding box annotations collected from multiple platforms: vehicle, quadruped, and drone, thereby facilitating research in 3D object detection for non-vehicle platforms as well as cross-platform 3D detection. Based on Pi3DET, we propose a novel cross-platform adaptation framework that transfers knowledge from the well-studied vehicle platform to other platforms. This framework achieves perspective-invariant 3D detection through robust alignment at both geometric and feature levels. Additionally, we establish a benchmark to evaluate the resilience and robustness of current 3D detectors in cross-platform scenarios, providing valuable insights for developing adaptive 3D perception systems. Extensive experiments validate the effectiveness of our approach on challenging cross-platform tasks, demonstrating substantial gains over existing adaptation methods. We hope this work paves the way for generalizable and unified 3D perception systems across diverse and complex environments. Our Pi3DET dataset, cross-platform benchmark suite, and annotation toolkit have been made publicly available.

Perspective-Invariant 3D Object Detection

TL;DR

Abstract

Perspective-Invariant 3D Object Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)