Table of Contents
Fetching ...

Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices

Jeho Lee, Chanyoung Jung, Jiwon Kim, Hojung Cha

TL;DR

Panopticus is a carefully designed system for omnidirectional and camera-based 3D detection on edge devices that employs an adaptive multi-branch detection scheme that accounts for spatial complexities and dynamically adjusts the model's architecture and operations based on available edge resources and spatial characteristics.

Abstract

3D object detection with omnidirectional views enables safety-critical applications such as mobile robot navigation. Such applications increasingly operate on resource-constrained edge devices, facilitating reliable processing without privacy concerns or network delays. To enable cost-effective deployment, cameras have been widely adopted as a low-cost alternative to LiDAR sensors. However, the compute-intensive workload to achieve high performance of camera-based solutions remains challenging due to the computational limitations of edge devices. In this paper, we present Panopticus, a carefully designed system for omnidirectional and camera-based 3D detection on edge devices. Panopticus employs an adaptive multi-branch detection scheme that accounts for spatial complexities. To optimize the accuracy within latency limits, Panopticus dynamically adjusts the model's architecture and operations based on available edge resources and spatial characteristics. We implemented Panopticus on three edge devices and conducted experiments across real-world environments based on the public self-driving dataset and our mobile 360° camera dataset. Experiment results showed that Panopticus improves accuracy by 62% on average given the strict latency objective of 33ms. Also, Panopticus achieves a 2.1{\times} latency reduction on average compared to baselines.

Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices

TL;DR

Panopticus is a carefully designed system for omnidirectional and camera-based 3D detection on edge devices that employs an adaptive multi-branch detection scheme that accounts for spatial complexities and dynamically adjusts the model's architecture and operations based on available edge resources and spatial characteristics.

Abstract

3D object detection with omnidirectional views enables safety-critical applications such as mobile robot navigation. Such applications increasingly operate on resource-constrained edge devices, facilitating reliable processing without privacy concerns or network delays. To enable cost-effective deployment, cameras have been widely adopted as a low-cost alternative to LiDAR sensors. However, the compute-intensive workload to achieve high performance of camera-based solutions remains challenging due to the computational limitations of edge devices. In this paper, we present Panopticus, a carefully designed system for omnidirectional and camera-based 3D detection on edge devices. Panopticus employs an adaptive multi-branch detection scheme that accounts for spatial complexities. To optimize the accuracy within latency limits, Panopticus dynamically adjusts the model's architecture and operations based on available edge resources and spatial characteristics. We implemented Panopticus on three edge devices and conducted experiments across real-world environments based on the public self-driving dataset and our mobile 360° camera dataset. Experiment results showed that Panopticus improves accuracy by 62% on average given the strict latency objective of 33ms. Also, Panopticus achieves a 2.1{\times} latency reduction on average compared to baselines.
Paper Structure (24 sections, 4 equations, 23 figures, 5 tables, 1 algorithm)

This paper contains 24 sections, 4 equations, 23 figures, 5 tables, 1 algorithm.

Figures (23)

  • Figure 1: Obstacle avoidance based on omnidirectional 3D object detection. For optimized resource utilization, a robot prioritizes camera views with high spatial complexity (red) over those with low complexity (green and yellow).
  • Figure 2: Baseline pipeline of BEV-based 3D detection with camera images.
  • Figure 3: Detection accuracy and latency of BEVDet variants and LiDAR-based detector with edge GPU. +D and +F denote the incorporation of dense DepthNet and temporal fusion, respectively.
  • Figure 4: mAP (first row) and TP errors (second row) over objects with different distances and velocities. +D and +F denote the incorporation of dense DepthNet and temporal fusion, respectively.
  • Figure 5: Distribution and mAP in example scenes.
  • ...and 18 more figures