Table of Contents
Fetching ...

DOSE3 : Diffusion-based Out-of-distribution detection on SE(3) trajectories

Hongzhe Cheng, Tianyou Zheng, Tianyi Zhang, Matthew Johnson-Roberson, Weiming Zhi

TL;DR

This paper tackles OOD detection for sequences of rigid-body poses by extending diffusion models to the non-Euclidean manifold $\mathbb{SE}(3)$. It introduces DOSE3, a unified diffusion-based framework that diffuses both translations in $\mathbb{R}^3$ and rotations in $\mathbb{SO}(3)$, using manifold-aware forward and reverse processes and a diffusion estimator $\boldsymbol{\epsilon}_\theta$ to derive OOD statistics without retraining. A dedicated SE(3) diffusion UNet architecture with 1D temporal convolutions, attention, and residual connections enables effective modeling of pose trajectories; OOD detection is performed via a 24-dimensional diffusion-based statistic, with density estimation on inliers and a 5th percentile threshold for decision. Empirical validation on Oxford, KITTI, and IROS20 demonstrates near-perfect AUROC across ID/OOD combinations, with rotation information proving particularly discriminative and robust to sequence length and diffusion-step variations, highlighting practical impact for robotics and autonomous systems.

Abstract

Out-of-Distribution(OOD) detection, a fundamental machine learning task aimed at identifying abnormal samples, traditionally requires model retraining for different inlier distributions. While recent research demonstrates the applicability of diffusion models to OOD detection, existing approaches are limited to Euclidean or latent image spaces. Our work extends OOD detection to trajectories in the Special Euclidean Group in 3D ($\mathbb{SE}(3)$), addressing a critical need in computer vision, robotics, and engineering applications that process object pose sequences in $\mathbb{SE}(3)$. We present $\textbf{D}$iffusion-based $\textbf{O}$ut-of-distribution detection on $\mathbb{SE}(3)$ ($\mathbf{DOSE3}$), a novel OOD framework that extends diffusion to a unified sample space of $\mathbb{SE}(3)$ pose sequences. Through extensive validation on multiple benchmark datasets, we demonstrate $\mathbf{DOSE3}$'s superior performance compared to state-of-the-art OOD detection frameworks.

DOSE3 : Diffusion-based Out-of-distribution detection on SE(3) trajectories

TL;DR

This paper tackles OOD detection for sequences of rigid-body poses by extending diffusion models to the non-Euclidean manifold . It introduces DOSE3, a unified diffusion-based framework that diffuses both translations in and rotations in , using manifold-aware forward and reverse processes and a diffusion estimator to derive OOD statistics without retraining. A dedicated SE(3) diffusion UNet architecture with 1D temporal convolutions, attention, and residual connections enables effective modeling of pose trajectories; OOD detection is performed via a 24-dimensional diffusion-based statistic, with density estimation on inliers and a 5th percentile threshold for decision. Empirical validation on Oxford, KITTI, and IROS20 demonstrates near-perfect AUROC across ID/OOD combinations, with rotation information proving particularly discriminative and robust to sequence length and diffusion-step variations, highlighting practical impact for robotics and autonomous systems.

Abstract

Out-of-Distribution(OOD) detection, a fundamental machine learning task aimed at identifying abnormal samples, traditionally requires model retraining for different inlier distributions. While recent research demonstrates the applicability of diffusion models to OOD detection, existing approaches are limited to Euclidean or latent image spaces. Our work extends OOD detection to trajectories in the Special Euclidean Group in 3D (), addressing a critical need in computer vision, robotics, and engineering applications that process object pose sequences in . We present iffusion-based ut-of-distribution detection on (), a novel OOD framework that extends diffusion to a unified sample space of pose sequences. Through extensive validation on multiple benchmark datasets, we demonstrate 's superior performance compared to state-of-the-art OOD detection frameworks.

Paper Structure

This paper contains 21 sections, 15 equations, 5 figures, 4 tables, 2 algorithms.

Figures (5)

  • Figure 1: Sequences of rigid poses are abundant in displines that pertain to objects moving in the real world. We propose $\mathbf{DOSE3}$, a unified diffusion model over the $\mathbb{SE}(3)$ manifold to accurately detect out-of-distribution pose sequences.
  • Figure 2: System Diagram of DOSE3 processing flow. Sequences of pose data are diffused, where diffusion over rotational components is constrained to the $\mathbb{SO}(3)$ manifold. The resulting diffusion estimators are used to construct an OOD statistic.
  • Figure 3: SE(3) diffusion UNet architecture per layer. Trajectories are processed by the 1-dimensional convolution modules.
  • Figure 4: Distribution of elements $\epsilon(x_t, t)$ in rotation tangent space for each dimension when running Oxford Robot Car and KITTI dataset on model trained on Oxford Robot Car
  • Figure 5: $\epsilon$ distribution retrieved by OOD testing on a model trained from KITTI dataset against the Oxford RoboCar dataset. We observe that the distributions of both datasets is particularly evident on the Z-axis rotation.