Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

Svetlana Seliunina; Artem Otelepko; Raphael Memmesheimer; Sven Behnke

Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

Svetlana Seliunina, Artem Otelepko, Raphael Memmesheimer, Sven Behnke

TL;DR

This paper proposes a method based on a MaskDINO model to detect and segment persons and to recognize their actions from combined spherical projected multi-channel representations of the LiDAR data with an additional positional encoding.

Abstract

Robots need to perceive persons in their surroundings for safety and to interact with them. In this paper, we present a person segmentation and action classification approach that operates on 3D scans of hemisphere field of view LiDAR sensors. We recorded a data set with an Ouster OSDome-64 sensor consisting of scenes where persons perform three different actions and annotated it. We propose a method based on a MaskDINO model to detect and segment persons and to recognize their actions from combined spherical projected multi-channel representations of the LiDAR data with an additional positional encoding. Our approach demonstrates good performance for the person segmentation task and further performs well for the estimation of the person action states walking, waving, and sitting. An ablation study provides insights about the individual channel contributions for the person segmentation task. The trained models, code and dataset are made publicly available.

Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

TL;DR

Abstract

Person Segmentation and Action Classification for Multi-Channel Hemisphere Field of View LiDAR Sensors

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)