LiDAR Based Semantic Perception for Forklifts in Outdoor Environments

Benjamin Serfling; Hannes Reichert; Lorenzo Bayerlein; Konrad Doll; Kati Radkhah-Lens

LiDAR Based Semantic Perception for Forklifts in Outdoor Environments

Benjamin Serfling, Hannes Reichert, Lorenzo Bayerlein, Konrad Doll, Kati Radkhah-Lens

TL;DR

This paper tackles semantic perception for autonomous forklifts operating in outdoor industrial environments by proposing a dual-LiDAR system that provides broad spatial coverage and short-range precision. The authors develop a lightweight CNN-based segmentation pipeline that uses a spherical LiDAR representation, surface normals, and reflectivity, fused across two synchronized LiDARs, and enhanced by an attention-based neck and feature pyramid network to achieve real-time performance. They train with a combined cross-entropy and Tversky loss, pretrain on a large automotive dataset, and finetune on a custom industrial dataset featuring forklifts, pedestrians, and ground/lanes. The evaluation shows that EfficientNet-based backbones offer a favorable accuracy–latency trade-off under a $33.3$ ms budget, achieving up to $\text{mIoU}=74.14\%$ while maintaining real-time operation, and demonstrates strong segmentation of drivable ground, lane markings, and safety-critical objects. The work provides a practical path toward safety-aware, fully autonomous forklift navigation in dynamic warehouse and outdoor yard environments, with future directions including temporal fusion and multi-modal sensing.

Abstract

In this study, we present a novel LiDAR-based semantic segmentation framework tailored for autonomous forklifts operating in complex outdoor environments. Central to our approach is the integration of a dual LiDAR system, which combines forward-facing and downward-angled LiDAR sensors to enable comprehensive scene understanding, specifically tailored for industrial material handling tasks. The dual configuration improves the detection and segmentation of dynamic and static obstacles with high spatial precision. Using high-resolution 3D point clouds captured from two sensors, our method employs a lightweight yet robust approach that segments the point clouds into safety-critical instance classes such as pedestrians, vehicles, and forklifts, as well as environmental classes such as driveable ground, lanes, and buildings. Experimental validation demonstrates that our approach achieves high segmentation accuracy while satisfying strict runtime requirements, establishing its viability for safety-aware, fully autonomous forklift navigation in dynamic warehouse and yard environments.

LiDAR Based Semantic Perception for Forklifts in Outdoor Environments

TL;DR

Abstract

LiDAR Based Semantic Perception for Forklifts in Outdoor Environments

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)