MF-MOS: A Motion-Focused Model for Moving Object Segmentation
Jintao Cheng, Kang Zeng, Zhuoxu Huang, Xiaoyu Tang, Jin Wu, Chengxi Zhang, Xieyuanli Chen, Rui Fan
TL;DR
MF-MOS introduces a motion-focused, dual-branch LiDAR MOS framework that decouples spatial-temporal motion cues (via residual maps) from semantic guidance (via range images). Key components include the Strip Average Pooling Layer (SAPL) for cross-branch fusion, a 3D Spatial-Guided Information Enhancement Module (SIEM) to refine sparse point-cloud signals, and a distribution-based data augmentation scheme over residual maps. The approach achieves state-of-the-art performance on SemanticKITTI-MOS (IoU up to 76.7% on the test set) and demonstrates strong generalization on Apollo, with ablations confirming the contribution of each module. The work advances real-time, robust MOS by effectively leveraging motion information while preserving semantic context, offering practical benefits for autonomous driving perception systems.
Abstract
Moving object segmentation (MOS) provides a reliable solution for detecting traffic participants and thus is of great interest in the autonomous driving field. Dynamic capture is always critical in the MOS problem. Previous methods capture motion features from the range images directly. Differently, we argue that the residual maps provide greater potential for motion information, while range images contain rich semantic guidance. Based on this intuition, we propose MF-MOS, a novel motion-focused model with a dual-branch structure for LiDAR moving object segmentation. Novelly, we decouple the spatial-temporal information by capturing the motion from residual maps and generating semantic features from range images, which are used as movable object guidance for the motion branch. Our straightforward yet distinctive solution can make the most use of both range images and residual maps, thus greatly improving the performance of the LiDAR-based MOS task. Remarkably, our MF-MOS achieved a leading IoU of 76.7% on the MOS leaderboard of the SemanticKITTI dataset upon submission, demonstrating the current state-of-the-art performance. The implementation of our MF-MOS has been released at https://github.com/SCNU-RISLAB/MF-MOS.
