
RayMamba: Ray-Aligned Serialization for Long-Range 3D Object Detection

Cheng Lu, Mingqian Ji, Shanshan Zhang, Zhihao Li, Jian Yang

Abstract

Long-range 3D object detection remains challenging because LiDAR observations become highly sparse and fragmented in the far field, making reliable context modeling difficult for existing detectors. Recent state space model (SSM)-based methods improve long-range modeling efficiency, but their effectiveness is still limited by generic serialization strategies that fail to preserve meaningful contextual neighborhoods in sparse scenes. To address this issue, we propose RayMamba, a geometry-aware plug-and-play enhancement for voxel-based 3D detectors. RayMamba organizes sparse voxels into sector-wise ordered sequences through a ray-aligned serialization strategy, which preserves directional continuity and occlusion-related context for subsequent Mamba-based modeling. It is compatible with both LiDAR-only and multimodal detectors, while introducing only modest overhead. Extensive experiments on nuScenes and Argoverse 2 demonstrate consistent improvements across strong baselines. In particular, RayMamba achieves gains of up to 2.49 mAP and 1.59 NDS in the challenging 40--50 m range on nuScenes, and further improves VoxelNeXt on Argoverse 2 from 30.3 to 31.2 mAP.

Paper Structure

This paper contains 19 sections, 6 equations, 5 figures, 8 tables.

Figures (5)

  • Figure 1: Due to occlusion and distance-induced sparsity in LiDAR, distant objects are often represented by only a few returns.
  • Figure 2: Comparison of 1D sequence context in long-range sparse scenes. For a given far-field reference voxel (red star), we highlight its context window of $K=360$ adjacent voxels in the serialized sequence. Our ray-aligned ordering (blue) preserves directionally coherent physical structures, whereas the Hilbert ordering (purple) activates spatially scattered, unrelated regions.
  • Figure 3: Overview of RayMamba. Top: RayMamba blocks are inserted into a sparse 3D convolutional backbone. Bottom: Structure of a RayMamba block. RayMamba consists of two components: Ray-Aligned Serialization, which converts sparse voxel features into sector-wise ordered sequences using an offline-generated dense sector template, and SectorMamba3D, which performs sector-wise sequence modeling before the enhanced features are restored to sparse 3D space through Sequence-to-Spatial and sparse deconvolution.
  • Figure 4: Ray-aligned serialization strategy. (a) Azimuth sector partitioning: The BEV space is divided into independent angular sectors to separate directionally distinct regions. (b) Sector-wise ordering: Voxels in each sector are serialized by first traversing height layers from top to bottom, introducing a vertical layering prior, and then applying angular ordering within each layer to preserve directional continuity. For a fixed voxel grid, the resulting sector assignments and ordering scores are precomputed as a dense sector template, which is queried at runtime to convert active sparse voxels into sector-wise ordered sequences.
  • Figure 5: Qualitative comparison on challenging long-range occluded targets. Green boxes denote ground truth, red dashed boxes denote baseline predictions, and blue dashed boxes denote RayMamba predictions.
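The ray-aligned serialization described in Figure 4 can be sketched in NumPy as follows. This is a minimal illustrative sketch, not the paper's implementation: the function names, grid size, sector count, and the exact in-sector score are assumptions; the paper likewise precomputes a dense sector template offline for a fixed voxel grid and queries it at runtime for active voxels.

```python
import numpy as np

def build_sector_template(grid_xy=8, num_sectors=4):
    """Precompute, for every cell of a fixed BEV grid, an azimuth sector id
    and an in-sector angular score (illustrative analogue of the paper's
    offline-generated dense sector template)."""
    xs, ys = np.meshgrid(np.arange(grid_xy), np.arange(grid_xy), indexing="ij")
    # Cell centers relative to the sensor, assumed at the grid center.
    cx = xs - (grid_xy - 1) / 2.0
    cy = ys - (grid_xy - 1) / 2.0
    azimuth = np.arctan2(cy, cx)                       # in [-pi, pi]
    frac = (azimuth + np.pi) / (2 * np.pi)             # normalized to [0, 1)
    sector_id = np.minimum((frac * num_sectors).astype(np.int64),
                           num_sectors - 1)
    return sector_id, frac                             # dense, shape (H, W)

def ray_aligned_serialize(coords, sector_id, angle_score):
    """Order active voxels sector-wise: group by azimuth sector, traverse
    height layers top-to-bottom, then order by azimuth within each layer.
    coords: (N, 3) int array of (x, y, z) voxel indices."""
    x, y, z = coords[:, 0], coords[:, 1], coords[:, 2]
    sec = sector_id[x, y]          # query the dense template at runtime
    ang = angle_score[x, y]
    # np.lexsort sorts by the LAST key first: sector, then -z (top layer
    # first), then angular position within the layer.
    order = np.lexsort((ang, -z, sec))
    return order, sec

# Toy usage: three active voxels on an 8x8 grid.
sector_id, angle_score = build_sector_template()
coords = np.array([[7, 4, 0], [7, 4, 2], [0, 4, 1]])
order, sec = ray_aligned_serialize(coords, sector_id, angle_score)
# The two voxels at (7, 4) share a sector; the higher one (z=2) comes first,
# and the voxel at (0, 4), lying in a different sector, is serialized last.
```

Because the template depends only on the voxel grid, the runtime cost per frame reduces to a gather plus a lexicographic sort over the active voxels, which is how the method keeps its overhead modest.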