No Thing, Nothing: Highlighting Safety-Critical Classes for Robust LiDAR Semantic Segmentation in Adverse Weather
Junsung Park, Hwijeong Lee, Inha Kang, Hyunjung Shim
TL;DR
This work tackles the difficulty of reliably segmenting safety-critical things in LiDAR data under adverse weather by identifying semantic-level and local-level feature corruptions as the main bottlenecks. It introduces NTN, a framework that combines Feature Binding (FB) to anchor thing classes to visually similar superclasses and Beam-wise Feature Distillation (BFD) to preserve learning signals within each LiDAR beam under point loss. The approach is architecture-agnostic and yields state-of-the-art results on SemanticKITTI→SemanticSTF and SemanticPOSS→SemanticSTF, with notable improvements for things classes (e.g., gains of up to +4.8 to +7.9 mIoU). These results demonstrate enhanced robustness of safety-critical perception in autonomous driving across diverse adverse-weather scenarios, with practical implications for safer navigation and planning.
Abstract
Existing domain generalization methods for LiDAR semantic segmentation under adverse weather struggle to accurately predict "things" categories compared to "stuff" categories. In typical driving scenes, "things" categories can be dynamic and associated with higher collision risks, making them crucial for safe navigation and planning. Recognizing the importance of "things" categories, we identify their performance drop as a serious bottleneck in existing approaches. We observed that adverse weather induces degradation of semantic-level features and both corruption of local features, leading to a misprediction of "things" as "stuff". To mitigate these corruptions, we suggest our method, NTN - segmeNt Things for No-accident. To address semantic-level feature corruption, we bind each point feature to its superclass, preventing the misprediction of things classes into visually dissimilar categories. Additionally, to enhance robustness against local corruption caused by adverse weather, we define each LiDAR beam as a local region and propose a regularization term that aligns the clean data with its corrupted counterpart in feature space. NTN achieves state-of-the-art performance with a +2.6 mIoU gain on the SemanticKITTI-to-SemanticSTF benchmark and +7.9 mIoU on the SemanticPOSS-to-SemanticSTF benchmark. Notably, NTN achieves a +4.8 and +7.9 mIoU improvement on "things" classes, respectively, highlighting its effectiveness.
