Table of Contents
Fetching ...

See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction

Yuan Wu, Zhiqiang Yan, Yigong Zhang, Xiang Li, Jian Yang

TL;DR

LIAR addresses nighttime occupancy prediction by learning illumination-affined representations. It introduces Selective Low-light Image Enhancement (SLLIE) to globally and selectively enhance poorly lit images, and two illumination-aware modules, 2D-Illumination-guided Sampling (2D-IGS) and 3D-Illumination-driven Projection (3D-IDP), to mitigate underexposure and overexposure in 2D and 3D BEV contexts, respectively. Through Retinex-based enhancement, adaptive sampling, and illumination-weighted BEV refinement, LIAR achieves state-of-the-art results on real and synthetic nighttime datasets, with up to $7.44$ mIoU gains in easy settings and strong gains across severities; ablations confirm the effectiveness of each component. This approach advances robust 3D scene understanding in adverse lighting, with practical impact for safer autonomous driving under night conditions, and provides code and pretrained models for reproducibility.

Abstract

Occupancy prediction aims to estimate the 3D spatial distribution of occupied regions along with their corresponding semantic labels. Existing vision-based methods perform well on daytime benchmarks but struggle in nighttime scenarios due to limited visibility and challenging lighting conditions. To address these challenges, we propose LIAR, a novel framework that learns illumination-affined representations. LIAR first introduces Selective Low-light Image Enhancement (SLLIE), which leverages the illumination priors from daytime scenes to adaptively determine whether a nighttime image is genuinely dark or sufficiently well-lit, enabling more targeted global enhancement. Building on the illumination maps generated by SLLIE, LIAR further incorporates two illumination-aware components: 2D Illumination-guided Sampling (2D-IGS) and 3D Illumination-driven Projection (3D-IDP), to respectively tackle local underexposure and overexposure. Specifically, 2D-IGS modulates feature sampling positions according to illumination maps, assigning larger offsets to darker regions and smaller ones to brighter regions, thereby alleviating feature degradation in underexposed areas. Subsequently,3D-IDP enhances semantic understanding in overexposed regions by constructing illumination intensity fields and supplying refined residual queries to the BEV context refinement process. Extensive experiments on both real and synthetic datasets demonstrate the superior performance of LIAR under challenging nighttime scenarios. The source code and pretrained models are available [here](https://github.com/yanzq95/LIAR).

See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction

TL;DR

LIAR addresses nighttime occupancy prediction by learning illumination-affined representations. It introduces Selective Low-light Image Enhancement (SLLIE) to globally and selectively enhance poorly lit images, and two illumination-aware modules, 2D-Illumination-guided Sampling (2D-IGS) and 3D-Illumination-driven Projection (3D-IDP), to mitigate underexposure and overexposure in 2D and 3D BEV contexts, respectively. Through Retinex-based enhancement, adaptive sampling, and illumination-weighted BEV refinement, LIAR achieves state-of-the-art results on real and synthetic nighttime datasets, with up to mIoU gains in easy settings and strong gains across severities; ablations confirm the effectiveness of each component. This approach advances robust 3D scene understanding in adverse lighting, with practical impact for safer autonomous driving under night conditions, and provides code and pretrained models for reproducibility.

Abstract

Occupancy prediction aims to estimate the 3D spatial distribution of occupied regions along with their corresponding semantic labels. Existing vision-based methods perform well on daytime benchmarks but struggle in nighttime scenarios due to limited visibility and challenging lighting conditions. To address these challenges, we propose LIAR, a novel framework that learns illumination-affined representations. LIAR first introduces Selective Low-light Image Enhancement (SLLIE), which leverages the illumination priors from daytime scenes to adaptively determine whether a nighttime image is genuinely dark or sufficiently well-lit, enabling more targeted global enhancement. Building on the illumination maps generated by SLLIE, LIAR further incorporates two illumination-aware components: 2D Illumination-guided Sampling (2D-IGS) and 3D Illumination-driven Projection (3D-IDP), to respectively tackle local underexposure and overexposure. Specifically, 2D-IGS modulates feature sampling positions according to illumination maps, assigning larger offsets to darker regions and smaller ones to brighter regions, thereby alleviating feature degradation in underexposed areas. Subsequently,3D-IDP enhances semantic understanding in overexposed regions by constructing illumination intensity fields and supplying refined residual queries to the BEV context refinement process. Extensive experiments on both real and synthetic datasets demonstrate the superior performance of LIAR under challenging nighttime scenarios. The source code and pretrained models are available [here](https://github.com/yanzq95/LIAR).

Paper Structure

This paper contains 20 sections, 16 equations, 12 figures, 7 tables.

Figures (12)

  • Figure 1: (a) Nighttime images often suffer from both underexposure and overexposure, resulting in feature degradation and semantic deficiency. (b) Our approach learns illumination-aware representations to enhance both the fundamental 2D and 3D stages.
  • Figure 2: Pipeline of LIAR. The input $\boldsymbol{X}$ is initially processed by SLLIE to generate an illumination map $\boldsymbol{I}$ and an enhanced image $\tilde{\boldsymbol{X}}$. Using $\boldsymbol{I}$, the 2D-IGS module modulates feature sampling to enhance degraded features in underexposed regions, while the 3D-IDP module constructs 3D illumination intensity fields to compensate for deficient semantics in overexposed regions. Finally, a prediction head is deployed to output the occupancy.
  • Figure 3: Distributions of the illumination factor in night and day scenes.
  • Figure 3: Ablation study of LIAR on the Occ3D-nuScenes.
  • Figure 4: (a) Visualization of sampling points in low-light regions. (b) Statistical analysis of the offset distribution.
  • ...and 7 more figures