Making the Flow Glow -- Robot Perception under Severe Lighting Conditions using Normalizing Flow Gradients
Simon Kristoffersson Lind, Rudolph Triebel, Volker Krüger
TL;DR
This paper tackles robust robotic perception under severe lighting by shifting from global to pixel-level out-of-distribution detection. It leverages the absolute gradients of a normalizing-flow-based likelihood to produce region-specific OOD cues, enabling ROI-guided optimization of camera parameters to improve object detection in challenging scenes. Empirical results show that the NF-gradient approach yields substantial performance gains (e.g., ~60% higher success than prior methods) and that gradient magnitude correlates with detection reliability across detectors like YOLOv4 and Faster-RCNN. The work demonstrates practical gains for adaptive vision in robotics, provides code and a dataset, and discusses runtime considerations and future extensions for broader applicability.
Abstract
Modern robotic perception is highly dependent on neural networks. It is well known that neural network-based perception can be unreliable in real-world deployment, especially in difficult imaging conditions. Out-of-distribution detection is commonly proposed as a solution for ensuring reliability in real-world deployment. Previous work has shown that normalizing flow models can be used for out-of-distribution detection to improve reliability of robotic perception tasks. Specifically, camera parameters can be optimized with respect to the likelihood output from a normalizing flow, which allows a perception system to adapt to difficult vision scenarios. With this work we propose to use the absolute gradient values from a normalizing flow, which allows the perception system to optimize local regions rather than the whole image. By setting up a table top picking experiment with exceptionally difficult lighting conditions, we show that our method achieves a 60% higher success rate for an object detection task compared to previous methods.
