Efficient Camera Exposure Control for Visual Odometry via Deep Reinforcement Learning
Shuyang Zhang, Jinhao He, Yilong Zhu, Jin Wu, Jie Yuan
TL;DR
This paper tackles the problem of visual odometry degradation under rapid illumination changes by introducing a deep reinforcement learning framework for camera exposure control trained entirely offline. It decouples exposure selection from parameter allocation via a two-module pipeline and employs a lightweight image simulator with bracketing-based photometric synthesis and motion augmentation to achieve data-efficient training. Three reward designs—statistical, feature-based, and pose-based—yield different VO-relevant intelligences, with the feature-based reward offering robust performance in challenging sequences and fast inference on CPU. The approach demonstrates improved VO stability and faster reaction times compared with traditional methods, highlighting practical potential for robust, exposure-aware VO in real-world robotics without online hardware interaction.
Abstract
The stability of visual odometry (VO) systems is undermined by degraded image quality, especially in environments with significant illumination changes. This study employs a deep reinforcement learning (DRL) framework to train agents for exposure control, aiming to enhance imaging performance in challenging conditions. A lightweight image simulator is developed to facilitate the training process, enabling the diversification of image exposure and sequence trajectory. This setup enables completely offline training, eliminating the need for direct interaction with camera hardware and the real environments. Different levels of reward functions are crafted to enhance the VO systems, equipping the DRL agents with varying intelligence. Extensive experiments have shown that our exposure control agents achieve superior efficiency-with an average inference duration of 1.58 ms per frame on a CPU-and respond more quickly than traditional feedback control schemes. By choosing an appropriate reward function, agents acquire an intelligent understanding of motion trends and anticipate future illumination changes. This predictive capability allows VO systems to deliver more stable and precise odometry results. The codes and datasets are available at https://github.com/ShuyangUni/drl_exposure_ctrl.
