Learning to Control Camera Exposure via Reinforcement Learning
Kyunghyun Lee, Ukcheol Shin, Byeong-Uk Lee
TL;DR
The paper tackles robust camera exposure control under dynamic lighting, a critical bottleneck for vision systems. It introduces DRL-AE, a deep reinforcement learning framework that jointly optimizes exposure time and gain using a lightweight vectorized RoI intensity history, continuous-relative actions, a flicker-aware reward, static-to-dynamic curriculum, and domain randomization. Empirical results show rapid convergence within about five steps and real-time CPU processing (~1 ms inference, ~6 ms total per frame), with superior feature extraction and object detection performance over built-in AE across darkroom, dataset, and real-world driving scenarios. The work demonstrates that DRL can provide fast, generalizable exposure control that enhances downstream perception tasks in diverse lighting, marking a first step toward robust, perception-aware camera control.
Abstract
Adjusting camera exposure in arbitrary lighting conditions is the first step to ensure the functionality of computer vision applications. Poorly adjusted camera exposure often leads to critical failure and performance degradation. Traditional camera exposure control methods require multiple convergence steps and time-consuming processes, making them unsuitable for dynamic lighting conditions. In this paper, we propose a new camera exposure control framework that rapidly controls camera exposure while performing real-time processing by exploiting deep reinforcement learning. The proposed framework consists of four contributions: 1) a simplified training ground to simulate real-world's diverse and dynamic lighting changes, 2) flickering and image attribute-aware reward design, along with lightweight state design for real-time processing, 3) a static-to-dynamic lighting curriculum to gradually improve the agent's exposure-adjusting capability, and 4) domain randomization techniques to alleviate the limitation of the training ground and achieve seamless generalization in the wild.As a result, our proposed method rapidly reaches a desired exposure level within five steps with real-time processing (1 ms). Also, the acquired images are well-exposed and show superiority in various computer vision tasks, such as feature extraction and object detection.
