PromptLNet: Region-Adaptive Aesthetic Enhancement via Prompt Guidance in Low-Light Enhancement Net
Jun Yin, Yangfan He, Miao Zhang, Pengyu Zeng, Tianyi Wang, Shuai Lu, Xueqian Wang
TL;DR
This work tackles the mismatch between objective metrics and aesthetic quality in low-light image enhancement by introducing a prompt-driven, region-adaptive framework guided by human aesthetic preferences via RLHF. It combines a Retinex-based reasoning segment, region-specific brightness controls, adaptive contextual compensation, and diffusion generation controlled by ControlNet, all steered by an aesthetic reward model trained with BLIP-based features. A PFNet-driven pipeline automates the aesthetic alignment process, supported by a curated dataset of 1000 samples with multi-dimensional expert and automated scores across diverse lighting conditions. Experimental results demonstrate improved visual quality and controllability over existing methods across multiple benchmarks, highlighting the potential for practical, user-driven aesthetic refinement in real-world applications. Overall, the approach advances low-light enhancement by aligning光image outputs with human preferences while enabling semantic, region-level control.
Abstract
Learning and improving large language models through human preference feedback has become a mainstream approach, but it has rarely been applied to the field of low-light image enhancement. Existing low-light enhancement evaluations typically rely on objective metrics (such as FID, PSNR, etc.), which often result in models that perform well objectively but lack aesthetic quality. Moreover, most low-light enhancement models are primarily designed for global brightening, lacking detailed refinement. Therefore, the generated images often require additional local adjustments, leading to research gaps in practical applications. To bridge this gap, we propose the following innovations: 1) We collect human aesthetic evaluation text pairs and aesthetic scores from multiple low-light image datasets (e.g., LOL, LOL2, LOM, DCIM, MEF, etc.) to train a low-light image aesthetic evaluation model, supplemented by an optimization algorithm designed to fine-tune the diffusion model. 2) We propose a prompt-driven brightness adjustment module capable of performing fine-grained brightness and aesthetic adjustments for specific instances or regions. 3) We evaluate our method alongside existing state-of-the-art algorithms on mainstream benchmarks. Experimental results show that our method not only outperforms traditional methods in terms of visual quality but also provides greater flexibility and controllability, paving the way for improved aesthetic quality.
