DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions
Sanket Kalwar, Mihir Ungarala, Shruti Jain, Aaron Monis, Krishna Reddy Konda, Sourav Garg, K Madhava Krishna
TL;DR
DiffPrompter addresses semantic segmentation in adverse weather by introducing differentiable visual prompts and a differentiable adaptor framework that augments foundation-model backbones. It provides a $\nabla$HFC image processing block and a shallow vision encoder to jointly learn visual prompts and latent embeddings, enabling both parallel (PDA) and sequential (SDA) adaptor architectures. Through extensive experiments on datasets like BDD100K, ACDC, Wild-Dash, Dark-Zurich, COD10K, and CAMO, the approach achieves superior out-of-distribution generalization and improved segmentation performance on both high-level and low-level tasks. The work demonstrates the importance of integrating local and global representations via differentiable prompts and suggests future directions toward visual-language prompting and 3D scene understanding for robust perception in autonomous driving.
Abstract
Semantic segmentation in adverse weather scenarios is a critical task for autonomous driving systems. While foundation models have shown promise, the need for specialized adaptors becomes evident for handling more challenging scenarios. We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in foundation models. Our proposed $\nabla$HFC image processing block excels particularly in adverse weather conditions, where conventional methods often fall short. Furthermore, we investigate the advantages of jointly training visual and latent prompts, demonstrating that this combined approach significantly enhances performance in out-of-distribution scenarios. Our differentiable visual prompts leverage parallel and series architectures to generate prompts, effectively improving object segmentation tasks in adverse conditions. Through a comprehensive series of experiments and evaluations, we provide empirical evidence to support the efficacy of our approach. Project page at https://diffprompter.github.io.
