Evaluation of Safety Cognition Capability in Vision-Language Models for Autonomous Driving
Enming Zhang, Peizhe Gong, Xingyuan Dai, Min Huang, Yisheng Lv, Qinghai Miao
TL;DR
This work tackles the safety-cognition alignment problem for vision-language models in autonomous driving by introducing SCD-Bench, a four-dimension safety evaluation framework, and ADA for scalable annotation. It further provides SCD-Training, a large-scale dataset to train safety-aware models, and an automated GPT-4o-based evaluation pipeline achieving high agreement with human judgments. Empirical results show existing open models underperform on safety cognition (SR < 60%), while SCD-Model-1B trained with SCD-Training attains around 90% safety on SCD-Bench, and general-task improvements accompany domain-specific gains. The framework offers a practical path toward robust, safety-aligned VLMs for interactive autonomous-driving scenarios with strong cross-domain benefits.
Abstract
Ensuring the safety of vision-language models (VLMs) in autonomous driving systems is of paramount importance, yet existing research has largely focused on conventional benchmarks rather than safety-critical evaluation. In this work, we present SCD-Bench (Safety Cognition Driving Benchmark) a novel framework specifically designed to assess the safety cognition capabilities of VLMs within interactive driving scenarios. To address the scalability challenge of data annotation, we introduce ADA (Autonomous Driving Annotation), a semi-automated labeling system, further refined through expert review by professionals with domain-specific knowledge in autonomous driving. To facilitate scalable and consistent evaluation, we also propose an automated assessment pipeline leveraging large language models, which demonstrates over 98% agreement with human expert judgments. In addressing the broader challenge of aligning VLMs with safety cognition in driving environments, we construct SCD-Training, the first large-scale dataset tailored for this task, comprising 324.35K high-quality samples. Through extensive experiments, we show that models trained on SCD-Training exhibit marked improvements not only on SCD-Bench, but also on general and domain-specific benchmarks, offering a new perspective on enhancing safety-aware interactions in vision-language systems for autonomous driving.
