Table of Contents
Fetching ...

Enhancing Road Safety: Real-Time Detection of Driver Distraction through Convolutional Neural Networks

Amaan Aijaz Sheikh, Imaad Zaffar Khan

TL;DR

The paper tackles real-time driver distraction detection by evaluating a spectrum of CNN architectures, including SimpleCNN, VGG16/19 variants, and a Hybrid CNN-Transformer, using the State Farm Distracted Driver dataset. It systematically compares baseline, depth-varying, and fine-tuned configurations under consistent training conditions, and investigates dual-framework implementations (Keras and PyTorch) to assess robustness and practicality. Key findings show that fine-tuned VGG19 and the Hybrid CNN-Transformer achieve top-tier accuracy (~0.98) but with varying inference times, underscoring a trade-off between accuracy and real-time performance. The work demonstrates actionable insights for deploying driver-monitoring systems, balancing model complexity, training dynamics, and hardware capabilities to enhance road safety.

Abstract

As we navigate our daily commutes, the threat posed by a distracted driver is at a large, resulting in a troubling rise in traffic accidents. Addressing this safety concern, our project harnesses the analytical power of Convolutional Neural Networks (CNNs), with a particular emphasis on the well-established models VGG16 and VGG19. These models are acclaimed for their precision in image recognition and are meticulously tested for their ability to detect nuances in driver behavior under varying environmental conditions. Through a comparative analysis against an array of CNN architectures, this study seeks to identify the most efficient model for real-time detection of driver distractions. The ultimate aim is to incorporate the findings into vehicle safety systems, significantly boosting their capability to prevent accidents triggered by inattention. This research not only enhances our understanding of automotive safety technologies but also marks a pivotal step towards creating vehicles that are intuitively aligned with driver behaviors, ensuring safer roads for all.

Enhancing Road Safety: Real-Time Detection of Driver Distraction through Convolutional Neural Networks

TL;DR

The paper tackles real-time driver distraction detection by evaluating a spectrum of CNN architectures, including SimpleCNN, VGG16/19 variants, and a Hybrid CNN-Transformer, using the State Farm Distracted Driver dataset. It systematically compares baseline, depth-varying, and fine-tuned configurations under consistent training conditions, and investigates dual-framework implementations (Keras and PyTorch) to assess robustness and practicality. Key findings show that fine-tuned VGG19 and the Hybrid CNN-Transformer achieve top-tier accuracy (~0.98) but with varying inference times, underscoring a trade-off between accuracy and real-time performance. The work demonstrates actionable insights for deploying driver-monitoring systems, balancing model complexity, training dynamics, and hardware capabilities to enhance road safety.

Abstract

As we navigate our daily commutes, the threat posed by a distracted driver is at a large, resulting in a troubling rise in traffic accidents. Addressing this safety concern, our project harnesses the analytical power of Convolutional Neural Networks (CNNs), with a particular emphasis on the well-established models VGG16 and VGG19. These models are acclaimed for their precision in image recognition and are meticulously tested for their ability to detect nuances in driver behavior under varying environmental conditions. Through a comparative analysis against an array of CNN architectures, this study seeks to identify the most efficient model for real-time detection of driver distractions. The ultimate aim is to incorporate the findings into vehicle safety systems, significantly boosting their capability to prevent accidents triggered by inattention. This research not only enhances our understanding of automotive safety technologies but also marks a pivotal step towards creating vehicles that are intuitively aligned with driver behaviors, ensuring safer roads for all.
Paper Structure (48 sections, 2 equations, 24 figures, 3 tables)

This paper contains 48 sections, 2 equations, 24 figures, 3 tables.

Figures (24)

  • Figure 1: Pixel Intensity Distribution across RGB channels
  • Figure 2: System Architecture
  • Figure 3: System Architecture
  • Figure 4: Training and Validation Loss for SimpleCNN
  • Figure 5: Training and Validation Accuracy VGG16- Deep Network
  • ...and 19 more figures