Table of Contents
Fetching ...

YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection

Olalekan Akindele, Joshua Atolagbe

TL;DR

A new attention-based foundation architecture, YOLO-ELA, is proposed, which achieves a state-of-the-art performance on high-resolution UAV images and demonstrates the effectiveness of attention-based convolutional neural networks (CNN) in object detection tasks.

Abstract

Existing detection methods for insulator defect identification from unmanned aerial vehicles (UAV) struggle with complex background scenes and small objects, leading to suboptimal accuracy and a high number of false positives detection. Using the concept of local attention modeling, this paper proposes a new attention-based foundation architecture, YOLO-ELA, to address this issue. The Efficient Local Attention (ELA) blocks were added into the neck part of the one-stage YOLOv8 architecture to shift the model's attention from background features towards features of insulators with defects. The SCYLLA Intersection-Over-Union (SIoU) criterion function was used to reduce detection loss, accelerate model convergence, and increase the model's sensitivity towards small insulator defects, yielding higher true positive outcomes. Due to a limited dataset, data augmentation techniques were utilized to increase the diversity of the dataset. In addition, we leveraged the transfer learning strategy to improve the model's performance. Experimental results on high-resolution UAV images show that our method achieved a state-of-the-art performance of 96.9% mAP0.5 and a real-time detection speed of 74.63 frames per second, outperforming the baseline model. This further demonstrates the effectiveness of attention-based convolutional neural networks (CNN) in object detection tasks.

YOLO-ELA: Efficient Local Attention Modeling for High-Performance Real-Time Insulator Defect Detection

TL;DR

A new attention-based foundation architecture, YOLO-ELA, is proposed, which achieves a state-of-the-art performance on high-resolution UAV images and demonstrates the effectiveness of attention-based convolutional neural networks (CNN) in object detection tasks.

Abstract

Existing detection methods for insulator defect identification from unmanned aerial vehicles (UAV) struggle with complex background scenes and small objects, leading to suboptimal accuracy and a high number of false positives detection. Using the concept of local attention modeling, this paper proposes a new attention-based foundation architecture, YOLO-ELA, to address this issue. The Efficient Local Attention (ELA) blocks were added into the neck part of the one-stage YOLOv8 architecture to shift the model's attention from background features towards features of insulators with defects. The SCYLLA Intersection-Over-Union (SIoU) criterion function was used to reduce detection loss, accelerate model convergence, and increase the model's sensitivity towards small insulator defects, yielding higher true positive outcomes. Due to a limited dataset, data augmentation techniques were utilized to increase the diversity of the dataset. In addition, we leveraged the transfer learning strategy to improve the model's performance. Experimental results on high-resolution UAV images show that our method achieved a state-of-the-art performance of 96.9% mAP0.5 and a real-time detection speed of 74.63 frames per second, outperforming the baseline model. This further demonstrates the effectiveness of attention-based convolutional neural networks (CNN) in object detection tasks.

Paper Structure

This paper contains 13 sections, 14 equations, 6 figures, 3 tables.

Figures (6)

  • Figure 1: The Proposed YOLO-ELA Architecture
  • Figure 2: The Schematic Diagram of SIoU Loss
  • Figure 3: Images showing outcome of data augmentation
  • Figure 4: The Comparison between CIoU and SIoU Loss
  • Figure 5: Heatmap of YOLOv8 and YOLOv8+ELA.
  • ...and 1 more figures