BAFLineDP: Code Bilinear Attention Fusion Framework for Line-Level Defect Prediction
Shaojian Qiu, Huihao Huang, Jianxiang Luo, Yingjie Kuang, Haoyu Luo
TL;DR
This work tackles the challenge of fine-grained line-level defect prediction by addressing the lack of contextual semantics and local line interactions in prior methods. It introduces BAFLineDP, a code bilinear attention fusion framework that combines CodeBERT-based line embeddings, Bi-GRU line-level context, and a bilinear attention fusion network to capture global and local information for defect localization. Empirical evaluation on 9 open-source projects across 32 releases shows that BAFLineDP outperforms both file-level and line-level baselines in WPDP and CPDP settings, improving AUC, BA, and MCC for file-level predictions and Recall@Top20%LOC alongside reduced Effort@Top20%Recall for line-level predictions. The results demonstrate practical benefits for predicting and prioritizing defective lines, enabling more cost-effective software quality assurance, with code and data publicly available on GitHub.
Abstract
Software defect prediction aims to identify defect-prone code, aiding developers in optimizing testing resource allocation. Most defect prediction approaches primarily focus on coarse-grained, file-level defect prediction, which fails to provide developers with the precision required to locate defective code. Recently, some researchers have proposed fine-grained, line-level defect prediction methods. However, most of these approaches lack an in-depth consideration of the contextual semantics of code lines and neglect the local interaction information among code lines. To address the above issues, this paper presents a line-level defect prediction method grounded in a code bilinear attention fusion framework (BAFLineDP). This method discerns defective code files and lines by integrating source code line semantics, line-level context, and local interaction information between code lines and line-level context. Through an extensive analysis involving within- and cross-project defect prediction across 9 distinct projects encompassing 32 releases, our results demonstrate that BAFLineDP outperforms current advanced file-level and line-level defect prediction approaches.
