Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

Yikun Li; Ting Zhang; Jieke Shi; Chengran Yang; Junda He; Xin Zhou; Jinfeng Jiang; Huihui Huang; Wen Bin Leow; Yide Yin; Eng Lieh Ouh; Lwin Khin Shar; David Lo

Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

Yikun Li, Ting Zhang, Jieke Shi, Chengran Yang, Junda He, Xin Zhou, Jinfeng Jiang, Huihui Huang, Wen Bin Leow, Yide Yin, Eng Lieh Ouh, Lwin Khin Shar, David Lo

TL;DR

This work tackles the gap in vulnerability detection by moving beyond isolated function analysis to inter-procedural reasoning. It introduces CPRVul, a two-phase framework that profiles and ranks inter-procedural context using a code property graph, then trains LLMs to reason over the function, curated context, and vulnerability metadata. Empirical results across PrimeVul, TitanVul, and CleanVul show CPRVul achieving state-of-the-art accuracy, with notable gains on several CWEs and consistent precision improvements, illustrating the value of structured reasoning over curated context. The authors also release context-enriched benchmarks and provide thorough ablations, demonstrating that the synergy between context profiling and reasoning drives robust improvements in inter-procedural vulnerability detection and offering a path for practical deployment in real-world codebases.

Abstract

Recent progress in ML and LLMs has improved vulnerability detection, and recent datasets have reduced label noise and unrelated code changes. However, most existing approaches still operate at the function level, where models are asked to predict whether a single function is vulnerable without inter-procedural context. In practice, vulnerability presence and root cause often depend on contextual information. Naively appending such context is not a reliable solution: real-world context is long, redundant, and noisy, and we find that unstructured context frequently degrades the performance of strong fine-tuned code models. We present CPRVul, a context-aware vulnerability detection framework that couples Context Profiling and Selection with Structured Reasoning. CPRVul constructs a code property graph, and extracts candidate context. It then uses an LLM to generate security-focused profiles and assign relevance scores, selecting only high-impact contextual elements that fit within the model's context window. In the second phase, CPRVul integrates the target function, the selected context, and auxiliary vulnerability metadata to generate reasoning traces, which are used to fine-tune LLMs for reasoning-based vulnerability detection. We evaluate CPRVul on three high-quality vulnerability datasets: PrimeVul, TitanVul, and CleanVul. Across all datasets, CPRVul consistently outperforms function-only baselines, achieving accuracies ranging from 64.94% to 73.76%, compared to 56.65% to 63.68% for UniXcoder. Specifically, on the challenging PrimeVul benchmark, CPRVul achieves 67.78% accuracy, outperforming prior state-of-the-art approaches, improving accuracy from 55.17% to 67.78% (22.9% improvement). Our ablations further show that neither raw context nor processed context alone benefits strong code models; gains emerge only when processed context is paired with structured reasoning.

Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

TL;DR

Abstract

Paper Structure (54 sections, 1 figure, 8 tables)

This paper contains 54 sections, 1 figure, 8 tables.

Introduction
Our Solution
Evaluation
Main Contributions
Paper Structure
Background and Motivation
Limitations of Function-Level Vulnerability Detection
Example I: Condition Pushdown Bug in MariaDB (CVE-2021-46666)
Example II: Invalid Lock Release in USBIP Host Driver (CVE-2018-5814)
Empirical Evidence on the Use of Inter-Procedural Context
Experimental Setup
Results
Implications
Implications for Reasoning-Based Detection
CPRVul: Approach
...and 39 more sections

Figures (1)

Figure 1: Overview of our context-aware vulnerability detection framework CPRVul. The approach operates in two phases. In Phase I, we prepare and prioritize inter-procedural context through four stages: (1) extracting callers, callees, and global variables related to a target function via CPG-based static analysis; (2) constructing security risk profiles for each contextual element; (3) ranking elements by vulnerability relevance; and (4) integrating the highest-ranked context that fits within the LLM's context window. In Phase II, the target function body, selected contextual elements (callers, callees, and globals), and auxiliary vulnerability metadata (e.g., commit messages, CVE and CWE information) are jointly provided to the LLM to generate structured reasoning traces. These reasoning traces are then used for supervised fine-tuning, training the LLM to perform vulnerability detection as a reasoning task.

Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

TL;DR

Abstract

Beyond Function-Level Analysis: Context-Aware Reasoning for Inter-Procedural Vulnerability Detection

Authors

TL;DR

Abstract

Table of Contents

Figures (1)