Trust Me, I Know This Function: Hijacking LLM Static Analysis using Bias

Shir Bernstein; David Beste; Daniel Ayzenshteyn; Lea Schonherr; Yisroel Mirsky

Trust Me, I Know This Function: Hijacking LLM Static Analysis using Bias

Shir Bernstein, David Beste, Daniel Ayzenshteyn, Lea Schonherr, Yisroel Mirsky

TL;DR

This work uncovers a previously overlooked vulnerability in LLM-based static code analysis: abstraction bias toward familiar code patterns can cause models to misinterpret small, deterministic bugs without altering runtime behavior. The authors formalize Familiar Pattern Attacks (FPAs), introduce an automated, black-box attack generator, and demonstrate cross-model and cross-language transferability, including real-world code agents. They also explore defensive uses, show the attacks persist even when models are warned, and discuss the limitations of deduplication and traditional static/dynamic defenses. The findings highlight the need for semantic-aware robustness in code understanding systems and outline dual-use implications for security and defense.

Abstract

Large Language Models (LLMs) are increasingly trusted to perform automated code review and static analysis at scale, supporting tasks such as vulnerability detection, summarization, and refactoring. In this paper, we identify and exploit a critical vulnerability in LLM-based code analysis: an abstraction bias that causes models to overgeneralize familiar programming patterns and overlook small, meaningful bugs. Adversaries can exploit this blind spot to hijack the control flow of the LLM's interpretation with minimal edits and without affecting actual runtime behavior. We refer to this attack as a Familiar Pattern Attack (FPA). We develop a fully automated, black-box algorithm that discovers and injects FPAs into target code. Our evaluation shows that FPAs are not only effective against basic and reasoning models, but are also transferable across model families (OpenAI, Anthropic, Google), and universal across programming languages (Python, C, Rust, Go). Moreover, FPAs remain effective even when models are explicitly warned about the attack via robust system prompts. Finally, we explore positive, defensive uses of FPAs and discuss their broader implications for the reliability and safety of code-oriented LLMs.

Trust Me, I Know This Function: Hijacking LLM Static Analysis using Bias

TL;DR

Abstract

Trust Me, I Know This Function: Hijacking LLM Static Analysis using Bias

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)

Theorems & Definitions (2)