StealthRank: LLM Ranking Manipulation via Stealthy Prompt Optimization

Yiming Tang; Yi Fan; Chenxiao Yu; Tiankai Yang; Yue Zhao; Xiyang Hu

StealthRank: LLM Ranking Manipulation via Stealthy Prompt Optimization

Yiming Tang, Yi Fan, Chenxiao Yu, Tiankai Yang, Yue Zhao, Xiyang Hu

TL;DR

StealthRank investigates stealthy adversarial manipulations of LLM-based ranking by injecting optimized prompts into item descriptions. It introduces an energy-based SRP objective, optimized with Langevin dynamics in logit space to balance ranking gains, linguistic fluency, and avoidance of obvious promotional cues. Across four instruction-tuned LLM rerankers and two datasets, SRP achieves stronger target promotion while maintaining natural language and minimal detectable signals, with ablations and human studies corroborating its effectiveness. The work highlights security vulnerabilities in LLM-driven retrieval systems and motivates defenses to strengthen robustness in practical product search and document retrieval pipelines.

Abstract

The integration of large language models (LLMs) into information retrieval systems introduces new attack surfaces, particularly for adversarial ranking manipulations. We present $\textbf{StealthRank}$, a novel adversarial attack method that manipulates LLM-driven ranking systems while maintaining textual fluency and stealth. Unlike existing methods that often introduce detectable anomalies, StealthRank employs an energy-based optimization framework combined with Langevin dynamics to generate StealthRank Prompts (SRPs)-adversarial text sequences embedded within item or document descriptions that subtly yet effectively influence LLM ranking mechanisms. We evaluate StealthRank across multiple LLMs, demonstrating its ability to covertly boost the ranking of target items while avoiding explicit manipulation traces. Our results show that StealthRank consistently outperforms state-of-the-art adversarial ranking baselines in both effectiveness and stealth, highlighting critical vulnerabilities in LLM-driven ranking systems. Our code is publicly available at $\href{https://github.com/Tangyiming205069/controllable-seo}{here}$.

StealthRank: LLM Ranking Manipulation via Stealthy Prompt Optimization

TL;DR

Abstract

The integration of large language models (LLMs) into information retrieval systems introduces new attack surfaces, particularly for adversarial ranking manipulations. We present

, a novel adversarial attack method that manipulates LLM-driven ranking systems while maintaining textual fluency and stealth. Unlike existing methods that often introduce detectable anomalies, StealthRank employs an energy-based optimization framework combined with Langevin dynamics to generate StealthRank Prompts (SRPs)-adversarial text sequences embedded within item or document descriptions that subtly yet effectively influence LLM ranking mechanisms. We evaluate StealthRank across multiple LLMs, demonstrating its ability to covertly boost the ranking of target items while avoiding explicit manipulation traces. Our results show that StealthRank consistently outperforms state-of-the-art adversarial ranking baselines in both effectiveness and stealth, highlighting critical vulnerabilities in LLM-driven ranking systems. Our code is publicly available at

StealthRank: LLM Ranking Manipulation via Stealthy Prompt Optimization

TL;DR

Abstract

StealthRank: LLM Ranking Manipulation via Stealthy Prompt Optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)