Detecting LLM-Generated Spam Reviews by Integrating Language Model Embeddings and Graph Neural Network

Xin Liu; Rongwu Xu; Xinyi Jia; Jason Liao; Jiao Sun; Ling Huang; Wei Xu

Detecting LLM-Generated Spam Reviews by Integrating Language Model Embeddings and Graph Neural Network

Xin Liu, Rongwu Xu, Xinyi Jia, Jason Liao, Jiao Sun, Ling Huang, Wei Xu

TL;DR

The paper tackles the rising problem of LLM-generated spam reviews by synthesizing three realistic datasets and introducing FraudSquad, a hybrid detector that fuses LM-derived embeddings with a gated graph transformer on a review graph. FraudSquad achieves state-of-the-art detection without manual feature engineering and with minimal labeled data, outperforming baselines by up to 44.22% in precision and 43.01% in recall on synthetic datasets and showing strong results on human-written spam as well. The approach leverages both semantic content and user-behavior signals, demonstrating robustness across datasets and highlighting the importance of adapting spam detection to LLM-era threats. The work provides practical impact for online platforms by offering an efficient, scalable framework and releasing code and synthetic datasets to advance research in e-commerce security.

Abstract

The rise of large language models (LLMs) has enabled the generation of highly persuasive spam reviews that closely mimic human writing. These reviews pose significant challenges for existing detection systems and threaten the credibility of online platforms. In this work, we first create three realistic LLM-generated spam review datasets using three distinct LLMs, each guided by product metadata and genuine reference reviews. Evaluations by GPT-4.1 confirm the high persuasion and deceptive potential of these reviews. To address this threat, we propose FraudSquad, a hybrid detection model that integrates text embeddings from a pre-trained language model with a gated graph transformer for spam node classification. FraudSquad captures both semantic and behavioral signals without relying on manual feature engineering or massive training resources. Experiments show that FraudSquad outperforms state-of-the-art baselines by up to 44.22% in precision and 43.01% in recall on three LLM-generated datasets, while also achieving promising results on two human-written spam datasets. Furthermore, FraudSquad maintains a modest model size and requires minimal labeled training data, making it a practical solution for real-world applications. Our contributions include new synthetic datasets, a practical detection framework, and empirical evidence highlighting the urgency of adapting spam detection to the LLM era. Our code and datasets are available at: https://anonymous.4open.science/r/FraudSquad-5389/.

Detecting LLM-Generated Spam Reviews by Integrating Language Model Embeddings and Graph Neural Network

TL;DR

Abstract

Detecting LLM-Generated Spam Reviews by Integrating Language Model Embeddings and Graph Neural Network

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)