ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator
Junda Zhu, Lingyong Yan, Haibo Shi, Dawei Yin, Lei Sha
TL;DR
The paper addresses hallucinations in retrieval-augmented QA caused by noisy or fabricated retrieved content. It introduces ATM, a two-agent adversarial framework where an Attacker fabricates and permutes retrieved documents while a Generator learns to produce golden answers despite the noise, using multi-agent iterative tuning (MITO) that combines SFT, KL regularization, and DPO-guided adversarial updates. Empirical results across four knowledge-intensive QA datasets show that ATM achieves consistent gains over state-of-the-art robustness baselines, with convergence within a few iterations. The work demonstrates a practical path to robust RAG-QA systems in noisy information environments and suggests future work on joint retriever-generator optimization.
Abstract
Large language models (LLMs) are proven to benefit a lot from retrieval-augmented generation (RAG) in alleviating hallucinations confronted with knowledge-intensive questions. RAG adopts information retrieval techniques to inject external knowledge from semantic-relevant documents as input contexts. However, since today's Internet is flooded with numerous noisy and fabricating content, it is inevitable that RAG systems are vulnerable to these noises and prone to respond incorrectly. To this end, we propose to optimize the retrieval-augmented Generator with an Adversarial Tuning Multi-agent system (ATM). The ATM steers the Generator to have a robust perspective of useful documents for question answering with the help of an auxiliary Attacker agent through adversarially tuning the agents for several iterations. After rounds of multi-agent iterative tuning, the Generator can eventually better discriminate useful documents amongst fabrications. The experimental results verify the effectiveness of ATM and we also observe that the Generator can achieve better performance compared to the state-of-the-art baselines.
