AutoRev: Multi-Modal Graph Retrieval for Automated Peer-Review Generation
Maitreya Prafulla Chitale, Ketaki Mangesh Shetye, Harshit Gupta, Manav Chaudhary, Manish Shrivastava, Vasudeva Varma
TL;DR
AutoRev presents a graph-based multi-modal Retrieval-Augmented Generation system for automated peer-review generation. By representing a research paper as a hierarchical graph and using a Graph Attention Network to select salient passages, it reduces input length for large language models and enhances review generation. The approach is trained and evaluated on ICLR 2024 data, demonstrating up to 58.72% improvements over baselines on standard metrics, with supportive human evaluation and LLM-as-a-Judge validation. AutoRev aims to streamline the peer-review workflow, enabling scalable, high-quality feedback to researchers while acknowledging ethical and practical considerations for deployment.
Abstract
Enhancing the quality and efficiency of academic publishing is critical for both authors and reviewers, as research papers are central to scholarly communication and a major source of high-quality content on the web. To support this goal, we propose AutoRev, an automatic peer-review system designed to provide actionable, high-quality feedback to both reviewers and authors. AutoRev leverages a novel Multi-Modal Retrieval-Augmented Generation (RAG) framework that combines textual and graphical representations of academic papers. By modelling documents as graphs, AutoRev effectively retrieves the most pertinent information, significantly reducing the input context length for LLMs and thereby enhancing their review generation capabilities. Experimental results show that AutoRev outperforms state-of-the-art baselines by up to 58.72% and demonstrates competitive performance in human evaluations against ground truth reviews. We envision AutoRev as a powerful tool to streamline the peer-review workflow, alleviating challenges and enabling scalable, high-quality scholarly publishing. By guiding both authors and reviewers, AutoRev has the potential to accelerate the dissemination of quality research on the web at a larger scale. Code will be released upon acceptance.
