RankLLM: A Python Package for Reranking with LLMs

Sahel Sharifymoghaddam; Ronak Pradeep; Andre Slavescu; Ryan Nguyen; Andrew Xu; Zijian Chen; Yilin Zhang; Yidi Chen; Jasper Xian; Jimmy Lin

RankLLM: A Python Package for Reranking with LLMs

Sahel Sharifymoghaddam, Ronak Pradeep, Andre Slavescu, Ryan Nguyen, Andrew Xu, Zijian Chen, Yilin Zhang, Yidi Chen, Jasper Xian, Jimmy Lin

TL;DR

Rank-LLM tackles the fragmentation in LLM-based reranking by delivering a modular, open-source Python package that supports pointwise, pairwise, and listwise reranking with a broad set of LLMs. It integrates retrieval (via Pyserini), evaluation, analysis, and training to provide end-to-end, reproducible workflows, including 2CR reproducibility pages. The framework handles large candidate lists through a sliding window approach and enforces robust post-processing of model outputs, with configurable prompt templates and diverse coordinators (Mono-T5, Duo-T5, LiT5, SafeOpenai, SafeGenai, vLLM-based OSLLM, etc.). This work enables rapid experimentation and benchmarking in retrieval-augmented pipelines, promotes transparency and replicability, and supports end-to-end deployment through integration with LangChain, LlamaIndex, and popular inference backends.

Abstract

The adoption of large language models (LLMs) as rerankers in multi-stage retrieval systems has gained significant traction in academia and industry. These models refine a candidate list of retrieved documents, often through carefully designed prompts, and are typically used in applications built on retrieval-augmented generation (RAG). This paper introduces RankLLM, an open-source Python package for reranking that is modular, highly configurable, and supports both proprietary and open-source LLMs in customized reranking workflows. To improve usability, RankLLM features optional integration with Pyserini for retrieval and provides integrated evaluation for multi-stage pipelines. Additionally, RankLLM includes a module for detailed analysis of input prompts and LLM responses, addressing reliability concerns with LLM APIs and non-deterministic behavior in Mixture-of-Experts (MoE) models. This paper presents the architecture of RankLLM, along with a detailed step-by-step guide and sample code. We reproduce results from RankGPT, LRL, RankVicuna, RankZephyr, and other recent models. RankLLM integrates with common inference frameworks and a wide range of LLMs. This compatibility allows for quick reproduction of reported results, helping to speed up both research and real-world applications. The complete repository is available at rankllm.ai, and the package can be installed via PyPI.

RankLLM: A Python Package for Reranking with LLMs

TL;DR

Abstract

RankLLM: A Python Package for Reranking with LLMs

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)