ListT5: Listwise Reranking with Fusion-in-Decoder Improves Zero-shot Retrieval
Soyoung Yoon, Eunbi Choi, Jiyeon Kim, Hyeongu Yun, Yireun Kim, Seung-won Hwang
TL;DR
ListT5 introduces Fusion-in-Decoder based listwise reranking to jointly evaluate multiple passages and produce a sorted permutation, addressing zero-shot retrieval challenges. It extends a basic $m \to r$ unit to full $n \to k$ reranking using an $m$-ary tournament tree, achieving $O(n + k \log_m n)$ complexity and improving efficiency over prior listwise and pairwise methods. Empirically, ListT5 outperforms RankT5 on BEIR by around $+1.3$ NDCG@10 on BM25 Top-100 and demonstrates robustness to positional bias, while maintaining competitive FLOPs with pointwise baselines. The work provides extensive ablations and stability analyses, showing that generating passages in increasing relevance and using tournament-based inference contribute to zero-shot performance gains and practical usability; code and models are open-sourced for reproducibility and further research.
Abstract
We propose ListT5, a novel reranking approach based on Fusion-in-Decoder (FiD) that handles multiple candidate passages at both train and inference time. We also introduce an efficient inference framework for listwise ranking based on m-ary tournament sort with output caching. We evaluate and compare our model on the BEIR benchmark for zero-shot retrieval task, demonstrating that ListT5 (1) outperforms the state-of-the-art RankT5 baseline with a notable +1.3 gain in the average NDCG@10 score, (2) has an efficiency comparable to pointwise ranking models and surpasses the efficiency of previous listwise ranking models, and (3) overcomes the lost-in-the-middle problem of previous listwise rerankers. Our code, model checkpoints, and the evaluation framework are fully open-sourced at \url{https://github.com/soyoung97/ListT5}.
