Beyond Reproducibility: Advancing Zero-shot LLM Reranking Efficiency with Setwise Insertion

Jakub Podolak; Leon Peric; Mina Janicijevic; Roxana Petcu

Beyond Reproducibility: Advancing Zero-shot LLM Reranking Efficiency with Setwise Insertion

Jakub Podolak, Leon Peric, Mina Janicijevic, Roxana Petcu

TL;DR

The paper addresses the efficiency and effectiveness of zero-shot document reranking with Large Language Models by reproducing Zhuang et al.'s Setwise method and introducing Setwise Insertion. It demonstrates that leveraging a prior initial ranking within Setwise prompts reduces redundant comparisons and stabilizes decisions, achieving a 31% faster query time and a 23% reduction in LLM inferences while slightly improving NDCG@10. Experiments across multiple architectures (Flan-T5, Vicuna, Llama) and datasets (TREC 2019/2020) confirm Setwise's superior trade-offs against Pointwise, Pairwise, and Listwise baselines, and show strong robustness in both encoder-decoder and decoder-only models. The work highlights practical gains for efficient, accurate zero-shot reranking and suggests future exploration on broader datasets and model families.

Abstract

This study presents a comprehensive reproducibility and extension analysis of the Setwise prompting methodology for zero-shot ranking with Large Language Models (LLMs), as proposed by Zhuang et al. We evaluate its effectiveness and efficiency compared to traditional Pointwise, Pairwise, and Listwise approaches in document ranking tasks. Our reproduction confirms the findings of Zhuang et al., highlighting the trade-offs between computational efficiency and ranking effectiveness in Setwise methods. Building on these insights, we introduce Setwise Insertion, a novel approach that leverages the initial document ranking as prior knowledge, reducing unnecessary comparisons and uncertainty by focusing on candidates more likely to improve the ranking results. Experimental results across multiple LLM architectures (Flan-T5, Vicuna, and Llama) show that Setwise Insertion yields a 31% reduction in query time, a 23% reduction in model inferences, and a slight improvement in reranking effectiveness compared to the original Setwise method. These findings highlight the practical advantage of incorporating prior ranking knowledge into Setwise prompting for efficient and accurate zero-shot document reranking.

Beyond Reproducibility: Advancing Zero-shot LLM Reranking Efficiency with Setwise Insertion

TL;DR

Abstract

Beyond Reproducibility: Advancing Zero-shot LLM Reranking Efficiency with Setwise Insertion

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)