Table of Contents
Fetching ...

LLM-VPRF: Large Language Model Based Vector Pseudo Relevance Feedback

Hang Li, Shengyao Zhuang, Bevan Koopman, Guido Zuccon

TL;DR

This work demonstrates that Vector Pseudo Relevance Feedback (VPRF) can extend to Large Language Model (LLM)–based dense retrievers, enabling iterative refinement of query embeddings via feedback passages. It formalizes two VPRF variants, Average and Rocchio, for LLM embeddings and evaluates across BEIR and TREC Deep Learning benchmarks using three state-of-the-art LLM backbones (PromptReps, RepLLaMa, LLM2Vec). The results show consistent performance gains with modest computational overhead, though the magnitude and stability of improvements depend on the specific LLM architecture and dataset characteristics. The findings bridge VPRF with modern LLM-based retrieval, offering practical guidance for deploying feedback-enhanced dense retrieval in real-world systems while highlighting areas for improved feedback selection and broader model coverage.

Abstract

Vector Pseudo Relevance Feedback (VPRF) has shown promising results in improving BERT-based dense retrieval systems through iterative refinement of query representations. This paper investigates the generalizability of VPRF to Large Language Model (LLM) based dense retrievers. We introduce LLM-VPRF and evaluate its effectiveness across multiple benchmark datasets, analyzing how different LLMs impact the feedback mechanism. Our results demonstrate that VPRF's benefits successfully extend to LLM architectures, establishing it as a robust technique for enhancing dense retrieval performance regardless of the underlying models. This work bridges the gap between VPRF with traditional BERT-based dense retrievers and modern LLMs, while providing insights into their future directions.

LLM-VPRF: Large Language Model Based Vector Pseudo Relevance Feedback

TL;DR

This work demonstrates that Vector Pseudo Relevance Feedback (VPRF) can extend to Large Language Model (LLM)–based dense retrievers, enabling iterative refinement of query embeddings via feedback passages. It formalizes two VPRF variants, Average and Rocchio, for LLM embeddings and evaluates across BEIR and TREC Deep Learning benchmarks using three state-of-the-art LLM backbones (PromptReps, RepLLaMa, LLM2Vec). The results show consistent performance gains with modest computational overhead, though the magnitude and stability of improvements depend on the specific LLM architecture and dataset characteristics. The findings bridge VPRF with modern LLM-based retrieval, offering practical guidance for deploying feedback-enhanced dense retrieval in real-world systems while highlighting areas for improved feedback selection and broader model coverage.

Abstract

Vector Pseudo Relevance Feedback (VPRF) has shown promising results in improving BERT-based dense retrieval systems through iterative refinement of query representations. This paper investigates the generalizability of VPRF to Large Language Model (LLM) based dense retrievers. We introduce LLM-VPRF and evaluate its effectiveness across multiple benchmark datasets, analyzing how different LLMs impact the feedback mechanism. Our results demonstrate that VPRF's benefits successfully extend to LLM architectures, establishing it as a robust technique for enhancing dense retrieval performance regardless of the underlying models. This work bridges the gap between VPRF with traditional BERT-based dense retrievers and modern LLMs, while providing insights into their future directions.

Paper Structure

This paper contains 15 sections, 2 equations, 3 tables.