Rank It, Then Ask It: Input Reranking for Maximizing the Performance of LLMs on Symmetric Tasks

Mohsen Dehghankar; Abolfazl Asudeh

Rank It, Then Ask It: Input Reranking for Maximizing the Performance of LLMs on Symmetric Tasks

Mohsen Dehghankar, Abolfazl Asudeh

TL;DR

This work tackles improving LLM performance on symmetric tasks by reordering a bag of input elements before querying the model. It introduces the LLM input reranking problem and a two‑stage solution: (i) exposure discovery to learn rank‑position recall patterns for a given LLM, and (ii) relevance estimation to rank elements by their expected impact on the query answer, using a bipartite graph framework to debias scores. The expected utility is defined as $\mathbb{E}[utility(\pi|q)] = \sum_{i=1}^{|\mathcal{I}|} \mathbb{E}[\mathcal{X}_{\mathcal{L}}(i)] \cdot \mathbb{E}[Rel_q(e_{\pi(i)})]$, guiding the reranking. Experimental results on Graph Degree tasks and real DB queries show reranking can achieve up to 99% proximity to the optimum bound, with notable differences in memory patterns between GPT‑3.5 Turbo and GPT‑4o Mini. The approach remains model‑agnostic and acts as a wrapper to enhance symmetric‑task performance for current and future LLMs.

Abstract

Large language models (LLMs) have quickly emerged as practical and versatile tools that provide new solutions for a wide range of domains. In this paper, we consider the application of LLMs on symmetric tasks where a query is asked on an (unordered) bag of elements. Examples of such tasks include answering aggregate queries on a database table. In general, when the bag contains a large number of elements, LLMs tend to overlook some elements, leading to challenges in generating accurate responses to the query. LLMs receive their inputs as ordered sequences. However, in this problem, we leverage the fact that the symmetric input is not ordered, and reordering should not affect the LLM's response. Observing that LLMs are less likely to miss elements at certain positions of the input, we introduce the problem of LLM input reranking: to find a ranking of the input that maximizes the LLM's accuracy for the given query without making explicit assumptions about the query. Finding the optimal ranking requires identifying (i) the relevance of each input element for answering the query and (ii) the importance of each rank position for the LLM's attention. We develop algorithms for estimating these values efficiently utilizing a helper LLM. We conduct comprehensive experiments on different synthetic and real datasets to validate our proposal and to evaluate the effectiveness of our proposed algorithms. Our experiments confirm that our reranking approach improves the accuracy of the LLMs on symmetric tasks by up to $99\%$ proximity to the optimum upper bound.

Rank It, Then Ask It: Input Reranking for Maximizing the Performance of LLMs on Symmetric Tasks

TL;DR

Abstract

Rank It, Then Ask It: Input Reranking for Maximizing the Performance of LLMs on Symmetric Tasks

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (3)