SPLADE-v3: New baselines for SPLADE

Carlos Lassance; Hervé Déjean; Thibault Formal; Stéphane Clinchant

SPLADE-v3: New baselines for SPLADE

Carlos Lassance, Hervé Déjean, Thibault Formal, Stéphane Clinchant

TL;DR

SPLADE-v3 introduces training-structure enhancements to SPLADE that yield strong first-stage retrieval baselines. It leverages multiple negatives per batch, ensemble distillation scores, and dual distillation losses (KL-Div and MarginMSE), with targeted fine-tuning starting from SPLADE++SelfDistil. Evaluations via a meta-analysis across 44 query sets show significant gains over BM25 and SPLADE++, and competitive performance with cross-encoder re-rankers on MS MARCO and BEIR. The work also explores efficiency-focused variants (DistilBERT, Lexical, Doc) to balance retrieval effectiveness and deployment cost, offering robust, scalable baselines for information retrieval systems.

Abstract

A companion to the release of the latest version of the SPLADE library. We describe changes to the training structure and present our latest series of models -- SPLADE-v3. We compare this new version to BM25, SPLADE++, as well as re-rankers, and showcase its effectiveness via a meta-analysis over more than 40 query sets. SPLADE-v3 further pushes the limit of SPLADE models: it is statistically significantly more effective than both BM25 and SPLADE++, while comparing well to cross-encoder re-rankers. Specifically, it gets more than 40 MRR@10 on the MS MARCO dev set, and improves by 2% the out-of-domain results on the BEIR benchmark.

SPLADE-v3: New baselines for SPLADE

TL;DR

Abstract

Paper Structure (14 sections, 4 figures, 2 tables)

This paper contains 14 sections, 4 figures, 2 tables.

Introduction
Better Training
Multiple Negatives Per Batch
Better Distillation Scores
Two Distillation Losses
Further Fine-Tuning SPLADE
A New Baseline, SPLADE-v3
SPLADE-v3
Evaluation
Comparison to BM25
Comparison to SPLADE++SelfDistil
Comparison to re-rankers
SPLADE-v3-DistilBERT, SPLADE-v3-Lexical and SPLADE-v3-Doc
Conclusion

Figures (4)

Figure 1: Meta-analysis comparison of SPLADE-v3 and BM25.
Figure 2: Meta-analysis comparison of SPLADE-v3 and SPLADE++SelfDistil.
Figure 3: Meta-analysis comparison of SPLADE-v3 and MiniLM (re-ranking the top-50 returned by SPLADE-v3).
Figure 4: Meta-analysis comparison of SPLADE-v3 and DeBERTaV3 (re-ranking the top-50 returned by SPLADE-v3).

SPLADE-v3: New baselines for SPLADE

TL;DR

Abstract

SPLADE-v3: New baselines for SPLADE

Authors

TL;DR

Abstract

Table of Contents

Figures (4)