Table of Contents
Fetching ...

Factuality and Transparency Are All RAG Needs! Self-Explaining Contrastive Evidence Re-ranking

Francielle Vargas, Daniel Pedronette

TL;DR

The paper addresses the unreliability of retrieval-augmented generation in safety-critical domains by introducing Self-Explaining Contrastive Evidence Re-ranking (CER). CER uses a two-stage approach: contrastive retrieval with an evidence-sensitive embedding space trained via triplet loss, and self-explaining re-ranking that provides token-level attributions to expose evidential reasoning, feeding an LLM to generate grounded responses. On clinical trial data, CER yields improved retrieval accuracy and clearer differentiation between evidential and non-evidential content, reducing potential hallucinations and increasing transparency. This work advances reliable, evidence-based RAG in healthcare and offers a framework for interpretable, evidence-grounded AI pipelines.

Abstract

This extended abstract introduces Self-Explaining Contrastive Evidence Re-Ranking (CER), a novel method that restructures retrieval around factual evidence by fine-tuning embeddings with contrastive learning and generating token-level attribution rationales for each retrieved passage. Hard negatives are automatically selected using a subjectivity-based criterion, forcing the model to pull factual rationales closer while pushing subjective or misleading explanations apart. As a result, the method creates an embedding space explicitly aligned with evidential reasoning. We evaluated our method on clinical trial reports, and initial experimental results show that CER improves retrieval accuracy, mitigates the potential for hallucinations in RAG systems, and provides transparent, evidence-based retrieval that enhances reliability, especially in safety-critical domains.

Factuality and Transparency Are All RAG Needs! Self-Explaining Contrastive Evidence Re-ranking

TL;DR

The paper addresses the unreliability of retrieval-augmented generation in safety-critical domains by introducing Self-Explaining Contrastive Evidence Re-ranking (CER). CER uses a two-stage approach: contrastive retrieval with an evidence-sensitive embedding space trained via triplet loss, and self-explaining re-ranking that provides token-level attributions to expose evidential reasoning, feeding an LLM to generate grounded responses. On clinical trial data, CER yields improved retrieval accuracy and clearer differentiation between evidential and non-evidential content, reducing potential hallucinations and increasing transparency. This work advances reliable, evidence-based RAG in healthcare and offers a framework for interpretable, evidence-grounded AI pipelines.

Abstract

This extended abstract introduces Self-Explaining Contrastive Evidence Re-Ranking (CER), a novel method that restructures retrieval around factual evidence by fine-tuning embeddings with contrastive learning and generating token-level attribution rationales for each retrieved passage. Hard negatives are automatically selected using a subjectivity-based criterion, forcing the model to pull factual rationales closer while pushing subjective or misleading explanations apart. As a result, the method creates an embedding space explicitly aligned with evidential reasoning. We evaluated our method on clinical trial reports, and initial experimental results show that CER improves retrieval accuracy, mitigates the potential for hallucinations in RAG systems, and provides transparent, evidence-based retrieval that enhances reliability, especially in safety-critical domains.

Paper Structure

This paper contains 4 sections, 3 figures.

Figures (3)

  • Figure 1: The pipeline for self-Explaining Re-Ranking with Contrastive Evidence Selection for Retrieval-Augmented Generation.
  • Figure 2: Self-learning and fine-tuned models evaluated using both Euclidean and cosine distance metrics.
  • Figure 3: Results for the Contriever self-learning model (left) and the Contriever fine-tuned model (right). For the self-learning model, the average distance among positive pairs is Intra-Pos = 0.5716 and among negative pairs is Intra-Neg = 0.5977; the average distance between positives and negatives is Inter = 0.5901. For the fine-tuned model, the average distance among positive pairs is Intra-Pos = 0.7766 and among negative pairs is Intra-Neg = 0.8141; the average distance between positives and negatives is Inter = 0.8110.