From Fragments to Facts: A Curriculum-Driven DPO Approach for Generating Hindi News Veracity Explanations

Pulkit Bansal; Raghvendra Kumar; Shakti Singh; Sriparna Saha; Adam Jatowt

From Fragments to Facts: A Curriculum-Driven DPO Approach for Generating Hindi News Veracity Explanations

Pulkit Bansal, Raghvendra Kumar, Shakti Singh, Sriparna Saha, Adam Jatowt

TL;DR

This work tackles the challenge of generating reliable Hindi news explanations by introducing DeFactoX, a two-stage framework that couples veracity prediction with explanation generation. It advances a curriculum-based training paradigm and a novel Hin-DPO objective that leverages Actuality (factual correctness) and Finesse (output stability) to align model explanations with human reasoning. A synthetic, ranking-based Hindi preference dataset grounds the alignment in human-like explanations, while experiments across multiple LLMs and PLMs demonstrate improved semantic quality and veracity alignment over strong baselines. The approach offers a scalable path to automated, trustworthy explanations for Hindi misinformation, with potential extension to other low-resource languages through multilingual transfer and human-in-the-loop feedback.

Abstract

In an era of rampant misinformation, generating reliable news explanations is vital, especially for under-represented languages like Hindi. Lacking robust automated tools, Hindi faces challenges in scaling misinformation detection. To bridge this gap, we propose a novel framework integrating Direct Preference Optimization (DPO) with curriculum learning to align machine-generated explanations with human reasoning. Fact-checked explanations from credible sources serve as preferred responses, while LLM outputs highlight system limitations and serve as non-preferred responses. To refine task-specific alignment, we introduce two key parameters -- Actuality and Finesse -- into the DPO loss function, enhancing explanation quality and consistency. Experiments with LLMs (Mistral, Llama, Gemma) and PLMs (mBART, mT5) confirm the framework's effectiveness in generating coherent, contextually relevant explanations. This scalable approach combats misinformation and extends automated explanation generation to low-resource languages.

From Fragments to Facts: A Curriculum-Driven DPO Approach for Generating Hindi News Veracity Explanations

TL;DR

Abstract

From Fragments to Facts: A Curriculum-Driven DPO Approach for Generating Hindi News Veracity Explanations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)