Mind Your Neighbours: Leveraging Analogous Instances for Rhetorical Role Labeling for Legal Documents
T. Y. S. S Santosh, Hassan Sarwat, Ahmed Abdou, Matthias Grabmair
TL;DR
The paper tackles Rhetorical Role Labeling (RRL) for legal judgments, a task challenged by long-range context, interdependent roles, data scarcity, and label imbalance. It introduces neighbor-based strategies at both inference and training time: interpolation with kNN, single and multiple prototypes, and neighborhood-driven losses including contrastive and prototypical objectives, including a discourse-aware variant. Across four Indian-law datasets, the methods yield macro-F1 gains, with kNN interpolation providing strong gains, prototypical learning offering robust cross-domain transfer, and discursive contrastive losses enhancing embedding structure. The work demonstrates that leveraging neighborhood information improves both in-domain performance and cross-domain generalization, contributing scalable, domain-adaptable approaches for legal document processing.
Abstract
Rhetorical Role Labeling (RRL) of legal judgments is essential for various tasks, such as case summarization, semantic search and argument mining. However, it presents challenges such as inferring sentence roles from context, interrelated roles, limited annotated data, and label imbalance. This study introduces novel techniques to enhance RRL performance by leveraging knowledge from semantically similar instances (neighbours). We explore inference-based and training-based approaches, achieving remarkable improvements in challenging macro-F1 scores. For inference-based methods, we explore interpolation techniques that bolster label predictions without re-training. While in training-based methods, we integrate prototypical learning with our novel discourse-aware contrastive method that work directly on embedding spaces. Additionally, we assess the cross-domain applicability of our methods, demonstrating their effectiveness in transferring knowledge across diverse legal domains.
