LegalDuet: Learning Fine-grained Representations for Legal Judgment Prediction via a Dual-View Contrastive Learning

Buqiang Xu; Xin Dai; Zhenghao Liu; Huiyuan Xie; Xiaoyuan Yi; Shuo Wang; Yukun Yan; Liner Yang; Yu Gu; Ge Yu

LegalDuet: Learning Fine-grained Representations for Legal Judgment Prediction via a Dual-View Contrastive Learning

Buqiang Xu, Xin Dai, Zhenghao Liu, Huiyuan Xie, Xiaoyuan Yi, Shuo Wang, Yukun Yan, Liner Yang, Yu Gu, Ge Yu

TL;DR

LegalDuet tackles fine-grained Legal Judgment Prediction by replacing reliance on token-level cues with a continuous pretraining regime that learns tailored embeddings for criminal facts. It introduces dual-view contrastive learning with Law Case Clustering ($\mathcal{L}_{LCC}$) and Legal Decision Matching ($\mathcal{L}_{LDM}$), jointly optimized as $\mathcal{L}_{LegalDuet} = \mathcal{L}_{LCC} + \mathcal{L}_{LDM}$ to shape a discriminative embedding space. Evaluated on CAIL2018, LegalDuet consistently outperforms baselines across tasks and backbones, reduces prediction entropy, and yields more compact, well-separated embeddings (lower Davies-Bouldin Index) that better align criminal facts with legal decisions. The approach generalizes to multiple PLMs (e.g., BERT-xs, BERT-Chinese) without requiring bespoke LJP architectures, and the authors release the code for reproducibility.

Abstract

Legal Judgment Prediction (LJP) is a fundamental task of legal artificial intelligence, aiming to automatically predict the judgment outcomes of legal cases. Existing LJP models primarily focus on identifying legal triggers within criminal fact descriptions by contrastively training language models. However, these LJP models overlook the importance of learning to effectively distinguish subtle differences among judgments, which is crucial for producing more accurate predictions. In this paper, we propose LegalDuet, which continuously pretrains language models to learn a more tailored embedding space for representing legal cases. Specifically, LegalDuet designs a dual-view mechanism to continuously pretrain language models: 1) Law Case Clustering retrieves similar cases as hard negatives and employs contrastive training to differentiate among confusing cases; 2) Legal Decision Matching aims to identify legal clues within criminal fact descriptions to align them with the chain of reasoning that contains the correct legal decision. Our experiments on the CAIL2018 dataset demonstrate the effectiveness of LegalDuet. Further analysis reveals that LegalDuet improves the ability of pretrained language models to distinguish confusing criminal charges by reducing prediction uncertainty and enhancing the separability of criminal charges. The experiments demonstrate that LegalDuet produces a more concentrated and distinguishable embedding space, effectively aligning criminal facts with corresponding legal decisions. The code is available at https://github.com/NEUIR/LegalDuet.

LegalDuet: Learning Fine-grained Representations for Legal Judgment Prediction via a Dual-View Contrastive Learning

TL;DR

) and Legal Decision Matching (

), jointly optimized as

to shape a discriminative embedding space. Evaluated on CAIL2018, LegalDuet consistently outperforms baselines across tasks and backbones, reduces prediction entropy, and yields more compact, well-separated embeddings (lower Davies-Bouldin Index) that better align criminal facts with legal decisions. The approach generalizes to multiple PLMs (e.g., BERT-xs, BERT-Chinese) without requiring bespoke LJP architectures, and the authors release the code for reproducibility.

Abstract

Paper Structure (20 sections, 14 equations, 11 figures, 7 tables)

This paper contains 20 sections, 14 equations, 11 figures, 7 tables.

Introduction
Related Work
Methodology
Preliminary of Legal Judgment Prediction
Fine-grained Legal Representation Learning through the Dual-View Contrastive Learning
Experiment Methodology
Evaluation Results
Overall Performance
Ablation Study
Learned Embeddings of Criminal Facts Optimized by LegalDuet
Conclusion
Appendix
Details of Legal Decision Construction
Further Processing of Hard Negatives for Contrastive Learning
Implementation Details of Baselines
...and 5 more sections

Figures (11)

Figure 1: An Example of Dual-View Contrastive Learning Mechanism in LegalDuet. LegalDuet incorporates both Law Case Clustering (LCC) and Legal Decision Matching (LDM) tasks for continuously pretraining language models.
Figure 2: Illustration of Our LegalDuet.
Figure 3: Entropy Distributions of Legal Judgment Predictions.
Figure 4: DBI Reduction Values and Embedding Visualizations of SAILER and LegalDuet. The embeddings of criminal facts, Provoking Troubles, Robbery, Fraud, Intentional Homicide, Theft, Intentional Injury, are annotated.
Figure 5: Legal Decision Template Used in LegalDuet.
...and 6 more figures

LegalDuet: Learning Fine-grained Representations for Legal Judgment Prediction via a Dual-View Contrastive Learning

TL;DR

Abstract

LegalDuet: Learning Fine-grained Representations for Legal Judgment Prediction via a Dual-View Contrastive Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (11)