Japanese Tort-case Dataset for Rationale-supported Legal Judgment Prediction
Hiroaki Yamada, Takenobu Tokunaga, Ryutaro Ohara, Akira Tokutsu, Keisuke Takeshita, Mihoko Sumida
TL;DR
The paper introduces the Japanese Tort-case Dataset (JTD), the first large-scale, real-judgment dataset for Japanese Legal Judgment Prediction, and defines two tasks: Tort Prediction and Rationale Extraction. It provides a detailed, multi-stage annotation pipeline with 41 legal experts and character-level rationale spans, supported by inter-annotator reliability analyses. A hierarchical Inter-Span Transformer (IST) architecture plus multi-task learning establishes strong baselines, showing that joint modeling of outcomes and rationales improves performance, albeit with substantial room for improvement relative to expert judgments. Error analysis highlights missing external knowledge and data limitations inherent to publicly available judgment documents, informing future dataset enhancements and modeling strategies. The work offers a valuable resource for Japanese legal NLP and sets a foundation for cross-jurisdictional, explainable LJP research in civil law contexts.
Abstract
This paper presents the Japanese Tort-case Dataset (JTD), the first dataset for Japanese Legal Judgment Prediction (LJP). JTD features two tasks: tort prediction and rationale extraction. The rationale extraction task, which is novel in the field, identifies which of the arguments alleged by plaintiffs and defendants the court accepted. JTD is constructed from 3,477 Japanese Civil Code judgments annotated by 41 legal experts, resulting in 7,978 instances with 59,697 arguments alleged by the involved parties. Our baseline experiments show the feasibility of the two proposed tasks, and our error analysis by legal experts identifies sources of errors and suggests future directions for LJP research.
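To make the two tasks concrete, the following is a minimal sketch of how one instance might be represented. It assumes that an instance pairs a list of alleged arguments with a tort outcome. The class names, field names, and example texts are hypothetical illustrations, not the dataset's actual schema or contents.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AllegedArgument:
    """One argument alleged by a party (hypothetical schema)."""
    party: str      # "plaintiff" or "defendant"
    text: str       # the argument as stated in the judgment
    accepted: bool  # rationale label: did the court accept this argument?


@dataclass
class TortInstance:
    """One prediction instance (hypothetical schema)."""
    arguments: List[AllegedArgument]
    tort_affirmed: bool  # tort prediction label


def rationale_labels(instance: TortInstance) -> List[bool]:
    """Return the per-argument targets for rationale extraction."""
    return [arg.accepted for arg in instance.arguments]


# Invented example for illustration only; not taken from JTD.
example = TortInstance(
    arguments=[
        AllegedArgument("plaintiff", "The defendant's post damaged my reputation.", True),
        AllegedArgument("defendant", "The post stated only true facts.", False),
    ],
    tort_affirmed=True,
)

print(example.tort_affirmed)      # target for tort prediction
print(rationale_labels(example))  # targets for rationale extraction
```

Under this reading, tort prediction is a single binary decision per instance, while rationale extraction is one binary decision per alleged argument.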
