Judging by the Rules: Compliance-Aligned Framework for Modern Slavery Statement Monitoring
Wenhao Xu, Akshatha Arodi, Jian-Yun Nie, Arsene Fansi Tchango
TL;DR
This paper tackles the challenge of scalable, auditable compliance verification for modern slavery statements by reframing the task as rule-alignment rather than purely factual classification. It introduces a two-stage framework: key-rule extraction to distill regulatory rubrics, and CALLM, a rule-aligned LLM trained with feedback from a domain-specific Compliance Alignment Judge (CA-Judge) using Group Relative Policy Optimization. CALLM produces outputs that are not only accurate but explicitly tethered to statutory rules, improving auditability and human verifiability in high-stakes regulatory contexts, and demonstrating cross-jurisdiction generalization. The approach yields superior task performance, stronger rule-adherence in generated justifications, and favorable human preferences compared with larger models, suggesting practical benefits for real-world compliance review of modern slavery disclosures.
Abstract
Modern slavery affects millions of people worldwide, and regulatory frameworks such as Modern Slavery Acts now require companies to publish detailed disclosures. However, these statements are often vague and inconsistent, making manual review time-consuming and difficult to scale. While NLP offers a promising path forward, high-stakes compliance tasks require more than accurate classification: they demand transparent, rule-aligned outputs that legal experts can verify. Existing applications of large language models (LLMs) often reduce complex regulatory assessments to binary decisions, lacking the necessary structure for robust legal scrutiny. We argue that compliance verification is fundamentally a rule-matching problem: it requires evaluating whether textual statements adhere to well-defined regulatory rules. To this end, we propose a novel framework that harnesses AI for rule-level compliance verification while preserving expert oversight. At its core is the Compliance Alignment Judge (CA-Judge), which evaluates model-generated justifications based on their fidelity to statutory requirements. Using this feedback, we train the Compliance Alignment LLM (CALLM), a model that produces rule-consistent, human-verifiable outputs. CALLM improves predictive performance and generates outputs that are both transparent and legally grounded, offering a more verifiable and actionable solution for real-world compliance analysis.
