Table of Contents
Fetching ...

AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements

Adriana Eufrosina Bora, Pierre-Luc St-Charles, Mirko Bronzi, Arsène Fansi Tchango, Bruno Rousseau, Kerrie Mengersen

TL;DR

This paper presents AIMS.au, the largest sentence-level dataset of Australian Modern Slavery Act statements annotated to identify sentences containing mandated information. It details meticulous annotation guidelines, preprocessing, and gold-standard subsets, and demonstrates a machine learning pipeline for detecting Act-relevant sentences, evaluated under zero-shot and supervised settings. Results show fine-tuned models substantially surpass zero-shot large language models, with context and careful annotation driving performance. The dataset's open release and alignment with other legal contexts suggest strong practical impact for automated compliance monitoring and cross-jurisdiction research in supply chain transparency.

Abstract

Despite over a decade of legislative efforts to address modern slavery in the supply chains of large corporations, the effectiveness of government oversight remains hampered by the challenge of scrutinizing thousands of statements annually. While Large Language Models (LLMs) can be considered a well established solution for the automatic analysis and summarization of documents, recognizing concrete modern slavery countermeasures taken by companies and differentiating those from vague claims remains a challenging task. To help evaluate and fine-tune LLMs for the assessment of corporate statements, we introduce a dataset composed of 5,731 modern slavery statements taken from the Australian Modern Slavery Register and annotated at the sentence level. This paper details the construction steps for the dataset that include the careful design of annotation specifications, the selection and preprocessing of statements, and the creation of high-quality annotation subsets for effective model evaluations. To demonstrate our dataset's utility, we propose a machine learning methodology for the detection of sentences relevant to mandatory reporting requirements set by the Australian Modern Slavery Act. We then follow this methodology to benchmark modern language models under zero-shot and supervised learning settings.

AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements

TL;DR

This paper presents AIMS.au, the largest sentence-level dataset of Australian Modern Slavery Act statements annotated to identify sentences containing mandated information. It details meticulous annotation guidelines, preprocessing, and gold-standard subsets, and demonstrates a machine learning pipeline for detecting Act-relevant sentences, evaluated under zero-shot and supervised settings. Results show fine-tuned models substantially surpass zero-shot large language models, with context and careful annotation driving performance. The dataset's open release and alignment with other legal contexts suggest strong practical impact for automated compliance monitoring and cross-jurisdiction research in supply chain transparency.

Abstract

Despite over a decade of legislative efforts to address modern slavery in the supply chains of large corporations, the effectiveness of government oversight remains hampered by the challenge of scrutinizing thousands of statements annually. While Large Language Models (LLMs) can be considered a well established solution for the automatic analysis and summarization of documents, recognizing concrete modern slavery countermeasures taken by companies and differentiating those from vague claims remains a challenging task. To help evaluate and fine-tune LLMs for the assessment of corporate statements, we introduce a dataset composed of 5,731 modern slavery statements taken from the Australian Modern Slavery Register and annotated at the sentence level. This paper details the construction steps for the dataset that include the careful design of annotation specifications, the selection and preprocessing of statements, and the creation of high-quality annotation subsets for effective model evaluations. To demonstrate our dataset's utility, we propose a machine learning methodology for the detection of sentences relevant to mandatory reporting requirements set by the Australian Modern Slavery Act. We then follow this methodology to benchmark modern language models under zero-shot and supervised learning settings.

Paper Structure

This paper contains 26 sections, 11 figures, 6 tables.

Figures (11)

  • Figure 1: Correspondences between the AU MSA Mandatory Criteria and the questions designed for the annotation of the proposed AIMS.au dataset, with fictitious examples of disclosures that could be found in statements published by reporting entities.
  • Figure 1: Agreement scores averaged across all double-annotated statements. We report the intersection over union (IAA) and Cohen’s Kappa (CK). The two scores are relatively comparable except for the most imbalanced criterion (C4, “remediation”) whose CK score is more negatively impacted.
  • Figure 2: Overview of the annotation workflow for the AIMS.au dataset.
  • Figure 3: Overview of the distribution of text across the 5,731 statements in our proposed dataset.
  • Figure 4: Distribution of relevant sentences found by annotators over the total number of sentences per statement for our eleven questions.
  • ...and 6 more figures