Table of Contents
Fetching ...

Leveraging Language Models to Detect Greenwashing

Avalon Vinella, Margaret Capetz, Rebecca Pattichis, Christina Chance, Reshmi Ghosh, Kai-Wei Chang

TL;DR

A novel preliminary methodology to train a language model on generated labels for greenwashing risk, developing a preliminary mathematical formulation to quantify greenwashing risk, and a fine-tuned ClimateBERT model for this problem are introduced.

Abstract

In recent years, climate change repercussions have increasingly captured public interest. Consequently, corporations are emphasizing their environmental efforts in sustainability reports to bolster their public image. Yet, the absence of stringent regulations in review of such reports allows potential greenwashing. In this study, we introduce a novel preliminary methodology to train a language model on generated labels for greenwashing risk. Our primary contributions encompass: developing a preliminary mathematical formulation to quantify greenwashing risk, a fine-tuned ClimateBERT model for this problem, and a comparative analysis of results. On a test set comprising of sustainability reports, our best model achieved an average accuracy score of 86.34% and F1 score of 0.67, demonstrating that our proof-of-concept methodology shows a promising direction of exploration for this task.

Leveraging Language Models to Detect Greenwashing

TL;DR

A novel preliminary methodology to train a language model on generated labels for greenwashing risk, developing a preliminary mathematical formulation to quantify greenwashing risk, and a fine-tuned ClimateBERT model for this problem are introduced.

Abstract

In recent years, climate change repercussions have increasingly captured public interest. Consequently, corporations are emphasizing their environmental efforts in sustainability reports to bolster their public image. Yet, the absence of stringent regulations in review of such reports allows potential greenwashing. In this study, we introduce a novel preliminary methodology to train a language model on generated labels for greenwashing risk. Our primary contributions encompass: developing a preliminary mathematical formulation to quantify greenwashing risk, a fine-tuned ClimateBERT model for this problem, and a comparative analysis of results. On a test set comprising of sustainability reports, our best model achieved an average accuracy score of 86.34% and F1 score of 0.67, demonstrating that our proof-of-concept methodology shows a promising direction of exploration for this task.
Paper Structure (12 sections, 2 equations, 1 figure, 5 tables)

This paper contains 12 sections, 2 equations, 1 figure, 5 tables.

Figures (1)

  • Figure 1: Accuracy and F1 scores of the frozen and non-frozen ClimateBERT models during fine-tuning.