SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation

Qilong Wu; Xiaoneng Xiang; Hejia Huang; Xuan Wang; Yeo Wei Jie; Ranjan Satapathy; Ricardo Shirota Filho; Bharadwaj Veeravalli

SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation

Qilong Wu, Xiaoneng Xiang, Hejia Huang, Xuan Wang, Yeo Wei Jie, Ranjan Satapathy, Ricardo Shirota Filho, Bharadwaj Veeravalli

TL;DR

SusGen-30K fills a critical gap by providing a balanced, multi-task dataset and a tuned LLM family (SusGen-GPT) for finance and ESG tasks, including ESG-report generation. The authors also introduce TCFD-Bench to evaluate ESG reporting quality and demonstrate that SusGen-GPT achieves near GPT-4 performance with only 7–8B parameters, aided by Retrieval-Augmented Generation (RAG) for extracting ESG content from annual reports. A data-centric fine-tuning pipeline with LoRA/QLoRA, multilingual augmentation, anonymization, and instruction-based formatting enables strong cross-task performance while maintaining efficiency. The work highlights practical, scalable approaches for advancing financial NLP and ESG reporting, and suggests future work to broaden ESG subtasks, diversify report templates, and enhance evaluation. Overall, SusGen-GPT offers a deployable, efficient tool for generating TCFD-compliant ESG reports and supporting ESG-related inquiries in finance.

Abstract

The rapid growth of the financial sector and the rising focus on Environmental, Social, and Governance (ESG) considerations highlight the need for advanced NLP tools. However, open-source LLMs proficient in both finance and ESG domains remain scarce. To address this gap, we introduce SusGen-30K, a category-balanced dataset comprising seven financial NLP tasks and ESG report generation, and propose TCFD-Bench, a benchmark for evaluating sustainability report generation. Leveraging this dataset, we developed SusGen-GPT, a suite of models achieving state-of-the-art performance across six adapted and two off-the-shelf tasks, trailing GPT-4 by only 2% despite using 7-8B parameters compared to GPT-4's 1,700B. Based on this, we propose the SusGen system, integrated with Retrieval-Augmented Generation (RAG), to assist in sustainability report generation. This work demonstrates the efficiency of our approach, advancing research in finance and ESG.

SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation

TL;DR

Abstract

SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)