Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent

Xiaofeng Wang; Zhixin Zhang; Jinguang Zheng; Yiming Ai; Rui Wang

Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent

Xiaofeng Wang, Zhixin Zhang, Jinguang Zheng, Yiming Ai, Rui Wang

TL;DR

This work addresses automating debt collection negotiations (DCN) with large language models (LLMs) by constructing a synthetic 975-record DCN dataset and a 13-metric evaluation framework to assess both dialogue quality and financial outcomes. It reveals that vanilla LLMs tend to concede excessively and struggle with decision rationality, prompting the design of the Multi-Agent Debt Negotiation (MADeN) framework, which adds Planning and Judging modules to improve strategy and evaluation. The authors also explore post-training approaches, including Direct Preference Optimization with rejection sampling, demonstrating that MADeN and DPO-MAG can substantially improve debt recovery, collection efficiency, and debtor health relative to baseline LLMs. Together, these contributions advance AI-assisted DCN and provide a benchmark for future research on autonomous negotiation in finance.

Abstract

Debt collection negotiations (DCN) are vital for managing non-performing loans (NPLs) and reducing creditor losses. Traditional methods are labor-intensive, while large language models (LLMs) offer promising automation potential. However, prior systems lacked dynamic negotiation and real-time decision-making capabilities. This paper explores LLMs in automating DCN and proposes a novel evaluation framework with 13 metrics across 4 aspects. Our experiments reveal that LLMs tend to over-concede compared to human negotiators. To address this, we propose the Multi-Agent Debt Negotiation (MADeN) framework, incorporating planning and judging modules to improve decision rationality. We also apply post-training techniques, including DPO with rejection sampling, to optimize performance. Our studies provide valuable insights for practitioners and researchers seeking to enhance efficiency and outcomes in this domain.

Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent

TL;DR

Abstract

Debt Collection Negotiations with Large Language Models: An Evaluation System and Optimizing Decision Making with Multi-Agent

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)