CRMWeaver: Building Powerful Business Agent via Agentic RL and Shared Memories
Yilong Lai, Yipin Yang, Jialong Wu, Fengran Mo, Zhenglin Wang, Ting Liang, Jianguo Lin, Keping Yang
TL;DR
The paper tackles the challenge of building robust business agents capable of operating over complex, multi-relational enterprise data. It proposes CRMWeaver, a framework that combines synthetic data generation, a two-stage training pipeline (SFT followed by RL with DAPO), and a long-term memory module to reuse solution guidelines. Experiments on the CRMArena-Pro benchmark show that a lightweight Qwen-3-4B model achieves competitive B2B and B2C performance, with memory augmentation improving reliability and generalization. The approach reduces reliance on very large models while delivering practical capabilities for real-world business scenarios.
Abstract
Recent years have witnessed the rapid development of LLM-based agents, which shed light on using language agents to solve complex real-world problems. A prominent application lies in business agents, which interact with databases and internal knowledge bases via tool calls to fulfill diverse user requirements. However, this domain is characterized by intricate data relationships and a wide range of heterogeneous tasks, from statistical data queries to knowledge-based question-answering. To address these challenges, we propose CRMWeaver, a novel approach that enhances business agents in such complex settings. To acclimate the agentic model to intricate business environments, we employ a synthesis data generation and RL-based paradigm during training, which significantly improves the model's ability to handle complex data and varied tasks. During inference, a shared memories mechanism is introduced, prompting the agent to learn from task guidelines in similar problems, thereby further boosting its effectiveness and generalization, especially in unseen scenarios. We validate the efficacy of our approach on the CRMArena-Pro dataset, where our lightweight model achieves competitive results in both B2B and B2C business scenarios, underscoring its practical value for real-world applications.
