Dingtalk DeepResearch: A Unified Multi Agent Framework for Adaptive Intelligence in Enterprise Environments
Mengyuan Chen, Chengjun Dai, Xinyang Dong, Chengzhe Feng, Kewei Fu, Jianshe Li, Zhihan Peng, Yongqi Tong, Junshao Zhang, Hong Zhu
TL;DR
The paper tackles the challenge of enterprise-scale information synthesis that requires deep research, cross-document reasoning, and multimodal report generation. It proposes a unified, three-layer, multi-agent framework (Agent Studio, Core, Data Layer) with DingAutoEvaluator to enable evaluation-driven continuous optimization and secure workflow integration in real-world settings. A central contribution is the large-scale documentary reinforcement learning pipeline (Doc-RM, SFT for structured formats, static and live RL, online DPO) plus a structure-aware tabular reasoning engine (NL2SQL with robust retrieval) guided by an automated online evaluation flywheel. The framework demonstrates practical impact through production deployment, end-to-end showcases, and forthcoming service availability, enabling adaptive, multi-modal enterprise intelligence within the Dingtalk ecosystem.
Abstract
We present Dingtalk DeepResearch, a unified multi agent intelligence framework for real world enterprise environments, delivering deep research, heterogeneous table reasoning, and multimodal report generation.
