Large Language Model-based Human-Agent Collaboration for Complex Task Solving
Xueyang Feng, Zhi-Yuan Chen, Yujia Qin, Yankai Lin, Xu Chen, Zhiyuan Liu, Ji-Rong Wen
TL;DR
This work addresses the gap in fully autonomous LLM-based agents handling complex, dynamic tasks by introducing ReHAC, a reinforcement learning–driven framework that optimally times human interventions. By formulating human-agent collaboration as an MDP and training a dual-policy system with offline RL, ReHAC balances task performance with intervention costs. Empirical results across HotpotQA, StrategyQA, and InterCode show that ReHAC outperforms baselines and generalizes across datasets, with scalable policy models and GPT-4 simulations supporting broader applicability. The study also discusses extensions to multi-level collaboration, development-stage frameworks for LLM agents, and safety/alignment considerations for real-world deployment.
Abstract
In recent developments within the research community, the integration of Large Language Models (LLMs) in creating fully autonomous agents has garnered significant interest. Despite this, LLM-based agents frequently demonstrate notable shortcomings in adjusting to dynamic environments and fully grasping human needs. In this work, we introduce the problem of LLM-based human-agent collaboration for complex task-solving, exploring their synergistic potential. In addition, we propose a Reinforcement Learning-based Human-Agent Collaboration method, ReHAC. This approach includes a policy model designed to determine the most opportune stages for human intervention within the task-solving process. We construct a human-agent collaboration dataset to train this policy model in an offline reinforcement learning environment. Our validation tests confirm the model's effectiveness. The results demonstrate that the synergistic efforts of humans and LLM-based agents significantly improve performance in complex tasks, primarily through well-planned, limited human intervention. Datasets and code are available at: https://github.com/XueyangFeng/ReHAC.
