Human-Centric Open-Future Task Discovery: Formulation, Benchmark, and Scalable Tree-Based Search

Zijian Song; Xiaoxin Lin; Tao Pu; Zhenlong Yuan; Guangrun Wang; Liang Lin

TL;DR

This work formalizes Human-Centric Open-Future Task Discovery (HOTD) to identify tasks that reduce human effort across uncertain futures, introducing HOTD-Bench for real-world video-based evaluation and CMAST as a scalable, multi-agent search-tree framework. HOTD-Bench combines a simulation-based protocol and open-vocabulary labels to assess potential tasks beyond observed trajectories. CMAST decomposes complex reasoning across specialized agents and a structured search tree, enabling robust task discovery and integration with diverse LMMs, achieving superior Valid Task Ratio and competitive Valid Task Count. The combined approach advances anticipatory, human-aligned assistance in dynamic, open-ended environments and provides a scalable evaluation platform for future embodied AI systems.

Abstract

Recent progress in robotics and embodied AI is largely driven by Large Multimodal Models (LMMs). However, a key challenge remains underexplored: how can we advance LMMs to discover tasks that assist humans in open-future scenarios, where human intentions are highly concurrent and dynamic? In this work, we formalize the problem of Human-centric Open-future Task Discovery (HOTD), focusing particularly on identifying tasks that reduce human effort across plausible futures. To facilitate this study, we propose HOTD-Bench, which features over 2K real-world videos, a semi-automated annotation pipeline, and a simulation-based protocol tailored for open-set future evaluation. Additionally, we propose the Collaborative Multi-Agent Search Tree (CMAST) framework, which decomposes complex reasoning through a multi-agent system and structures the reasoning process through a scalable search tree module. In our experiments, CMAST achieves the best performance on HOTD-Bench, significantly surpassing existing LMMs. It also integrates well with existing LMMs, consistently improving their performance.
