Evaluating Human Trust in LLM-Based Planners: A Preliminary Study
Shenghui Chen, Yunhao Yang, Kayla Boggess, Seongkook Heo, Lu Feng, Ufuk Topcu
TL;DR
This study investigates how humans trust LLM-based planners relative to classical PDDL solvers in a gripper-domain planning task. By pairing subjective trust judgments with objective evaluation accuracy in a within-subject design (n=30) across four planner conditions, the authors show that correctness predominantly drives both accuracy and trust. Explanations provided with LLM plans can boost evaluation accuracy, but exert limited influence on trust, while plan refinement may increase trust without enhancing correctness, signaling potential overtrust under RLHF-driven outputs. The findings suggest design priorities for human-centered AI planning: prioritize correctness and calibrate trust carefully when deploying LLM-based planners, and explore how explanations and refinement mechanisms interact with user trust in real-world settings.
Abstract
Large Language Models (LLMs) are increasingly used for planning tasks, offering unique capabilities not found in classical planners such as generating explanations and iterative refinement. However, trust--a critical factor in the adoption of planning systems--remains underexplored in the context of LLM-based planning tasks. This study bridges this gap by comparing human trust in LLM-based planners with classical planners through a user study in a Planning Domain Definition Language (PDDL) domain. Combining subjective measures, such as trust questionnaires, with objective metrics like evaluation accuracy, our findings reveal that correctness is the primary driver of trust and performance. Explanations provided by the LLM improved evaluation accuracy but had limited impact on trust, while plan refinement showed potential for increasing trust without significantly enhancing evaluation accuracy.
