Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections
Lingjun Zhao, Khanh Nguyen, Hal Daumé
TL;DR
This work introduces HEAR, a system that detects hallucinations in language-guided navigation instructions and proposes corrections, paired with a user interface that highlights potential errors and reveals correction options on demand. By combining a hallucination detector and a hallucination-type classifier with synthetic data generation, HEAR can rank corrections and guide humans through long-horizon 3D navigation tasks even when instructions are imperfect. In experiments with 80 human participants, HEAR improves success rate and reduces final-location error, and user engagement with exploration prompts increases task persistence and performance. The results demonstrate that structured uncertainty communication can significantly boost human decision-making in sequential, vision-language tasks and offer a generalizable approach for robust AI-assisted navigation.
Abstract
Language models will inevitably err in situations with which they are unfamiliar. However, by effectively communicating uncertainties, they can still guide humans toward making sound decisions in those contexts. We demonstrate this idea by developing HEAR, a system that can successfully guide humans in simulated residential environments despite generating potentially inaccurate instructions. Diverging from systems that provide users with only the instructions they generate, HEAR warns users of potential errors in its instructions and suggests corrections. This rich uncertainty information effectively prevents misguidance and reduces the search space for users. Evaluation with 80 users shows that HEAR achieves a 13% increase in success rate and a 29% reduction in final location error distance compared to only presenting instructions to users. Interestingly, we find that offering users possibilities to explore, HEAR motivates them to make more attempts at the task, ultimately leading to a higher success rate. To our best knowledge, this work is the first to show the practical benefits of uncertainty communication in a long-horizon sequential decision-making problem.
