Fostering Human Learning in Sequential Decision-Making: Understanding the Role of Evaluative Feedback
Piyush Gupta, Subir Biswas, Vaibhav Srivastava
TL;DR
This work addresses how AI-generated evaluative feedback affects human learning in sequential decision-making. By combining Tower of Hanoi experiments, maximum entropy inverse reinforcement learning, and multiple behavioral models, the study shows that evaluative feedback improves skill acquisition and transfer, while intermediate sub-goal guidance alone is insufficient. IRL reveals that feedback reorganizes the implicit reward landscape to emphasize target and critical states, and model comparison indicates that humans tend to update action-values in response to feedback, especially under sparse reward conditions. These findings inform the design of AI tutoring and IoT feedback mechanisms to enhance complex decision-making and learning efficiency.
Abstract
Cognitive rehabilitation, STEM (science, technology, engineering, and math) skill acquisition, and coaching in games such as chess often require tutoring in decision-making strategies. The advancement of AI-driven tutoring systems for facilitating human learning requires an understanding of the impact of evaluative feedback on human decision-making and skill development. To this end, we conduct human experiments using Amazon Mechanical Turk to study the influence of evaluative feedback on human decision-making in sequential tasks. In these experiments, participants solve the Tower of Hanoi puzzle and receive AI-generated feedback while solving it. We examine how this feedback affects their learning and skill transfer to related tasks. Additionally, treating humans as noisy optimal agents, we employ maximum entropy inverse reinforcement learning to analyze the effect of feedback on the implicit human reward structure that guides their decision-making. Lastly, we explore various computational models to understand how people incorporate evaluative feedback into their decision-making processes. Our findings underscore that humans perceive evaluative feedback as indicative of their long-term strategic success, thus aiding in skill acquisition and transfer in sequential decision-making tasks. Moreover, we demonstrate that evaluative feedback fosters a more structured and organized learning experience compared to learning without feedback. Furthermore, our results indicate that providing intermediate goals alone does not significantly enhance human learning outcomes.
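To make the task setting concrete, the following is a minimal sketch of the Tower of Hanoi as a sequential decision problem, together with one hypothetical way an evaluative signal could be computed. The abstract does not specify how the AI-generated feedback is produced, so the rule used here (positive if a move shortens the distance to the goal, negative otherwise) is an assumption for illustration only, as are the state encoding, the three-disk setting, and all function names.

```python
from collections import deque

N_DISKS = 3          # assumed puzzle size, for illustration only
PEGS = (0, 1, 2)

# A state is a tuple: state[d] is the peg holding disk d (disk 0 is the smallest).

def top_disk(state, peg):
    """Return the smallest disk on `peg`, or None if the peg is empty."""
    disks = [d for d, p in enumerate(state) if p == peg]
    return min(disks) if disks else None

def neighbors(state):
    """Yield every state reachable from `state` by one legal move."""
    for src in PEGS:
        disk = top_disk(state, src)
        if disk is None:
            continue
        for dst in PEGS:
            if dst == src:
                continue
            dst_top = top_disk(state, dst)
            # A disk may only be placed on an empty peg or on a larger disk.
            if dst_top is None or dst_top > disk:
                nxt = list(state)
                nxt[disk] = dst
                yield tuple(nxt)

def distances_to(goal):
    """Breadth-first search from the goal; every move is reversible,
    so this gives the minimum number of moves from each state to the goal."""
    dist = {goal: 0}
    queue = deque([goal])
    while queue:
        s = queue.popleft()
        for t in neighbors(s):
            if t not in dist:
                dist[t] = dist[s] + 1
                queue.append(t)
    return dist

def evaluative_feedback(state, next_state, dist):
    """Hypothetical feedback rule: +1 if the move gets closer to the goal, else -1."""
    return 1 if dist[next_state] < dist[state] else -1

start = (0,) * N_DISKS   # all disks on peg 0
goal = (2,) * N_DISKS    # all disks on peg 2
dist = distances_to(goal)
print(dist[start])                                   # 7, i.e. 2**3 - 1
print(evaluative_feedback(start, (2, 0, 0), dist))   # 1  (smallest disk to the goal peg)
print(evaluative_feedback(start, (1, 0, 0), dist))   # -1 (move does not shorten the path)
```

Under this assumed rule, the feedback reflects progress along the shortest solution path rather than an immediate task reward, which is one simple reading of feedback that signals long-term strategic success.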
