Acquiring Grounded Representations of Words with Situated Interactive Instruction
Shiwali Mohan, Aaron H. Mininger, James R. Kirk, John E. Laird
TL;DR
The paper tackles grounding word meanings across adjectives, nouns, prepositions, and verbs by enabling mixed-initiative, situated instruction with a human mentor. It implements the approach in the Soar cognitive architecture and demonstrates online, incremental learning on a table-top robot arm, integrating perception-derived features with semantic, episodic, and procedural knowledge. The method grounds words to perceptual symbols, spatial relations, and action concepts, enabling the agent to learn quickly from few examples and to initiate learning when needed. The study shows flexible human-agent collaboration, efficient acquisition across word classes, and potential for scalable grounding in real-world robotic manipulation tasks.
Abstract
We present an approach for acquiring grounded representations of words from mixed-initiative, situated interactions with a human instructor. The work focuses on the acquisition of diverse types of knowledge including perceptual, semantic, and procedural knowledge along with learning grounded meanings. Interactive learning allows the agent to control its learning by requesting instructions about unknown concepts, making learning efficient. Our approach has been instantiated in Soar and has been evaluated on a table-top robotic arm capable of manipulating small objects.
