CoGen: Learning from Feedback with Coupled Comprehension and Generation
Mustafa Omer Gul, Yoav Artzi
TL;DR
This work investigates coupling language comprehension and generation in a continual learning setting where a single model interacts with humans in two-player reference games. By combining joint inference and data sharing, and training with feedback signals via a contextual-bandit REINFORCE objective, the system achieves substantial, time-evolving gains in both comprehension and generation, while producing language that aligns more closely with human discourse. The approach leverages a pragmatic, RSA-inspired inference stance and introduces data-sharing across roles to inject human language into generator training, resulting in improved data efficiency and more diverse, human-like utterances. The findings demonstrate a viable pathway for interactive AI that learns from user interactions, with practical implications for scalable, human-aligned language systems.
Abstract
Systems with both language comprehension and generation capabilities can benefit from the tight connection between the two. This work studies coupling comprehension and generation with focus on continually learning from interaction with users. We propose techniques to tightly integrate the two capabilities for both learning and inference. We situate our studies in two-player reference games, and deploy various models for thousands of interactions with human users, while learning from interaction feedback signals. We show dramatic improvements in performance over time, with comprehension-generation coupling leading to performance improvements up to 26% in absolute terms and up to 17% higher accuracies compared to a non-coupled system. Our analysis also shows coupling has substantial qualitative impact on the system's language, making it significantly more human-like.
