Foundation Models in Augmentative and Alternative Communication: Opportunities and Challenges
Ambra Di Paola, Serena Muraro, Roberto Marinelli, Christian Pilato
TL;DR
The paper investigates how foundation models can enhance augmentative and alternative communication (AAC) by enabling scalable, personalized content generation. It introduces AMBRA, an open platform that blends cloud and edge computing with federated learning and generative AI to produce tailored messages and symbols for AAC users, including a content tokenization and symbol-generation pipeline. The authors discuss opportunities such as highly personalized content and open collaboration, alongside challenges like privacy, standardization, and symbol simplicity, proposing a roadmap that emphasizes openness and cross-institutional sharing. The work aims to democratize AAC, reduce educator workload, and foster inclusive communication through AI-enabled, context-aware content creation.
Abstract
Augmentative and Alternative Communication (AAC) are essential techniques that help people with communication disabilities. AAC demonstrates its transformative power by replacing spoken language with symbol sequences. However, to unlock its full potential, AAC materials must adhere to specific characteristics, placing the onus on educators to create custom-tailored materials and symbols. This paper introduces AMBRA (Pervasive and Personalized Augmentative and Alternative Communication based on Federated Learning and Generative AI), an open platform that aims to leverage the capabilities of foundation models to tackle many AAC issues, opening new opportunities (but also challenges) for AI-enhanced AAC. We thus present a compelling vision--a roadmap towards a more inclusive society. By leveraging the capabilities of modern technologies, we aspire to not only transform AAC but also guide the way toward a world where communication knows no bounds.
