SALAD: Smart AI Language Assistant Daily
Ragib Amin Nihal, Tran Dong Huu Quoc, Lin Zirui, Xu Yimimg, Liu Haoran, An Zhaoyi, Kyou Ma
TL;DR
The paper addresses foreigners' difficulties in learning Japanese and the inadequacy of conventional translators for language acquisition. It proposes SALAD, an AI-driven platform that integrates Kanji-Kana-Romaji translations, speech recognition, grammar explanations, vocabulary tracking, and lyrics-based song generation, leveraging tools like Whisper, gTTS, ChatGPT, and DiffSinger. The system architecture combines Translation, Vocabulary, Lyrics, and Song modules with dual UI implementations (Gradio web and PySide6 desktop) and a centralized progress database. Survey results indicate substantial perceived usefulness and potential to improve conversational fluency, while limitations include language pair scope and API dependency, suggesting directions for future extension.
Abstract
SALAD is an AI-driven language-learning application designed to help foreigners learn Japanese. It offers translations in Kanji-Kana-Romaji, speech recognition, translated audio, vocabulary tracking, grammar explanations, and songs generated from newly learned words. The app targets beginners and intermediate learners, aiming to make language acquisition more accessible and enjoyable. SALAD uses daily translations to enhance fluency and comfort in communication with native speakers. The primary objectives include effective Japanese language learning, user engagement, and progress tracking. A survey by us found that 39% of foreigners in Japan face discomfort in conversations with Japanese speakers. Over 60% of foreigners expressed confidence in SALAD's ability to enhance their Japanese language skills. The app uses large language models, speech recognition, and diffusion models to bridge the language gap and foster a more inclusive community in Japan.
