Compositional Phoneme Approximation for L1-Grounded L2 Pronunciation Training
Jisang Park, Minu Kim, DaYoung Hong, Jongha Lee
TL;DR
The paper tackles the problem that L2 learners map unfamiliar phonemes to their L1 categories, hindering efficient pronunciation training. It introduces compositional phoneme approximation (CPA), a feature-space, L1-grounded method that composes L2 phonemes from sequences of L1 phoneme features, with vowels built from two L1 vowels and consonants from base L1 segments plus context-driven changes. In a 10-minute training with 20 Korean English learners across 18 target items, CPA achieves a $76.0\%$ in-box formant rate, a $53.4\%$ phoneme-recognition accuracy, and about $80\%$ word-level nativeness in comparative judgments, indicating robust, rapid gains. These results suggest that leveraging L1 articulatory knowledge through CPA can accelerate early-stage L2 pronunciation learning, though extending to suprasegmentals and cross-script adaptation remains for future work.
Abstract
Learners of a second language (L2) often map non-native phonemes to similar native-language (L1) phonemes, making conventional L2-focused training slow and effortful. To address this, we propose an L1-grounded pronunciation training method based on compositional phoneme approximation (CPA), a feature-based representation technique that approximates L2 sounds with sequences of L1 phonemes. Evaluations with 20 Korean non-native English speakers show that CPA-based training achieves a 76% in-box formant rate in acoustic analysis, 17.6% relative improvement in phoneme recognition accuracy, and over 80% of speech being rated as more native-like, with minimal training. Project page: https://gsanpark.github.io/CPA-Pronunciation.
