Federated Learning for Emoji Prediction in a Mobile Keyboard
Swaroop Ramaswamy, Rajiv Mathews, Kanishka Rao, Françoise Beaufays
TL;DR
This work tackles emoji prediction in mobile keyboards under privacy constraints by deploying a CIFG-LSTM model trained with federated learning. It combines language-model pretraining, a dedicated triggering mechanism, and diversification to balance accuracy, latency, and emoji variety, achieving superior top-1 accuracy in federated evaluations and strong live-traffic gains. Deployments use low-latency inference with quantized TensorFlow Lite and demonstrate practical privacy-preserving training for real-world NLU tasks. Overall, the study demonstrates that federated learning can yield production-quality, on-device models despite sparse, imbalanced data and strict latency requirements.
Abstract
We show that a word-level recurrent neural network can predict emoji from text typed on a mobile keyboard. We demonstrate the usefulness of transfer learning for predicting emoji by pretraining the model using a language modeling task. We also propose mechanisms to trigger emoji and tune the diversity of candidates. The model is trained using a distributed on-device learning framework called federated learning. The federated model is shown to achieve better performance than a server-trained model. This work demonstrates the feasibility of using federated learning to train production-quality models for natural language understanding tasks while keeping users' data on their devices.
