Applied Federated Learning: Improving Google Keyboard Query Suggestions
Timothy Yang, Galen Andrew, Hubert Eichner, Haicheng Sun, Wei Li, Nicholas Kong, Daniel Ramage, Françoise Beaufays
TL;DR
The paper demonstrates an end-to-end application of federated learning to train a triggering model on-device that filters Gboard query suggestions, improving click-through while preserving user privacy. It details a two-stage architecture with a server-trained baseline and an FL-trained triggering model, and it characterizes on-device training dynamics, including diurnal effects and deployment-skew. Live results show CTR improvements consistent with training predictions, underscoring FL's viability for production, privacy-preserving keyboard features. The work provides a practical blueprint for deploying FL in large-scale mobile systems and discusses debugging techniques without access to raw training data.
Abstract
Federated learning is a distributed form of machine learning where both the training data and model training are decentralized. In this paper, we use federated learning in a commercial, global-scale setting to train, evaluate and deploy a model to improve virtual keyboard search suggestion quality without direct access to the underlying user data. We describe our observations in federated training, compare metrics to live deployments, and present resulting quality increases. In whole, we demonstrate how federated learning can be applied end-to-end to both improve user experiences and enhance user privacy.
