Dexterous Robotic Piano Playing at Scale
Le Chen, Yi Zhao, Jan Schneider, Quankai Gao, Simon Guist, Cheng Qian, Juho Kannala, Bernhard Schölkopf, Joni Pajarinen, Dieter Büchler
TL;DR
Dexterous robotic piano playing is hard due to high-dimensional, contact-rich dynamics. The paper introduces OmniPianist, a Flow Matching Transformer trained on RP1M++ with an Optimal Transport–based fingering mechanism that removes the need for human demonstrations. By combining large-scale specialist RL data and a flow-based imitation learner, OmniPianist achieves multi-piece proficiency and cross-embodiment generalization, reaching competitive performance on hundreds of pieces and strong zero-shot generalization. This work advances scalable, generalist dexterous manipulation and lays groundwork for real-robot deployment and richer musical expression.
Abstract
Endowing robot hands with human-level dexterity has been a long-standing goal in robotics. Bimanual robotic piano playing represents a particularly challenging task: it is high-dimensional, contact-rich, and requires fast, precise control. We present OmniPianist, the first agent capable of performing nearly one thousand music pieces via scalable, human-demonstration-free learning. Our approach is built on three core components. First, we introduce an automatic fingering strategy based on Optimal Transport (OT), allowing the agent to autonomously discover efficient piano-playing strategies from scratch without demonstrations. Second, we conduct large-scale Reinforcement Learning (RL) by training more than 2,000 agents, each specialized in distinct music pieces, and aggregate their experience into a dataset named RP1M++, consisting of over one million trajectories for robotic piano playing. Finally, we employ a Flow Matching Transformer to leverage RP1M++ through large-scale imitation learning, resulting in the OmniPianist agent capable of performing a wide range of musical pieces. Extensive experiments and ablation studies highlight the effectiveness and scalability of our approach, advancing dexterous robotic piano playing at scale.
