Switch-JustDance: Benchmarking Whole Body Motion Tracking Policies Using a Commercial Console Game
Jeonghwan Kim, Wontaek Kim, Yidan Lu, Jin Cheng, Fatemeh Zargarbashi, Zicheng Zeng, Zekun Qi, Zhiyang Dou, Nitish Sontakke, Donghoon Baek, Sehoon Ha, Tianyu Li
TL;DR
Switch-JustDance introduces a low-cost, hardware-in-the-loop benchmark that uses the Nintendo Switch Just Dance game to evaluate whole-body humanoid controllers and compare them against human performers. The pipeline reconstructs human motion with GVHMR, retargets to a robot via GMR, and evaluates control policies through the game's Joy-Con-based scoring. The authors validate the Just Dance Score as a reliable, discriminative, and repeatable metric and benchmark three controllers (GMT, TWIST, Any2Track) on hardware and in simulation, revealing a persistent human–robot gap and insights into input-motion effects and sim-to-real transfer. The work provides a practical, reproducible framework for evaluating athletic full-body control in real-world conditions and highlights directions for improving robustness, motion reconstruction, and cross-embodiment benchmarking.
Abstract
Recent advances in whole-body robot control have enabled humanoid and legged robots to perform increasingly agile and coordinated motions. However, standardized benchmarks for evaluating these capabilities in real-world settings, and in direct comparison to humans, remain scarce. Existing evaluations often rely on pre-collected human motion datasets or simulation-based experiments, which limit reproducibility, overlook hardware factors, and hinder fair human-robot comparisons. We present Switch-JustDance, a low-cost and reproducible benchmarking pipeline that leverages motion-sensing console games, Just Dance on the Nintendo Switch, to evaluate robot whole-body control. Using Just Dance on the Nintendo Switch as a representative platform, Switch-JustDance converts in-game choreography into robot-executable motions through streaming, motion reconstruction, and motion retargeting modules and enables users to evaluate controller performance through the game's built-in scoring system. We first validate the evaluation properties of Just Dance, analyzing its reliability, validity, sensitivity, and potential sources of bias. Our results show that the platform provides consistent and interpretable performance measures, making it a suitable tool for benchmarking embodied AI. Building on this foundation, we benchmark three state-of-the-art humanoid whole-body controllers on hardware and provide insights into their relative strengths and limitations.
