S-RL Toolbox: Environments, Datasets and Evaluation Metrics for State Representation Learning
Antonin Raffin, Ashley Hill, René Traoré, Timothée Lesort, Natalia Díaz-Rodríguez, David Filliat
TL;DR
The paper tackles the lack of standardized evaluation for state representation learning in robotics by introducing the S-RL Toolbox, a suite of environments, data generators, tasks, metrics and visualization tools designed for fast, reproducible SRL benchmarking in RL. It surveys SRL approaches (auto-encoders, robotic priors, forward/inverse models, and their combinations) and demonstrates how these representations can be evaluated across mobile navigation and robotic-arm tasks using both qualitative visualizations and quantitative metrics like KNN-MSE, correlation and GTC. The framework integrates multiple RL algorithms (notably PPO) to quantify how representation quality affects learning performance, and provides detailed implementation guidance and datasets to facilitate replication. Overall, the toolbox enables rapid iteration, interpretability, and fair comparisons of SRL methods in robotics control, with demonstrated speed and scalability for large-scale evaluation.
Abstract
State representation learning aims at learning compact representations from raw observations in robotics and control applications. Approaches used for this objective are auto-encoders, learning forward models, inverse dynamics or learning using generic priors on the state characteristics. However, the diversity in applications and methods makes the field lack standard evaluation datasets, metrics and tasks. This paper provides a set of environments, data generators, robotic control tasks, metrics and tools to facilitate iterative state representation learning and evaluation in reinforcement learning settings.
