Towards a Robust Soft Baby Robot With Rich Interaction Ability for Advanced Machine Learning Algorithms
Mohannad Alhakami, Dylan R. Ashley, Joel Dunham, Yanning Dai, Francesco Faccio, Eric Feron, Jürgen Schmidhuber
TL;DR
Problem: modern reinforcement learning in robotics requires platforms capable of long autonomous exploration with rich sensory input while tolerating sensor and actuation failures. Approach: a starfish-inspired, hybrid soft-hard robotic limb with camera-only sensing, high sensor redundancy, and replaceable failure points is built and validated in hardware; PPO and AC are tested under simulated camera failures to demonstrate robust learning. Contributions: a modular hardware design and fabrication workflow, a tendon-driven soft outer limb, camera-based sensing architecture, and empirical evidence of resilient RL under degraded sensing, complemented by open-source CAD, Gazebo models, and code. Significance: this work advances a practical path toward curiosity-driven learning and generally intelligent robots by decoupling learning from fragile hardware constraints.
Abstract
Advanced machine learning algorithms require platforms that are extremely robust and equipped with rich sensory feedback to handle extensive trial-and-error learning without relying on strong inductive biases. Traditional robotic designs, while well-suited for their specific use cases, are often fragile when used with these algorithms. To address this gap -- and inspired by the vision of enabling curiosity-driven baby robots -- we present a novel robotic limb designed from scratch. Our design has a hybrid soft-hard structure, high redundancy with rich non-contact sensors (exclusively cameras), and easily replaceable failure points. Proof-of-concept experiments using two contemporary reinforcement learning algorithms on a physical prototype demonstrate that our design is able to succeed in a simple target-finding task even under simulated sensor failures, all with minimal human oversight during extended learning periods. We believe this design represents a concrete step toward more tailored robotic designs for achieving general-purpose, generally intelligent robots.
