Intelligent Framework for Human-Robot Collaboration: Dynamic Ergonomics and Adaptive Decision-Making
Francesco Iodice, Elena De Momi, Arash Ajoudani
TL;DR
The paper tackles safety and ergonomics in industrial human-robot collaboration by proposing a unified framework that fuses advanced visual perception, non-invasive real-time ergonomic monitoring, and adaptive Behaviour Tree–based decision-making. It combines YOLO11 for object detection, UKF for tracking, OpenPose for 3D pose, SlowOnly for action recognition, and OWAS for continuous ergonomic assessment, all feeding a BT-driven allocator of robot and human roles. Experimental results demonstrate strong perception performance (mAP up to 72.4% at mAP@50:95), high intention recognition accuracy (92.5%), and rapider interventions (0.07 s BT reaction) with a 56% improvement over benchmarks in critical metrics. The framework’s non-invasive, modular design enables real-time adaptation to dynamic industrial tasks, promising enhanced operator safety and productivity; future work includes real-world deployment, multi-operator/multi-robot scenarios, and cross-domain extension.
Abstract
The integration of collaborative robots into industrial environments has improved productivity, but has also highlighted significant challenges related to operator safety and ergonomics. This paper proposes an innovative framework that integrates advanced visual perception, continuous ergonomic monitoring, and adaptive Behaviour Tree decision-making to overcome the limitations of traditional methods that typically operate as isolated components. Our approach synthesizes deep learning models, advanced tracking algorithms, and dynamic ergonomic assessments into a modular, scalable, and adaptive system. Experimental validation demonstrates the framework's superiority over existing solutions across multiple dimensions: the visual perception module outperformed previous detection models with 72.4% mAP@50:95; the system achieved high accuracy in recognizing operator intentions (92.5%); it promptly classified ergonomic risks with minimal latency (0.57 seconds); and it dynamically managed robotic interventions with exceptionally responsive decision-making capabilities (0.07 seconds), representing a 56% improvement over benchmark systems. This comprehensive solution provides a robust platform for enhancing human-robot collaboration in industrial environments by prioritizing ergonomic safety, operational efficiency, and real-time adaptability.
