Multi-face emotion detection for effective Human-Robot Interaction
Mohamed Ala Yahyaoui, Mouaad Oujabour, Leila Ben Letaifa, Amine Bohi
TL;DR
The paper tackles real-time multi-face emotion detection for human-robot interaction on resource-constrained mobile humanoid robots. It conducts a comparative study of transfer-learned FER models trained on FER2013 and selects EfficientNetV2-B0 for deployment on the Tiago++ robot due to its favorable accuracy-to-footprint balance, while employing Haarcascade for multi-face detection and a Tkinter/OpenCV GUI for visualization. The system is integrated with ROS to enable on-board face tracking and emotion-driven GUI display on the robot’s tablet, demonstrating real-time performance with single and multi-person scenarios within a ~139 MB model budget. The work highlights the practical viability of compact FER on robots while acknowledging the limitations of facial cues alone and proposing multimodal extensions and further optimization for robust HRI applications.
Abstract
The integration of dialogue interfaces in mobile devices has become ubiquitous, providing a wide array of services. As technology progresses, humanoid robots designed with human-like features to interact effectively with people are gaining prominence, and the use of advanced human-robot dialogue interfaces is continually expanding. In this context, emotion recognition plays a crucial role in enhancing human-robot interaction by enabling robots to understand human intentions. This research proposes a facial emotion detection interface integrated into a mobile humanoid robot, capable of displaying real-time emotions from multiple individuals on a user interface. To this end, various deep neural network models for facial expression recognition were developed and evaluated under consistent computer-based conditions, yielding promising results. Afterwards, a trade-off between accuracy and memory footprint was carefully considered to effectively implement this application on a mobile humanoid robot.
