Table of Contents
Fetching ...

Development of a Human-Robot Interaction Platform for Dual-Arm Robots Based on ROS and Multimodal Artificial Intelligence

Thanh Nguyen Canh, Ba Phuong Nguyen, Hong Quan Tran, Xiem HoangVan

TL;DR

This paper designed and developed a dual-arm robotic system capable of operating independently, performing industrial and service tasks while simultaneously simulating and modeling the robot in the ROS environment, and developed remote control software using voice commands through a natural language processing model.

Abstract

In this paper, we propose the development of an interactive platform between humans and a dual-arm robotic system based on the Robot Operating System (ROS) and a multimodal artificial intelligence model. Our proposed platform consists of two main components: a dual-arm robotic hardware system and software that includes image processing tasks and natural language processing using a 3D camera and embedded computing. First, we designed and developed a dual-arm robotic system with a positional accuracy of less than 2 cm, capable of operating independently, performing industrial and service tasks while simultaneously simulating and modeling the robot in the ROS environment. Second, artificial intelligence models for image processing are integrated to execute object picking and classification tasks with an accuracy of over 90%. Finally, we developed remote control software using voice commands through a natural language processing model. Experimental results demonstrate the accuracy of the multimodal artificial intelligence model and the flexibility of the dual-arm robotic system in interactive human environments.

Development of a Human-Robot Interaction Platform for Dual-Arm Robots Based on ROS and Multimodal Artificial Intelligence

TL;DR

This paper designed and developed a dual-arm robotic system capable of operating independently, performing industrial and service tasks while simultaneously simulating and modeling the robot in the ROS environment, and developed remote control software using voice commands through a natural language processing model.

Abstract

In this paper, we propose the development of an interactive platform between humans and a dual-arm robotic system based on the Robot Operating System (ROS) and a multimodal artificial intelligence model. Our proposed platform consists of two main components: a dual-arm robotic hardware system and software that includes image processing tasks and natural language processing using a 3D camera and embedded computing. First, we designed and developed a dual-arm robotic system with a positional accuracy of less than 2 cm, capable of operating independently, performing industrial and service tasks while simultaneously simulating and modeling the robot in the ROS environment. Second, artificial intelligence models for image processing are integrated to execute object picking and classification tasks with an accuracy of over 90%. Finally, we developed remote control software using voice commands through a natural language processing model. Experimental results demonstrate the accuracy of the multimodal artificial intelligence model and the flexibility of the dual-arm robotic system in interactive human environments.

Paper Structure

This paper contains 12 sections, 13 equations, 11 figures, 3 tables.

Figures (11)

  • Figure 1: Sơ đồ tổng quan hệ thống tay máy đôi
  • Figure 2: Vị trí các thiết bị trên khung Robot
  • Figure 3: Sơ đồ kết nối các linh kiện của robot
  • Figure 4: Hệ trục tọa độ của robot
  • Figure 5: Các bước tạo mô hình URDF từ mô hình Solidwork
  • ...and 6 more figures