Table of Contents
Fetching ...

Multi-Head Adaptive Graph Convolution Network for Sparse Point Cloud-Based Human Activity Recognition

Vincent Gbouna Zakka, Luis J. Manso, Zhuangzhuang Dai

TL;DR

This work tackles privacy-friendly human activity recognition from sparse mmWave radar point clouds by introducing MAK-GCN, which uses a Multi-Head Adaptive Kernel to generate multiple dynamic filters per neighbourhood. The architecture progressively refines local features while preserving global spatial context across five stages, combining a graph-feature extractor, dynamic filtering, and global descriptors to boost discriminative power. Empirical results on MMActivity and MiliPoint show state-of-the-art accuracy (up to 97.54% and 99.12% in ablations) and demonstrate robust performance under real-world robotic deployment, including dark conditions. The approach promises practical impact for privacy-preserving, real-time activity monitoring in home robotics and elder-care scenarios.

Abstract

Human activity recognition is increasingly vital for supporting independent living, particularly for the elderly and those in need of assistance. Domestic service robots with monitoring capabilities can enhance safety and provide essential support. Although image-based methods have advanced considerably in the past decade, their adoption remains limited by concerns over privacy and sensitivity to low-light or dark conditions. As an alternative, millimetre-wave (mmWave) radar can produce point cloud data which is privacy-preserving. However, processing the sparse and noisy point clouds remains a long-standing challenge. While graph-based methods and attention mechanisms show promise, they predominantly rely on "fixed" kernels; kernels that are applied uniformly across all neighbourhoods, highlighting the need for adaptive approaches that can dynamically adjust their kernels to the specific geometry of each local neighbourhood in point cloud data. To overcome this limitation, we introduce an adaptive approach within the graph convolutional framework. Instead of a single shared weight function, our Multi-Head Adaptive Kernel (MAK) module generates multiple dynamic kernels, each capturing different aspects of the local feature space. By progressively refining local features while maintaining global spatial context, our method enables convolution kernels to adapt to varying local features. Experimental results on benchmark datasets confirm the effectiveness of our approach, achieving state-of-the-art performance in human activity recognition. Our source code is made publicly available at: https://github.com/Gbouna/MAK-GCN

Multi-Head Adaptive Graph Convolution Network for Sparse Point Cloud-Based Human Activity Recognition

TL;DR

This work tackles privacy-friendly human activity recognition from sparse mmWave radar point clouds by introducing MAK-GCN, which uses a Multi-Head Adaptive Kernel to generate multiple dynamic filters per neighbourhood. The architecture progressively refines local features while preserving global spatial context across five stages, combining a graph-feature extractor, dynamic filtering, and global descriptors to boost discriminative power. Empirical results on MMActivity and MiliPoint show state-of-the-art accuracy (up to 97.54% and 99.12% in ablations) and demonstrate robust performance under real-world robotic deployment, including dark conditions. The approach promises practical impact for privacy-preserving, real-time activity monitoring in home robotics and elder-care scenarios.

Abstract

Human activity recognition is increasingly vital for supporting independent living, particularly for the elderly and those in need of assistance. Domestic service robots with monitoring capabilities can enhance safety and provide essential support. Although image-based methods have advanced considerably in the past decade, their adoption remains limited by concerns over privacy and sensitivity to low-light or dark conditions. As an alternative, millimetre-wave (mmWave) radar can produce point cloud data which is privacy-preserving. However, processing the sparse and noisy point clouds remains a long-standing challenge. While graph-based methods and attention mechanisms show promise, they predominantly rely on "fixed" kernels; kernels that are applied uniformly across all neighbourhoods, highlighting the need for adaptive approaches that can dynamically adjust their kernels to the specific geometry of each local neighbourhood in point cloud data. To overcome this limitation, we introduce an adaptive approach within the graph convolutional framework. Instead of a single shared weight function, our Multi-Head Adaptive Kernel (MAK) module generates multiple dynamic kernels, each capturing different aspects of the local feature space. By progressively refining local features while maintaining global spatial context, our method enables convolution kernels to adapt to varying local features. Experimental results on benchmark datasets confirm the effectiveness of our approach, achieving state-of-the-art performance in human activity recognition. Our source code is made publicly available at: https://github.com/Gbouna/MAK-GCN

Paper Structure

This paper contains 25 sections, 22 equations, 3 figures, 7 tables.

Figures (3)

  • Figure 1: mmWave radar deployed on a Robotino robot for activity monitoring in both dark and illuminated conditions
  • Figure 2: Architecture of the proposed MAKGConvNet: A graph is constructed from input data for convolution via the MAK module, followed by a common graph convolution. The outputs are concatenated and passed to the classification module.
  • Figure 3: Human activity recognition with Robotino robot