Fourier Analysis on Robustness of Graph Convolutional Neural Networks for Skeleton-based Action Recognition
Nariki Tanaka, Hiroshi Kera, Kazuhiko Kawamoto
TL;DR
This paper investigates the robustness of graph convolutional networks (GCNs) for skeleton-based action recognition under adversarial attacks and common corruptions. It introduces the Joint Fourier Transform (JFT), combining Graph Fourier Transform and Discrete Fourier Transform, to analyze frequency-domain robustness and to visualize sensitivity via Fourier heatmaps on the NTU RGB+D dataset. A key finding is that adversarial training does not induce a robustness trade-off with low-frequency perturbations as observed in CNNs, though Fourier analysis cannot fully explain vulnerability to skeletal part occlusion. The work advances understanding of GCN robustness for skeletal action recognition and highlights limitations of frequency-based explanations, pointing to future directions including alternative analyses and transformer-era approaches for robustness assessment.
Abstract
Using Fourier analysis, we explore the robustness and vulnerability of graph convolutional neural networks (GCNs) for skeleton-based action recognition. We adopt a joint Fourier transform (JFT), a combination of the graph Fourier transform (GFT) and the discrete Fourier transform (DFT), to examine the robustness of adversarially-trained GCNs against adversarial attacks and common corruptions. Experimental results with the NTU RGB+D dataset reveal that adversarial training does not introduce a robustness trade-off between adversarial attacks and low-frequency perturbations, which typically occurs during image classification based on convolutional neural networks. This finding indicates that adversarial training is a practical approach to enhancing robustness against adversarial attacks and common corruptions in skeleton-based action recognition. Furthermore, we find that the Fourier approach cannot explain vulnerability against skeletal part occlusion corruption, which highlights its limitations. These findings extend our understanding of the robustness of GCNs, potentially guiding the development of more robust learning methods for skeleton-based action recognition.
