Table of Contents
Fetching ...

Label-Efficient Skeleton-based Recognition with Stable-Invertible Graph Convolutional Networks

Hichem Sahbi

TL;DR

This work tackles skeleton-based action recognition under limited labeled data by formulating a principled active-learning framework that designs informative exemplars via an objective balancing representativity, diversity and uncertainty. A stable, invertible graph convolutional network enables mapping data to a latent space with a tractable distribution, enabling efficient exemplar design and faithful reconstruction to the input space. The approach includes a two-block pipeline (display and learning) with two exemplar-design variants (ambient and latent space) and a four-term objective guiding exemplar selection; experiments on two challenging datasets show that the proposed method outperforms baselines and other display strategies, especially in low-label regimes. The proposed stable-invertible GCNs and latent-space sampling have practical implications for scalable skeleton-based recognition in labeling-scarce settings.

Abstract

Skeleton-based action recognition is a hotspot in image processing. A key challenge of this task lies in its dependence on large, manually labeled datasets whose acquisition is costly and time-consuming. This paper devises a novel, label-efficient method for skeleton-based action recognition using graph convolutional networks (GCNs). The contribution of the proposed method resides in learning a novel acquisition function -- scoring the most informative subsets for labeling -- as the optimum of an objective function mixing data representativity, diversity and uncertainty. We also extend this approach by learning the most informative subsets using an invertible GCN which allows mapping data from ambient to latent spaces where the inherent distribution of the data is more easily captured. Extensive experiments, conducted on two challenging skeleton-based recognition datasets, show the effectiveness and the outperformance of our label-frugal GCNs against the related work.

Label-Efficient Skeleton-based Recognition with Stable-Invertible Graph Convolutional Networks

TL;DR

This work tackles skeleton-based action recognition under limited labeled data by formulating a principled active-learning framework that designs informative exemplars via an objective balancing representativity, diversity and uncertainty. A stable, invertible graph convolutional network enables mapping data to a latent space with a tractable distribution, enabling efficient exemplar design and faithful reconstruction to the input space. The approach includes a two-block pipeline (display and learning) with two exemplar-design variants (ambient and latent space) and a four-term objective guiding exemplar selection; experiments on two challenging datasets show that the proposed method outperforms baselines and other display strategies, especially in low-label regimes. The proposed stable-invertible GCNs and latent-space sampling have practical implications for scalable skeleton-based recognition in labeling-scarce settings.

Abstract

Skeleton-based action recognition is a hotspot in image processing. A key challenge of this task lies in its dependence on large, manually labeled datasets whose acquisition is costly and time-consuming. This paper devises a novel, label-efficient method for skeleton-based action recognition using graph convolutional networks (GCNs). The contribution of the proposed method resides in learning a novel acquisition function -- scoring the most informative subsets for labeling -- as the optimum of an objective function mixing data representativity, diversity and uncertainty. We also extend this approach by learning the most informative subsets using an invertible GCN which allows mapping data from ambient to latent spaces where the inherent distribution of the data is more easily captured. Extensive experiments, conducted on two challenging skeleton-based recognition datasets, show the effectiveness and the outperformance of our label-frugal GCNs against the related work.

Paper Structure

This paper contains 8 sections, 2 theorems, 7 equations, 4 tables.

Key Result

Proposition 1

The optimality conditions of Eq. of leads to the solution as the fixed-point of being $\hat{\mu}^{(\tau+1)}$, $\hat{{\bf D}}^{(\tau+1)}$ respectively where ${\bf S}$ equates (with ${\bf D}^{(\tau)}$ written for short as ${\bf D}$) here ${\bf S}$ is a similarity matrix between ${\bf D}$ and ${\bf H}$, ${\bf 1}_N$ is a vector of $N$ ones, and $\textrm{\bf diag}$ maps a vector to a diagonal matrix

Theorems & Definitions (4)

  • Proposition 1
  • Definition 1: Stability
  • Proposition 2
  • proof : Sketch of the Proof (Proposition 2)