Distillation-guided Representation Learning for Unconstrained Gait Recognition

Yuxiang Guo; Siyuan Huang; Ram Prabhakar; Chun Pong Lau; Rama Chellappa; Cheng Peng

Distillation-guided Representation Learning for Unconstrained Gait Recognition

Yuxiang Guo, Siyuan Huang, Ram Prabhakar, Chun Pong Lau, Rama Chellappa, Cheng Peng

TL;DR

This work proposes a framework, termed GAit DEtection and Recognition (GADER), for human authentication in challenging outdoor scenarios that leverages a Double Helical Signature to detect segments that contain human movement and builds discriminative features through a novel gait recognition method, where only frames containing gait information are used.

Abstract

Gait recognition holds the promise of robustly identifying subjects based on walking patterns instead of appearance information. While previous approaches have performed well for curated indoor data, they tend to underperform in unconstrained situations, e.g. in outdoor, long distance scenes, etc. We propose a framework, termed GAit DEtection and Recognition (GADER), for human authentication in challenging outdoor scenarios. Specifically, GADER leverages a Double Helical Signature to detect segments that contain human movement and builds discriminative features through a novel gait recognition method, where only frames containing gait information are used. To further enhance robustness, GADER encodes viewpoint information in its architecture, and distills representation from an auxiliary RGB recognition model, which enables GADER to learn from silhouette and RGB data at training time. At test time, GADER only infers from the silhouette modality. We evaluate our method on multiple State-of-The-Arts(SoTA) gait baselines and demonstrate consistent improvements on indoor and outdoor datasets, especially with a significant 25.2% improvement on unconstrained, remote gait data.

Distillation-guided Representation Learning for Unconstrained Gait Recognition

TL;DR

Abstract

Paper Structure (19 sections, 6 equations, 6 figures, 6 tables)

This paper contains 19 sections, 6 equations, 6 figures, 6 tables.

Introduction
Related Work
Gait Representations
Gait Recognition
Method
Gait Detector
Double Helical Signature
Light-weighted Classification
GAit Recognition (GAR)
Ratio Attention
Cross Modality Distillation
Loss functions
Experiments
Datasets and Metrics
Quantitative Evaluation
...and 4 more sections

Figures (6)

Figure 1: For each baseline, GADER raises the Rank-1 accuracy on BRIAR significantly, especially over 25% at 500m. Close Range(CR), Unmanned Aerial Vehicle(UAV).
Figure 2: Overview of end-to-end pipeline. GADER consists of two parts: gait detection and GAR. The gait detector utilizes gait representation to filter out segments without gait information or incomplete body which will be processed by a body recognition algorithm, and only frames with human movement are fed to GAR. GAR leverages ratio attention and RGB feature space to extract a more robust silhouette feature for recognition.
Figure 3: Four cases of DHS(a-d) are shown using two variables: full/part - indicates whether the body is complete, and stand/gait - shows whether gait information is present.
Figure 4: The gait detection process. The gait detector processes split DHS segments to obtain prediction followed by Non-Maximum Suppression (NMS) and concatenation to pinpoint frames where gait information is present in a sequence. The yellow bounding boxes indicate the frames that contain gait.
Figure 5: Some examples showing the absence of human movement in videos appearing in the curated dataset. We record the dataset name and corresponding subject id of the sequence.
...and 1 more figures

Distillation-guided Representation Learning for Unconstrained Gait Recognition

TL;DR

Abstract

Distillation-guided Representation Learning for Unconstrained Gait Recognition

Authors

TL;DR

Abstract

Table of Contents

Figures (6)