Diverse Intra- and Inter-Domain Activity Style Fusion for Cross-Person Generalization in Activity Recognition
Junru Zhang, Lang Feng, Zhidan Liu, Yuhan Wu, Yang He, Yabo Dong, Duanqing Xu
TL;DR
This paper tackles cross-person HAR generalization by addressing limited intra- and inter-domain style diversity in source data. It proposes domain padding, a diffusion-based approach that uses a learned activity style conditioner and a style-fused sampling strategy to generate diverse, class-preserving samples across and within domains. The method, DI2SDiff, leverages classifier-free guidance and random style combinations to expand the domain coverage, yielding state-of-the-art generalization on DSADS, PAMAP2, and USC-HAD with small training sets. Empirical results show substantial improvements over strong DG baselines, particularly on USC-HAD where sub-domain structure is challenging, and demonstrate that the generated data can boost other DG methods as a plug-in augmentation. The approach offers a data-efficient path to robust HAR systems, reducing the need for extensive data collection on edge devices while enabling better deployment on unseen users.
Abstract
Existing domain generalization (DG) methods for cross-person generalization tasks often face challenges in capturing intra- and inter-domain style diversity, resulting in domain gaps with the target domain. In this study, we explore a novel perspective to tackle this problem, a process conceptualized as domain padding. This proposal aims to enrich the domain diversity by synthesizing intra- and inter-domain style data while maintaining robustness to class labels. We instantiate this concept using a conditional diffusion model and introduce a style-fused sampling strategy to enhance data generation diversity. In contrast to traditional condition-guided sampling, our style-fused sampling strategy allows for the flexible use of one or more random styles to guide data synthesis. This feature presents a notable advancement: it allows for the maximum utilization of possible permutations and combinations among existing styles to generate a broad spectrum of new style instances. Empirical evaluations on a broad range of datasets demonstrate that our generated data achieves remarkable diversity within the domain space. Both intra- and inter-domain generated data have proven to be significant and valuable, contributing to varying degrees of performance enhancements. Notably, our approach outperforms state-of-the-art DG methods in all human activity recognition tasks.
