Table of Contents
Fetching ...

FIP: Endowing Robust Motion Capture on Daily Garment by Fusing Flex and Inertial Sensors

Jiawei Fang, Ruonan Zheng, Yuanyao, Xiaoxia Gao, Chengxu Zuo, Shihui Guo, Yiyue Luo

TL;DR

FIP advances clothes-based MoCap by fusing flex sensors with IMUs in everyday garments and addressing sensor displacements through a triad of components: a Displacement Latent Diffusion Model (DLDM) to synthesize inertial disturbances, a Physics-informed Calibrator (PIC) to correct flex-sensor primary displacement, and a Pose Fusion Predictor (PFP) to fuse multimodal readings. The approach is trained on simulated displacement data and augmented by diffusion-based sampling, then validated on real-device data showing significant improvements over state-of-the-art real-time IMU methods in angular, elbow, and positional errors. The system runs at 60 Hz and supports applications in VR/AR, rehabilitation, and fitness analysis, enabling robust, comfortable motion capture without camera-based infrastructure. This work lays the groundwork for scalable, ubiquitous MoCap from loose clothing by integrating sensor fusion, generative data synthesis, and physics-informed calibration to overcome displacement challenges.

Abstract

What if our clothes could capture our body motion accurately? This paper introduces Flexible Inertial Poser (FIP), a novel motion-capturing system using daily garments with two elbow-attached flex sensors and four Inertial Measurement Units (IMUs). To address the inevitable sensor displacements in loose wearables which degrade joint tracking accuracy significantly, we identify the distinct characteristics of the flex and inertial sensor displacements and develop a Displacement Latent Diffusion Model and a Physics-informed Calibrator to compensate for sensor displacements based on such observations, resulting in a substantial improvement in motion capture accuracy. We also introduce a Pose Fusion Predictor to enhance multimodal sensor fusion. Extensive experiments demonstrate that our method achieves robust performance across varying body shapes and motions, significantly outperforming SOTA IMU approaches with a 19.5% improvement in angular error, a 26.4% improvement in elbow angular error, and a 30.1% improvement in positional error. FIP opens up opportunities for ubiquitous human-computer interactions and diverse interactive applications such as Metaverse, rehabilitation, and fitness analysis.

FIP: Endowing Robust Motion Capture on Daily Garment by Fusing Flex and Inertial Sensors

TL;DR

FIP advances clothes-based MoCap by fusing flex sensors with IMUs in everyday garments and addressing sensor displacements through a triad of components: a Displacement Latent Diffusion Model (DLDM) to synthesize inertial disturbances, a Physics-informed Calibrator (PIC) to correct flex-sensor primary displacement, and a Pose Fusion Predictor (PFP) to fuse multimodal readings. The approach is trained on simulated displacement data and augmented by diffusion-based sampling, then validated on real-device data showing significant improvements over state-of-the-art real-time IMU methods in angular, elbow, and positional errors. The system runs at 60 Hz and supports applications in VR/AR, rehabilitation, and fitness analysis, enabling robust, comfortable motion capture without camera-based infrastructure. This work lays the groundwork for scalable, ubiquitous MoCap from loose clothing by integrating sensor fusion, generative data synthesis, and physics-informed calibration to overcome displacement challenges.

Abstract

What if our clothes could capture our body motion accurately? This paper introduces Flexible Inertial Poser (FIP), a novel motion-capturing system using daily garments with two elbow-attached flex sensors and four Inertial Measurement Units (IMUs). To address the inevitable sensor displacements in loose wearables which degrade joint tracking accuracy significantly, we identify the distinct characteristics of the flex and inertial sensor displacements and develop a Displacement Latent Diffusion Model and a Physics-informed Calibrator to compensate for sensor displacements based on such observations, resulting in a substantial improvement in motion capture accuracy. We also introduce a Pose Fusion Predictor to enhance multimodal sensor fusion. Extensive experiments demonstrate that our method achieves robust performance across varying body shapes and motions, significantly outperforming SOTA IMU approaches with a 19.5% improvement in angular error, a 26.4% improvement in elbow angular error, and a 30.1% improvement in positional error. FIP opens up opportunities for ubiquitous human-computer interactions and diverse interactive applications such as Metaverse, rehabilitation, and fitness analysis.

Paper Structure

This paper contains 46 sections, 9 equations, 17 figures, 3 tables.

Figures (17)

  • Figure 1: Comparison of related MoCap methods.
  • Figure 2: Prototype of the garment integrated with flex sensors, IMUs, and circuit board: (a-b) Front and back views of the garment, (c) IMU and circuit board positioned on the back, (d) IMU located at the waist, (e) Flexible sensors placed on the elbows and IMUs attached to the forearms.
  • Figure 3: Garment pattern design and sensor placement of our prototype. Blue: Flex Sensor; Yellow: IMU; Green: Wire.
  • Figure 4: The process of prototyping: (a) Assemble the sensors together by soldering; (b) Cut the fabric into pieces according to the patterns; (c) Integrate the assembled sensors into the fabric through heat pressing and sew the fabric into the garment.
  • Figure 5: Pipeline Overview. Left: For data preparation, we first utilize a simulation body-fabric model to synthesize the IMU Real-time Displacement. Then, for training a robust pose predictor, we train a Displacement Latent Diffusion Model (DLDM) to generate enough diverse data that covers real-world distribution. At last, we train a Pose Fusion Predictor leveraging simulated flex sensor data and generated IMU data, with the supervision of SMPL Pose loper2023smpl. Right: In our testing phase, flex sensor readings will be firstly input to our Physical-informed Calibrator to address the Primary Displacement, which will be input to the pre-trained Pose Fusion Predictor with IMU data.
  • ...and 12 more figures