OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare
Chen Long-fei, Subramanian Ramamoorthy, Robert B Fisher
TL;DR
OPPH addresses the need for reliable vision-based body motion estimation in healthcare by introducing a multi-stage operator that gates frame-level movement estimates with a binary motion-detection state derived from frame differences and a body mask. It integrates with both pose-based and optical-flow methods, notably RAFT, to suppress real-world noise and preserve long-term movement trends, achieving significant RMSE reductions on real-world motionless data (HuMoLs) and competitive gains on real-world and synthetic datasets (JHMDB, Surreal). The approach maintains high correlation with ground-truth motion over time ($r\approx$0.92--0.99 in key tests) and runs in real time (~$13.33$ fps) on standard hardware, making it suitable for crisis detection and chronic-condition monitoring in healthcare settings. Overall, OPPH offers a practical denoising and gating mechanism that enhances healthcare-oriented motion analysis without sacrificing active-motion accuracy, enabling robust long-term monitoring and timely intervention.
Abstract
Vision-based motion estimation methods show promise in accurately and unobtrusively estimating human body motion for healthcare purposes. However, these methods are not specifically designed for healthcare purposes and face challenges in real-world applications. Human pose estimation methods often lack the accuracy needed for detecting fine-grained, subtle body movements, while optical flow-based methods struggle with poor lighting conditions and unseen real-world data. These issues result in human body motion estimation errors, particularly during critical medical situations where the body is motionless, such as during unconsciousness. To address these challenges and improve the accuracy of human body motion estimation for healthcare purposes, we propose the OPPH operator designed to enhance current vision-based motion estimation methods. This operator, which considers human body movement and noise properties, functions as a multi-stage filter. Results tested on two real-world and one synthetic human motion dataset demonstrate that the operator effectively removes real-world noise, significantly enhances the detection of motionless states, maintains the accuracy of estimating active body movements, and maintains long-term body movement trends. This method could be beneficial for analyzing both critical medical events and chronic medical conditions.
