BiFingerPose: Bimodal Finger Pose Estimation for Touch Devices
Xiongjun Guan, Zhiyu Pan, Jianjiang Feng, Jie Zhou
TL;DR
This paper tackles the limits of single-modality finger pose estimation on touch devices by proposing BiFingerPose, a bimodal framework that fuses capacitive images and under-screen fingerprint patches to recover full 2D and 3D finger pose. It introduces a trig-based angular representation with distribution vectors and maps 2D pose to 3D pose via a polynomial UV-to-3D transformation, enabling robust 360° yaw without hardware changes. Extensive experiments and a 12-person user study show BiFingerPose surpasses state-of-the-art single-modality methods by over 21% in pose accuracy, while delivering 2.5x higher task efficiency and 23% better user operation accuracy. The approach is designed for practical deployment on devices with under-screen sensors and is supported by prototypes and public code, highlighting its potential to broaden touch-based interaction and security applications.
Abstract
Finger pose offers promising opportunities to expand human computer interaction capability of touchscreen devices. Existing finger pose estimation algorithms that can be implemented in portable devices predominantly rely on capacitive images, which are currently limited to estimating pitch and yaw angles and exhibit reduced accuracy when processing large-angle inputs (especially when it is greater than 45 degrees). In this paper, we propose BiFingerPose, a novel bimodal based finger pose estimation algorithm capable of simultaneously and accurately predicting comprehensive finger pose information. A bimodal input is explored, including a capacitive image and a fingerprint patch obtained from the touchscreen with an under-screen fingerprint sensor. Our approach leads to reliable estimation of roll angle, which is not achievable using only a single modality. In addition, the prediction performance of other pose parameters has also been greatly improved. The evaluation of a 12-person user study on continuous and discrete interaction tasks further validated the advantages of our approach. Specifically, BiFingerPose outperforms previous SOTA methods with over 21% improvement in prediction performance, 2.5 times higher task completion efficiency, and 23% better user operation accuracy, demonstrating its practical superiority. Finally, we delineate the application space of finger pose with respect to enhancing authentication security and improving interactive experiences, and develop corresponding prototypes to showcase the interaction potential. Our code will be available at https://github.com/XiongjunGuan/DualFingerPose.
