BioX-Bridge: Model Bridging for Unsupervised Cross-Modal Knowledge Transfer across Biosignals
Chenqi Li, Yu Liu, Timothy Denison, Tingting Zhu
TL;DR
BioX-Bridge addresses the challenge of unsupervised cross-modal knowledge transfer between biosignals under limited labeled data and large foundation models. It introduces a lightweight bridge network that aligns intermediate representations across modalities, using a two-stage bridge position selection and a prototype-based, low-rank projection to enable information flow from a new modality to an old modality. The training objective focuses on aligning the bridged representations with the old-modality representations, while only training the bridge parameters to minimize the alignment loss. Across ISRUC, FOG, and WESAD, BioX-Bridge achieves transfer performance comparable to or better than baselines while reducing trainable parameters by about 88–99%, demonstrating practical efficiency for modality-agnostic, resource-constrained biosignal applications.
Abstract
Biosignals offer valuable insights into the physiological states of the human body. Although biosignal modalities differ in functionality, signal fidelity, sensor comfort, and cost, they are often intercorrelated, reflecting the holistic and interconnected nature of human physiology. This opens up the possibility of performing the same tasks using alternative biosignal modalities, thereby improving the accessibility, usability, and adaptability of health monitoring systems. However, the limited availability of large labeled datasets presents challenges for training models tailored to specific tasks and modalities of interest. Unsupervised cross-modal knowledge transfer offers a promising solution by leveraging knowledge from an existing modality to support model training for a new modality. Existing methods are typically based on knowledge distillation, which requires running a teacher model alongside student model training, resulting in high computational and memory overhead. This challenge is further exacerbated by the recent development of foundation models that demonstrate superior performance and generalization across tasks at the cost of large model sizes. To this end, we explore a new framework for unsupervised cross-modal knowledge transfer of biosignals by training a lightweight bridge network to align the intermediate representations and enable information flow between foundation models and across modalities. Specifically, we introduce an efficient strategy for selecting alignment positions where the bridge should be constructed, along with a flexible prototype network as the bridge architecture. Extensive experiments across multiple biosignal modalities, tasks, and datasets show that BioX-Bridge reduces the number of trainable parameters by 88--99\% while maintaining or even improving transfer performance compared to state-of-the-art methods.
