DeepFeature: Iterative Context-aware Feature Generation for Wearable Biosignals
Kaiwei Liu, Yuting He, Bufang Yang, Mu Yuan, Chun Man Victor Wong, Ho Pong Andrew Sze, Zhenyu Yan, Hongkai Chen
TL;DR
DeepFeature tackles the challenge of deriving task-specific, context-aware features from multimodal wearable biosignals by integrating three feature-generation sources (direct LLM prompts, context-guided literature-informed prompts, and operator-based feature composition) with a robust, multi-layer feature-to-code translation and an iterative performance-driven refinement loop. The framework uses filtering and verification to ensure executable, reliable feature extraction, and employs ranking-based feature elimination plus model-performance feedback to progressively improve feature sets. Empirical results across eight healthcare tasks on four datasets show DeepFeature achieves an average AUROC improvement of 4.21-9.67% over baselines and outperforms state-of-the-art methods on five tasks, with ablations confirming each component’s contribution. The approach emphasizes offline feature engineering with acceptable system overhead, enabling robust, context-aware deployment on wearable biosignal platforms and offering a scalable path for diverse clinical applications.
Abstract
Biosignals collected from wearable devices are widely utilized in healthcare applications. Machine learning models used in these applications often rely on features extracted from biosignals due to their effectiveness, lower data dimensionality, and wide compatibility across various model architectures. However, existing feature extraction methods often lack task-specific contextual knowledge, struggle to identify optimal feature extraction settings in high-dimensional feature space, and are prone to code generation and automation errors. In this paper, we propose DeepFeature, the first LLM-empowered, context-aware feature generation framework for wearable biosignals. DeepFeature introduces a multi-source feature generation mechanism that integrates expert knowledge with task settings. It also employs an iterative feature refinement process that uses feature assessment-based feedback for feature re-selection. Additionally, DeepFeature utilizes a robust multi-layer filtering and verification approach for robust feature-to-code translation to ensure that the extraction functions run without crashing. Experimental evaluation results show that DeepFeature achieves an average AUROC improvement of 4.21-9.67% across eight diverse tasks compared to baseline methods. It outperforms state-of-the-art approaches on five tasks while maintaining comparable performance on the remaining tasks.
