Query-based Knowledge Transfer for Heterogeneous Learning Environments
Norah Alballa, Wenxuan Zhang, Ziquan Liu, Ahmed M. Abdelmoniem, Mohamed Elhoseiny, Marco Canini
TL;DR
The paper addresses customized query-driven knowledge transfer in decentralized learning under data heterogeneity and privacy constraints. It introduces Query-based Knowledge Transfer (QKT), a data-free, two-phase framework that uses synthetic data-driven masks for query-focused distillation (Phase 1) and subsequent Classification Head Refinement (Phase 2) to mitigate forgetting. Empirical results on standard and clinical benchmarks show substantial improvements over federated, KD, and ensemble baselines, with average gains around 20.9 points for single-class and 14.3 points for multi-class queries, while reducing communication overhead. A lighter variant, QKT Light, offers further efficiency, and ablations highlight the critical roles of masking, head refinement, and two-phase stability for robust decentralized learning.
Abstract
Decentralized collaborative learning under data heterogeneity and privacy constraints has rapidly advanced. However, existing solutions like federated learning, ensembles, and transfer learning, often fail to adequately serve the unique needs of clients, especially when local data representation is limited. To address this issue, we propose a novel framework called Query-based Knowledge Transfer (QKT) that enables tailored knowledge acquisition to fulfill specific client needs without direct data exchange. QKT employs a data-free masking strategy to facilitate communication-efficient query-focused knowledge transfer while refining task-specific parameters to mitigate knowledge interference and forgetting. Our experiments, conducted on both standard and clinical benchmarks, show that QKT significantly outperforms existing collaborative learning methods by an average of 20.91\% points in single-class query settings and an average of 14.32\% points in multi-class query scenarios. Further analysis and ablation studies reveal that QKT effectively balances the learning of new and existing knowledge, showing strong potential for its application in decentralized learning.
