LLM-QFL: Distilling Large Language Model for Quantum Federated Learning
Dev Gurung, Shiva Raj Pokhrel
TL;DR
The paper tackles inefficiencies in quantum federated learning by introducing LLM-QFL, a framework that distills and federated-fines-tunes large language models within QFL. It leverages federated distillation, where locally fine-tuned LLMs guide quantum model optimization and act as reinforcement agents to regulate optimizer steps, client selection, and early termination, backed by KL-based distillation and a global aggregation rule. The authors prove a convergence rate of $\mathcal{O}(\frac{1}{T})$ under standard assumptions and demonstrate practical gains in communication efficiency and training speed, aided by parameter-efficient tuning via LoRA/QLoRA and adaptive optimization. Empirical validation spans genomic (DemoHumanOrWorm) and language (TweetEval) tasks on IBM QPU and simulators, showing substantial idle computation reductions (~30%) and improved convergence, while highlighting real-hardware noise and queueing effects. The work offers a principled, scalable route to combine LLMs with quantum learning, with potential impact on privacy-preserving, efficient distributed quantum AI.
Abstract
Inspired by the power of large language models (LLMs), our research adapts them to quantum federated learning (QFL) to boost efficiency and performance. We propose a federated fine-tuning method that distills an LLM within QFL, allowing each client to locally adapt the model to its own data while preserving privacy and reducing unnecessary global updates. The fine-tuned LLM also acts as a reinforcement agent, optimizing QFL by adjusting optimizer steps, cutting down communication rounds, and intelligently selecting clients. Experiments show significant efficiency gains. We pioneer a synergy between LLM and QFL, offering: i) practical efficiency: Reduced communication costs and faster convergence. ii) theoretical rigor: Provable guarantees for adaptive federated optimization. iii) scalability: PEFT methods (LoRA, QLoRA) enable deployment on resource-constrained quantum devices. Code implementation is available here 1.
