QuCoWE Quantum Contrastive Word Embeddings with Variational Circuits for NearTerm Quantum Devices
Rabimba Karanjai, Hemanth Hegadehalli Madhavarao, Lei Xu, Weidong Shi
TL;DR
QuCoWE introduces a framework for learning quantum-native word embeddings using shallow, hardware-efficient PQCs trained with a contrastive skip-gram objective. By encoding words via data re-uploading and ring entanglement, and scoring similarity with a fidelity-based head and a logit-transformed fidelity head that aligns with PMI semantics, the method links quantum information measures to distributional semantics. An entanglement-budget regularizer based on single-qubit purities mitigates barren plateaus, and depolarizing/readout noise is analyzed with error-mitigation hooks. Experiments on Text8 and WikiText-2 show competitive intrinsic and extrinsic performance against 50–100D classical baselines while reducing learned parameters per token, suggesting near-term quantum devices can support viable semantic representations. The results highlight a practical, resource-aware path toward quantum NLP, with clear avenues for hardware deployment and extensions to contextualized embeddings.
Abstract
We present QuCoWE a framework that learns quantumnative word embeddings by training shallow hardwareefficient parameterized quantum circuits PQCs with a contrastive skipgram objective Words are encoded by datareuploading circuits with controlled ring entanglement similarity is computed via quantum state fidelity and passed through a logitfidelity head that aligns scores with the shiftedPMI scale of SGNSNoiseContrastive Estimation To maintain trainability we introduce an entanglementbudget regularizer based on singlequbit purity that mitigates barren plateaus On Text8 and WikiText2 QuCoWE attains competitive intrinsic WordSim353 SimLex999 and extrinsic SST2 TREC6 performance versus 50100d classical baselines while using fewer learned parameters per token All experiments are run in classical simulation we analyze depolarizingreadout noise and include errormitigation hooks zeronoise extrapolation randomized compiling to facilitate hardware deployment
