Quantum Pointwise Convolution: A Flexible and Scalable Approach for Neural Network Enhancement
An Ning, Tai-Yue Li, Nan-Yow Chen
TL;DR
The paper addresses the limitation of linear channel interactions in classical pointwise convolution by introducing Quantum Pointwise Convolution (QPC), a hybrid quantum-classical approach that uses amplitude encoding and strongly entangling circuit blocks to capture nonlinear cross-channel relationships. The method generates multiple feature maps per quantum kernel and leverages weight sharing to reduce parameter count, enabling end-to-end training with a parameter-shift gradient. Empirical results on FashionMNIST and CIFAR10 show that QPC-based models can achieve competitive or superior performance with fewer parameters than their classical counterparts, with deeper quantum layers yielding the best gains. This work suggests that quantum kernels can enhance CNNs' expressiveness in a resource-efficient manner, offering a viable path for integrating quantum circuits into mainstream CNN architectures on near-term hardware.
Abstract
In this study, we propose a novel architecture, the Quantum Pointwise Convolution, which incorporates pointwise convolution within a quantum neural network framework. Our approach leverages the strengths of pointwise convolution to efficiently integrate information across feature channels while adjusting channel outputs. By using quantum circuits, we map data to a higher-dimensional space, capturing more complex feature relationships. To address the current limitations of quantum machine learning in the Noisy Intermediate-Scale Quantum (NISQ) era, we implement several design optimizations. These include amplitude encoding for data embedding, allowing more information to be processed with fewer qubits, and a weight-sharing mechanism that accelerates quantum pointwise convolution operations, reducing the need to retrain for each input pixels. In our experiments, we applied the quantum pointwise convolution layer to classification tasks on the FashionMNIST and CIFAR10 datasets, where our model demonstrated competitive performance compared to its classical counterpart. Furthermore, these optimizations not only improve the efficiency of the quantum pointwise convolutional layer but also make it more readily deployable in various CNN-based or deep learning models, broadening its potential applications across different architectures.
