Hadamard product in deep learning: Introduction, Advances and Challenges
Grigorios G Chrysos, Yongtao Wu, Razvan Pascanu, Philip Torr, Volkan Cevher
TL;DR
The survey reframes the Hadamard product as a core architectural primitive in deep learning, highlighting its linear-cost capability to model nonlinear interactions and its potential to complement or replace heavier operators like self-attention in resource-constrained settings. It organizes existing work into four main domains—high-order interactions, multimodal fusion, adaptive modulation, and efficient pairwise operations—and connects them through a unifying view grounded in polynomial networks, gating, and masking. The authors also synthesize theoretical perspectives on expressivity, spectral bias, generalization, and robustness, alongside practical implementations and open problems, underscoring the Hadamard product’s broad applicability from edge devices to large language models. Overall, the paper argues that Hadamard-product-based primitives offer compelling trade-offs between efficiency and representational power, motivating future architectural innovations and cross-domain research.
Abstract
While convolution and self-attention mechanisms have dominated architectural design in deep learning, this survey examines a fundamental yet understudied primitive: the Hadamard product. Despite its widespread implementation across various applications, the Hadamard product has not been systematically analyzed as a core architectural primitive. We present the first comprehensive taxonomy of its applications in deep learning, identifying four principal domains: higher-order correlation, multimodal data fusion, dynamic representation modulation, and efficient pairwise operations. The Hadamard product's ability to model nonlinear interactions with linear computational complexity makes it particularly valuable for resource-constrained deployments and edge computing scenarios. We demonstrate its natural applicability in multimodal fusion tasks, such as visual question answering, and its effectiveness in representation masking for applications including image inpainting and pruning. This systematic review not only consolidates existing knowledge about the Hadamard product's role in deep learning architectures but also establishes a foundation for future architectural innovations. Our analysis reveals the Hadamard product as a versatile primitive that offers compelling trade-offs between computational efficiency and representational power, positioning it as a crucial component in the deep learning toolkit.
