Drawback of Enforcing Equivariance and its Compensation via the Lens of Expressive Power
Yuzhu Chen, Tian Qin, Xinmei Tian, Fengxiang He, Dacheng Tao
TL;DR
This work theoretically analyzes how enforcing equivariance affects the expressive power of two-layer ReLU networks, showing that layer-wise equivariance can strictly limit expressivity in some regimes. It uses boundary hyperplanes and channel vectors to connect symmetry constraints to network geometry: GENs inherently require symmetric boundary hyperplanes, while layer-wise equivariant networks (LENs) enforce symmetric channel vectors. The authors prove that enlarging the model by a factor of |G|, the order of the symmetry group G, can compensate for the loss in expressivity and that, in some cases, the enlarged architecture still corresponds to a lower-complexity hypothesis space, suggesting better generalization. The findings offer principled guidance on when to use layer-wise equivariance, highlighting a trade-off between expressivity and model size with potential generalization benefits.
Abstract
Equivariant neural networks encode symmetry as an inductive bias and have achieved strong empirical performance across a wide range of domains. However, their expressive power remains poorly understood. Focusing on 2-layer ReLU networks, this paper investigates the impact of equivariance constraints on the expressivity of equivariant and layer-wise equivariant networks. By examining the boundary hyperplanes and the channel vectors of ReLU networks, we construct an example showing that equivariance constraints could strictly limit expressive power. However, we demonstrate that this drawback can be compensated for by enlarging the model size. Furthermore, we show that despite a larger model size, the resulting architecture could still correspond to a hypothesis space with lower complexity, implying superior generalizability for equivariant networks.
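To give some intuition for why a factor of |G| in width is a natural budget, the following is a minimal sketch of the standard group-averaging construction, which is not necessarily the argument used in the paper. It assumes the cyclic group C_n acting on R^n by shifts, and the names `cyclic_shift_matrices`, `two_layer`, and `symmetrize` are hypothetical helpers introduced only for this illustration. An arbitrary 2-layer ReLU network with m hidden units is averaged over the group, and the average is folded into a single 2-layer ReLU network with |G|·m hidden units that is equivariant at both the hidden layer and the output.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def cyclic_shift_matrices(n):
    """Permutation matrices of the cyclic group C_n acting on R^n by shifts."""
    eye = np.eye(n)
    return [np.roll(eye, k, axis=0) for k in range(n)]

def two_layer(x, W, b, A):
    """Two-layer ReLU network f(x) = A relu(W x + b)."""
    return A @ relu(W @ x + b)

def symmetrize(W, b, A, group):
    """Fold (1/|G|) * sum_g P_g^T f(P_g x) into one network with |G|*m hidden units."""
    W_big = np.vstack([W @ P for P in group])                 # (|G|*m, n)
    b_big = np.tile(b, len(group))                            # (|G|*m,)
    A_big = np.hstack([P.T @ A for P in group]) / len(group)  # (n, |G|*m)
    return W_big, b_big, A_big

rng = np.random.default_rng(0)
n, m = 4, 3                                  # input dimension, original width
group = cyclic_shift_matrices(n)             # |G| = n
W = rng.normal(size=(m, n))
b = rng.normal(size=m)
A = rng.normal(size=(n, m))
W_big, b_big, A_big = symmetrize(W, b, A, group)

x = rng.normal(size=n)
k = 1
P = group[k]

# Output equivariance: f_G(P x) == P f_G(x).
lhs = two_layer(P @ x, W_big, b_big, A_big)
rhs = P @ two_layer(x, W_big, b_big, A_big)

# Hidden-layer equivariance: shifting the input permutes the |G| blocks of hidden units.
h_x = relu(W_big @ x + b_big).reshape(len(group), m)
h_Px = relu(W_big @ (P @ x) + b_big).reshape(len(group), m)
hidden_ok = np.allclose(h_Px, np.roll(h_x, -k, axis=0))

print(np.allclose(lhs, rhs), hidden_ok, W_big.shape)
```

The widened network has |G|·m = 12 hidden units in this example. If the original network already computes an equivariant function, averaging leaves that function unchanged, so the wider layer-wise equivariant network represents it exactly.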
