When Less is More: Approximating the Quantum Geometric Tensor with Block Structures
Ahmedeo Shokry, Alessandro Santini, Filippo Vicentini
TL;DR
The paper tackles the high cost of inverting the quantum geometric tensor (QGT) in natural-gradient optimization of neural quantum states. It introduces a block-diagonal QGT that partitions the metric by network modules, preserving the curvature within each module while removing noisy cross-module correlations. This improves conditioning and scalability, yielding faster convergence, lower energies, and greater stability on Heisenberg and frustrated $J_1$-$J_2$ models. The results suggest a practical, scalable alternative to full stochastic reconfiguration (SR) that could extend to other differentiable scientific simulators and modular neural architectures.
Abstract
The natural gradient is central to the optimization of neural quantum states, but it is limited by the cost of computing and inverting the quantum geometric tensor, the quantum analogue of the Fisher information matrix. We introduce a block-diagonal quantum geometric tensor that partitions the metric by network layers, analogous to block-structured Fisher methods such as K-FAC. This layer-wise approximation preserves essential curvature while removing noisy cross-layer correlations, improving conditioning and scalability. Experiments on Heisenberg and frustrated $J_1$-$J_2$ models show faster convergence, lower energy, and improved stability.
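The layer-wise approximation described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`qgt`, `block_diagonal_solve`), the diagonal `shift` regularizer, the random real-valued log-derivatives, and the two-block parameter split are all assumptions chosen for illustration. The sketch estimates the QGT from centered log-derivatives, solves one small linear system per parameter block, and checks that the result matches solving with the full QGT after zeroing its off-block entries.

```python
import numpy as np

def qgt(O):
    """Sample estimate of the QGT from log-derivatives O, shape (n_samples, n_params)."""
    Oc = O - O.mean(axis=0, keepdims=True)  # center each parameter's log-derivative
    return Oc.conj().T @ Oc / O.shape[0]

def block_diagonal_solve(O, grad, blocks, shift=1e-4):
    """Solve (S_b + shift * I) delta_b = grad_b independently for each block b."""
    delta = np.zeros_like(grad)
    for idx in blocks:
        S_b = qgt(O[:, idx])  # only the intra-block part of the metric is formed
        delta[idx] = np.linalg.solve(S_b + shift * np.eye(len(idx)), grad[idx])
    return delta

# Hypothetical toy data: 6 real parameters split into two "layers" of sizes 2 and 4.
rng = np.random.default_rng(0)
n_samples, n_params, shift = 512, 6, 1e-4
O = rng.normal(size=(n_samples, n_params))
grad = rng.normal(size=n_params)
blocks = [np.arange(0, 2), np.arange(2, 6)]

delta_block = block_diagonal_solve(O, grad, blocks, shift)

# Reference: build the full QGT, zero the cross-block entries, and solve once.
S_full = qgt(O)
mask = np.zeros_like(S_full)
for idx in blocks:
    mask[np.ix_(idx, idx)] = 1.0
delta_ref = np.linalg.solve(S_full * mask + shift * np.eye(n_params), grad)

print(np.allclose(delta_block, delta_ref))
```

In this sketch, each block of size $n_b$ needs only an $n_b \times n_b$ solve, so the dense-solve cost scales as $\sum_b n_b^3$ rather than $(\sum_b n_b)^3$ for the full matrix, which is one way to read the scalability claim.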
