Equivariant Scalar Fields for Molecular Docking with Fast Fourier Transforms
Bowen Jing, Tommi Jaakkola, Bonnie Berger
TL;DR
The paper tackles the bottleneck of expensive pose optimization in structure-based virtual screening by introducing a scoring function defined as the cross-correlation of SE(3)-equivariant scalar fields carried by protein and ligand graphs. It develops equivariant scalar-field networks (ESFs) and derives efficient FFT-based workflows over translations and rotations, enabling rapid, amortizable pose evaluation. Empirical results on decoy-pose scoring and rigid conformer docking show competitive accuracy with Gnina/Vina and superior robustness to predicted structures (ESFold) while achieving substantial runtime gains, including a 45–50x speedup on PDE10A when pocket-level amortization is exploited. The approach promises practical impact for high-throughput docking by enabling large-scale screening with modest resources, and it opens avenues for integration with refinement and torsion-aware strategies in full docking pipelines.
Abstract
Molecular docking is critical to structure-based virtual screening, yet the throughput of such workflows is limited by the expensive optimization of scoring functions involved in most docking algorithms. We explore how machine learning can accelerate this process by learning a scoring function with a functional form that allows for more rapid optimization. Specifically, we define the scoring function to be the cross-correlation of multi-channel ligand and protein scalar fields parameterized by equivariant graph neural networks, enabling rapid optimization over rigid-body degrees of freedom with fast Fourier transforms. The runtime of our approach can be amortized at several levels of abstraction, and is particularly favorable for virtual screening settings with a common binding pocket. We benchmark our scoring functions on two simplified docking-related tasks: decoy pose scoring and rigid conformer docking. Our method attains similar but faster performance on crystal structures compared to the widely-used Vina and Gnina scoring functions, and is more robust on computationally predicted structures. Code is available at https://github.com/bjing2016/scalar-fields.
