Fast Tree-Field Integrators: From Low Displacement Rank to Topological Transformers
Krzysztof Choromanski, Arijit Sehanobish, Somnath Basu Roy Chowdhury, Han Lin, Avinava Dubey, Tamas Sarlos, Snigdha Chaturvedi
TL;DR
This work introduces Fast Tree-Field Integrators (FTFI), a class of exact polylog-linear algorithms for integrating tensor fields on weighted trees by exploiting low displacement rank (LDR) structure. By constructing IntegratorTrees (ITs) and leveraging cross-term decompositions, FTFI achieves exact $f$-distance matrix multiplications with strong complexity guarantees for a broad family of functions $f$ (including rational, polynomial, exponential, and exponentiated-quadratic), enabling efficient approximations of graph metrics via tree metrics and accelerating applications from mesh interpolation to Graph Classification and Topological Vision Transformers (TopViT). Empirically, FTFI delivers 5.7–13x speedups over brute-force methods on large graphs, and when integrated into TT masking mechanisms, yields notable accuracy gains with only a few extra trainable parameters per layer. The work further demonstrates learnable $f$-distance matrices to improve approximation quality and provides extensive experiments on vision and graph tasks, along with a thorough theoretical treatment of the method. Overall, FTFI offers a practical, theoretically grounded path to scalable graph- and transformer-based computation on large-scale structured data.
Abstract
We present a new class of fast polylog-linear algorithms based on the theory of structured matrices (in particular low displacement rank) for integrating tensor fields defined on weighted trees. Several applications of the resulting fast tree-field integrators (FTFIs) are presented, including (a) approximation of graph metrics with tree metrics, (b) graph classification, (c) modeling on meshes, and finally (d) Topological Transformers (TTs) (Choromanski et al., 2022) for images. For Topological Transformers, we propose new relative position encoding (RPE) masking mechanisms with as few as three extra learnable parameters per Transformer layer, leading to 1.0-1.5%+ accuracy gains. Importantly, most of FTFIs are exact methods, thus numerically equivalent to their brute-force counterparts. When applied to graphs with thousands of nodes, those exact algorithms provide 5.7-13x speedups. We also provide an extensive theoretical analysis of our methods.
