KITINet: Kinetics Theory Inspired Network Architectures with PDE Simulation Approaches
Mingquan Feng, Yifan Fu, Tongcheng Zhang, Yu Jiang, Yixin Huang, Junchi Yan
TL;DR
KITINet reframes residual learning as kinetic particle dynamics governed by the Boltzmann transport equation and simulated via a Direct Simulation Monte Carlo–style collision mechanism. It introduces a collision-based residual module with divisional particle representations, demonstrating that training induces network parameter condensation and yields improvements across PDE operator learning, image classification, and NLP tasks with negligible FLOP overhead. Empirical results show consistent gains over ResNet and Transformer baselines, including notable improvements when embedded with FNO and OFormer for PDE solving and strong performance on CIFAR and BERT-based NLP tasks. The work offers a principled, physics-informed approach to architecture design, linking non-equilibrium dynamics to sparsity and generalization, while also outlining limitations and future scaling opportunities to larger benchmarks.
Abstract
Despite the widely recognized success of residual connections in modern neural networks, their design principles remain largely heuristic. This paper introduces KITINet (Kinetics Theory Inspired Network), a novel architecture that reinterprets feature propagation through the lens of non-equilibrium particle dynamics and partial differential equation (PDE) simulation. At its core, we propose a residual module that models feature updates as the stochastic evolution of a particle system, numerically simulated via a discretized solver for the Boltzmann transport equation (BTE). This formulation mimics particle collisions and energy exchange, enabling adaptive feature refinement via physics-informed interactions. Additionally, we reveal that this mechanism induces network parameter condensation during training, where parameters progressively concentrate into a sparse subset of dominant channels. Experiments on scientific computation (PDE operator), image classification (CIFAR-10/100), and text classification (IMDb/SNLI) show consistent improvements over classic network baselines, with negligible increase of FLOPs.
