Context-Aware Iteration Policy Network for Efficient Optical Flow Estimation
Ri Cheng, Ruian He, Xuhao Jiang, Shili Zhou, Weimin Tan, Bo Yan
TL;DR
This work tackles the inefficiency of fixed-iteration recurrent optical flow by introducing a context-aware iteration policy that assigns per-sample iteration budgets. It integrates a lightweight policy network with backbones like RAFT, GMA, FlowFormer, and KPA-Flow to decide whether to skip updates, guided by historical hidden state, iteration embedding, and an incremental loss that forecasts future improvements. The policy is controllable via a resource parameter $r$, enabling FLOP reductions of roughly $40\%$ on Sintel and $20\%$ on KITTI while maintaining accuracy. Ablation studies confirm the value of contextual cues and the incremental loss for adapting iterations to sample difficulty, making the approach practical for resource-constrained deployments.
Abstract
Existing recurrent optical flow estimation networks are computationally expensive because they use a fixed, large number of iterations to update the flow field for every sample. An efficient network should skip iterations when the flow improvement is limited. In this paper, we develop a Context-Aware Iteration Policy Network for efficient optical flow estimation, which determines the optimal number of iterations per sample. The policy network achieves this by learning contextual information that lets it recognize whether flow improvement is bottlenecked or minimal. On the one hand, we use the iteration embedding and the historical hidden cell, which carry information from previous iterations, to convey how the flow has changed so far. On the other hand, we use the incremental loss to make the policy network implicitly perceive the magnitude of optical flow improvement in the subsequent iteration. Furthermore, the computational complexity of our dynamic network is controllable, allowing us to satisfy various resource preferences with a single trained model. Our policy network can be easily integrated into state-of-the-art optical flow networks. Extensive experiments show that our method maintains performance while reducing FLOPs by about 40% on Sintel and about 20% on KITTI.
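The idea of skipping iterations once improvement becomes small can be illustrated with a toy sketch. This is not the paper's policy network: the scalar "flow", the halving update rule, the hand-written threshold policy, and the names `refine`, `toy_update`, and `threshold_policy` are all assumptions made for illustration. In the paper the decision is learned from the hidden state, the iteration embedding, and the incremental loss; here a fixed threshold on the last observed change stands in for that learned decision, and the threshold plays a role loosely analogous to a resource preference.

```python
from typing import Callable, Tuple

TARGET = 10.0  # hypothetical "true" value the toy refinement converges to


def toy_update(flow: float) -> float:
    """Stand-in for one recurrent update: move halfway toward the target."""
    return flow + 0.5 * (TARGET - flow)


def threshold_policy(threshold: float) -> Callable[[float, int], bool]:
    """Hand-written stand-in for a learned policy.

    Continues only while the most recent change exceeds the threshold.
    The iteration index is accepted but unused in this toy version.
    """
    def keep_going(last_change: float, iteration: int) -> bool:
        return last_change > threshold
    return keep_going


def refine(
    flow: float,
    update_step: Callable[[float], float],
    keep_going: Callable[[float, int], bool],
    max_iters: int = 32,
) -> Tuple[float, int]:
    """Run at most max_iters updates, stopping once the policy declines."""
    used = 0
    last_change = float("inf")  # no history yet, so the first step always runs
    for iteration in range(max_iters):
        if not keep_going(last_change, iteration):
            break
        new_flow = update_step(flow)
        last_change = abs(new_flow - flow)
        flow = new_flow
        used += 1
    return flow, used


# A strict threshold spends more iterations than a loose one.
flow_tight, used_tight = refine(0.0, toy_update, threshold_policy(0.1))
flow_loose, used_loose = refine(0.0, toy_update, threshold_policy(1.0))
print(used_tight, used_loose)
```

In this toy setting both runs stop well short of the 32-iteration cap, and loosening the threshold trades a small amount of accuracy for fewer updates, which mirrors the accuracy versus computation trade-off described above in spirit only.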
