FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression

Yifei Gao; Yong Chen; Chen Zhang

TL;DR

FAME tackles function-on-function regression (FoFR) by learning a continuous, data-driven operator directly on irregularly sampled functional data. It combines a bidirectional continuous attention block, built on neural controlled differential equations (NCDEs) with a Mixture-of-Experts router, to capture intra-function dynamics, and a multi-head cross-attention module to fuse inter-function interactions, followed by an NCDE decoder that outputs continuous targets at arbitrary query points. The framework is proven to be Lipschitz-stable, sampling-invariant, and universally expressive, and empirical results show state-of-the-art accuracy on synthetic and real FoFR benchmarks together with robustness to irregular sampling and heterogeneity. Overall, FAME offers a principled, end-to-end approach that overcomes the limitations of fixed-basis and grid-discretization methods while enabling accurate, flexible function-to-function mappings in challenging settings.
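For orientation, the generic form of a neural controlled differential equation with a mixture-of-experts vector field can be written as below. This is a sketch of the standard NCDE construction, not the paper's exact parameterization: the symbols z (hidden state), X (a continuous interpolation of the discrete observations), f_k (expert vector fields), g_k (router weights), W (router matrix), h, d, and K are all assumed here for illustration.

```latex
% Hidden state driven by the interpolated input path X : [t_0, t_N] -> R^d
z(t) = z(t_0) + \int_{t_0}^{t} f_\theta\bigl(z(s)\bigr)\,\mathrm{d}X(s),
\qquad f_\theta : \mathbb{R}^{h} \to \mathbb{R}^{h \times d}

% Assumed mixture-of-experts vector field with a softmax router
f_\theta(z) = \sum_{k=1}^{K} g_k(z)\, f_k(z),
\qquad g(z) = \operatorname{softmax}(W z) \in \mathbb{R}^{K}
```

Because the integral is taken against the path X rather than against a fixed grid, the hidden state is defined at every t, which is what allows a model of this kind to accept irregular observation times and to be queried at arbitrary points. A bidirectional variant would additionally integrate a second state backward from t_N to t_0.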

Abstract

Functional data play a pivotal role across science and engineering, yet their infinite-dimensional nature makes representation learning challenging. Conventional statistical models depend on pre-chosen basis expansions or kernels, which limits the flexibility of data-driven discovery, while many deep-learning pipelines treat functions as fixed-grid vectors and ignore their inherent continuity. In this paper, we introduce Functional Attention with a Mixture-of-Experts (FAME), an end-to-end, fully data-driven framework for function-on-function regression. FAME forms continuous attention by coupling a bidirectional neural controlled differential equation with MoE-driven vector fields to capture intra-functional continuity, and further fuses inter-functional dependencies via multi-head cross-attention. Extensive experiments on synthetic and real-world functional-regression benchmarks show that FAME achieves state-of-the-art accuracy and strong robustness to arbitrarily sampled discrete observations of functions.
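To make the idea of an MoE-driven vector field on irregularly sampled data concrete, here is a minimal pure-Python sketch. It is not the authors' implementation: it replaces a proper ODE solver with a plain Euler discretization, uses random untrained weights, shares parameters between the forward and backward passes for brevity, and all names (`moe_field`, `encode`, `make_expert`, `HIDDEN`, `N_EXPERTS`) are hypothetical.

```python
import math
import random

def softmax(logits):
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]

def matvec(mat, vec):
    return [sum(w * v for w, v in zip(row, vec)) for row in mat]

def make_expert(rng, hidden, channels):
    # Hypothetical expert: one hidden x hidden weight matrix per control channel.
    return [[[rng.uniform(-0.5, 0.5) for _ in range(hidden)]
             for _ in range(hidden)] for _ in range(channels)]

def moe_field(z, experts, gate_w):
    """Return router weights and the blended (hidden x channels) vector field at z."""
    gates = softmax(matvec(gate_w, z))
    hidden, channels = len(z), len(experts[0])
    field = [[0.0] * channels for _ in range(hidden)]
    for g, expert in zip(gates, experts):
        for c, mat in enumerate(expert):
            out = [math.tanh(v) for v in matvec(mat, z)]
            for i in range(hidden):
                field[i][c] += g * out[i]
    return gates, field

def encode(times, values, experts, gate_w, z0):
    """Euler-discretized CDE: z <- z + f(z) dX, with control path X = (t, x)."""
    z = list(z0)
    for i in range(len(times) - 1):
        d_x = [times[i + 1] - times[i], values[i + 1] - values[i]]
        _, field = moe_field(z, experts, gate_w)
        z = [zi + sum(f * d for f, d in zip(row, d_x))
             for zi, row in zip(z, field)]
    return z

rng = random.Random(0)
HIDDEN, CHANNELS, N_EXPERTS = 4, 2, 3
experts = [make_expert(rng, HIDDEN, CHANNELS) for _ in range(N_EXPERTS)]
gate_w = [[rng.uniform(-0.5, 0.5) for _ in range(HIDDEN)]
          for _ in range(N_EXPERTS)]
z0 = [0.1] * HIDDEN

# Irregular observation times: the update uses increments of X, not a fixed grid.
times = sorted(rng.uniform(0.0, 1.0) for _ in range(12))
values = [math.sin(2 * math.pi * t) for t in times]

forward = encode(times, values, experts, gate_w, z0)
backward = encode(times[::-1], values[::-1], experts, gate_w, z0)
summary = forward + backward  # concatenated bidirectional summary of the function
print(len(summary))
```

The update depends only on increments of the path (t, x), so no resampling onto a common grid is needed. In a trained model the experts and router would be learned, and the forward and backward passes would typically have separate parameters.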
