Table of Contents
Fetching ...

Optimal Look-back Horizon for Time Series Forecasting in Federated Learning

Dahao Tang, Nan Yang, Yanli Li, Zhiyu Zhu, Zhibo Jin, Dong Yuan

TL;DR

This work presents a principled framework for adaptive horizon selection in federated time series forecasting by embedding heterogeneous, non-IID client data into a geometry-preserving intrinsic space. Central to the approach is a Synthetic Data Generator that captures AR memory, seasonality, and trend, enabling a clean decomposition of forecasting loss into a Bayesian (irreducible) term and an approximation term. The total loss is shown to be unimodal in the look-back horizon, with the optimum at the smallest horizon that saturates the Bayesian loss while the approximation cost grows, yielding a client-specific horizon criterion $H_k^*(\delta)$. A robust federated horizon aggregator (TrimMean) then derives a global horizon that remains effective across heterogeneous clients. Together, these results provide the first provable criterion for adaptive horizon selection in federated TSF and offer practical guidance for designing horizon-aware, privacy-preserving forecasting systems.

Abstract

Selecting an appropriate look-back horizon remains a fundamental challenge in time series forecasting (TSF), particularly in the federated learning scenarios where data is decentralized, heterogeneous, and often non-independent. While recent work has explored horizon selection by preserving forecasting-relevant information in an intrinsic space, these approaches are primarily restricted to centralized and independently distributed settings. This paper presents a principled framework for adaptive horizon selection in federated time series forecasting through an intrinsic space formulation. We introduce a synthetic data generator (SDG) that captures essential temporal structures in client data, including autoregressive dependencies, seasonality, and trend, while incorporating client-specific heterogeneity. Building on this model, we define a transformation that maps time series windows into an intrinsic representation space with well-defined geometric and statistical properties. We then derive a decomposition of the forecasting loss into a Bayesian term, which reflects irreducible uncertainty, and an approximation term, which accounts for finite-sample effects and limited model capacity. Our analysis shows that while increasing the look-back horizon improves the identifiability of deterministic patterns, it also increases approximation error due to higher model complexity and reduced sample efficiency. We prove that the total forecasting loss is minimized at the smallest horizon where the irreducible loss starts to saturate, while the approximation loss continues to rise. This work provides a rigorous theoretical foundation for adaptive horizon selection for time series forecasting in federated learning.

Optimal Look-back Horizon for Time Series Forecasting in Federated Learning

TL;DR

This work presents a principled framework for adaptive horizon selection in federated time series forecasting by embedding heterogeneous, non-IID client data into a geometry-preserving intrinsic space. Central to the approach is a Synthetic Data Generator that captures AR memory, seasonality, and trend, enabling a clean decomposition of forecasting loss into a Bayesian (irreducible) term and an approximation term. The total loss is shown to be unimodal in the look-back horizon, with the optimum at the smallest horizon that saturates the Bayesian loss while the approximation cost grows, yielding a client-specific horizon criterion . A robust federated horizon aggregator (TrimMean) then derives a global horizon that remains effective across heterogeneous clients. Together, these results provide the first provable criterion for adaptive horizon selection in federated TSF and offer practical guidance for designing horizon-aware, privacy-preserving forecasting systems.

Abstract

Selecting an appropriate look-back horizon remains a fundamental challenge in time series forecasting (TSF), particularly in the federated learning scenarios where data is decentralized, heterogeneous, and often non-independent. While recent work has explored horizon selection by preserving forecasting-relevant information in an intrinsic space, these approaches are primarily restricted to centralized and independently distributed settings. This paper presents a principled framework for adaptive horizon selection in federated time series forecasting through an intrinsic space formulation. We introduce a synthetic data generator (SDG) that captures essential temporal structures in client data, including autoregressive dependencies, seasonality, and trend, while incorporating client-specific heterogeneity. Building on this model, we define a transformation that maps time series windows into an intrinsic representation space with well-defined geometric and statistical properties. We then derive a decomposition of the forecasting loss into a Bayesian term, which reflects irreducible uncertainty, and an approximation term, which accounts for finite-sample effects and limited model capacity. Our analysis shows that while increasing the look-back horizon improves the identifiability of deterministic patterns, it also increases approximation error due to higher model complexity and reduced sample efficiency. We prove that the total forecasting loss is minimized at the smallest horizon where the irreducible loss starts to saturate, while the approximation loss continues to rise. This work provides a rigorous theoretical foundation for adaptive horizon selection for time series forecasting in federated learning.

Paper Structure

This paper contains 35 sections, 7 theorems, 71 equations, 1 figure.

Key Result

Theorem 1

For each client $k \in \{1,\dots,K\}$, let $(U_k, V_k)$ denote its data-generating pair, where $U_k$ takes values in a measurable input space $\mathcal{M}(H)$ and $V_k$ in an output space $\mathcal{M}(S)$, both embedded in a real Hilbert space $(\mathcal{H},\|\cdot\|)$ with the associated Borel $\si where $\pi=(\pi_1,\dots,\pi_K)$ is any distribution over clients and the inner expectation is over

Figures (1)

  • Figure 1: Comparison between real-world data and data generated by the SDG. The close alignment indicates that the SDG effectively captures the patterns present in real data.

Theorems & Definitions (21)

  • Theorem 1: Federated Loss Decomposition
  • Theorem 2: Client-wise Bayesian Loss
  • Remark 1
  • Lemma 1: Server-level Bayesian Loss Aggregation
  • Theorem 3: Client-wise Approximation Loss
  • Lemma 2: Server-level Approximation Loss Aggregation
  • Remark 2
  • Theorem 4: Unimodality and Optimal Horizon
  • proof
  • Corollary 1: Coverage–Tolerance Mapping
  • ...and 11 more