Optimal Look-back Horizon for Time Series Forecasting in Federated Learning

Dahao Tang; Nan Yang; Yanli Li; Zhiyu Zhu; Zhibo Jin; Dong Yuan

TL;DR

This work presents a principled framework for adaptive horizon selection in federated time series forecasting (TSF) by embedding heterogeneous, non-IID client data into a geometry-preserving intrinsic space. Central to the approach is a Synthetic Data Generator (SDG) that captures autoregressive (AR) memory, seasonality, and trend, enabling a clean decomposition of the forecasting loss into a Bayesian (irreducible) term and an approximation term. The total loss is shown to be unimodal in the look-back horizon: its minimum lies at the smallest horizon at which the Bayesian loss saturates while the approximation loss is still growing, which yields a client-specific horizon criterion $H_k^*(\delta)$. A robust federated horizon aggregator (TrimMean) then derives a global horizon that remains effective across heterogeneous clients. Together, these results provide the first provable criterion for adaptive horizon selection in federated TSF and offer practical guidance for designing horizon-aware, privacy-preserving forecasting systems.
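As a rough illustration of the aggregation step, the sketch below computes a trimmed mean over client-reported horizons. It is a minimal sketch under stated assumptions, not the paper's TrimMean procedure: the function name `trim_mean_horizon`, the symmetric trimming rule, the `trim_frac` parameter, and the rounding to an integer horizon are all hypothetical choices made for this example.

```python
import math


def trim_mean_horizon(client_horizons, trim_frac=0.25):
    """Aggregate per-client horizons with a symmetric trimmed mean.

    Assumption (not from the source): a fraction `trim_frac` of the
    smallest and of the largest client horizons is discarded, and the
    mean of the remainder is rounded to the nearest integer.
    """
    if not client_horizons:
        raise ValueError("need at least one client horizon")
    if not 0.0 <= trim_frac < 0.5:
        raise ValueError("trim_frac must lie in [0, 0.5)")

    ordered = sorted(client_horizons)
    # Number of values dropped from each end of the sorted list.
    k = int(math.floor(trim_frac * len(ordered)))
    kept = ordered[k:len(ordered) - k]
    return int(round(sum(kept) / len(kept)))


# Hypothetical client horizons; one client reports an extreme value.
horizons = [12, 24, 24, 48, 48, 48, 96, 1000]
global_horizon = trim_mean_horizon(horizons, trim_frac=0.25)
```

In this example the extreme value 1000 is discarded before averaging, so a single atypical client cannot pull the global horizon far from what most clients need. That is the usual motivation for a trimmed mean over a plain mean.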

Abstract

Selecting an appropriate look-back horizon remains a fundamental challenge in time series forecasting (TSF), particularly in federated learning scenarios, where data is decentralized, heterogeneous, and often non-independent. While recent work has explored horizon selection by preserving forecasting-relevant information in an intrinsic space, these approaches are primarily restricted to centralized and independently distributed settings. This paper presents a principled framework for adaptive horizon selection in federated time series forecasting through an intrinsic space formulation. We introduce a synthetic data generator (SDG) that captures essential temporal structures in client data, including autoregressive dependencies, seasonality, and trend, while incorporating client-specific heterogeneity. Building on this model, we define a transformation that maps time series windows into an intrinsic representation space with well-defined geometric and statistical properties. We then derive a decomposition of the forecasting loss into a Bayesian term, which reflects irreducible uncertainty, and an approximation term, which accounts for finite-sample effects and limited model capacity. Our analysis shows that while increasing the look-back horizon improves the identifiability of deterministic patterns, it also increases approximation error due to higher model complexity and reduced sample efficiency. We prove that the total forecasting loss is minimized at the smallest horizon at which the irreducible loss starts to saturate while the approximation loss continues to rise. This work provides a rigorous theoretical foundation for adaptive horizon selection for time series forecasting in federated learning.
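The trade-off described in the abstract can be illustrated numerically. The sketch below assumes that, for a single client, estimates of the Bayesian loss (non-increasing in the horizon) and of the approximation loss (non-decreasing in the horizon) are already available on a grid of candidate horizons. The saturation rule used here, taking the smallest horizon whose Bayesian loss is within `delta` of its minimum over the grid, is an assumption made for illustration; the paper's exact definition of $H_k^*(\delta)$ is not reproduced here, and all numbers are made up.

```python
def smallest_saturating_horizon(horizons, bayes_loss, delta):
    """Return the smallest horizon whose Bayesian loss is within
    `delta` of the lowest Bayesian loss on the grid.

    Assumption (not from the source): saturation is measured against
    the minimum over the supplied grid.
    """
    if delta < 0:
        raise ValueError("delta must be non-negative")
    floor = min(bayes_loss)
    for h, loss in sorted(zip(horizons, bayes_loss)):
        if loss - floor <= delta:
            return h


def argmin_total_loss(horizons, bayes_loss, approx_loss):
    """Return the horizon that minimizes Bayesian + approximation loss."""
    totals = [b + a for b, a in zip(bayes_loss, approx_loss)]
    return horizons[totals.index(min(totals))]


# Hypothetical per-horizon loss estimates for one client.
horizons = [8, 16, 32, 64, 128]
bayes_loss = [1.00, 0.60, 0.42, 0.40, 0.40]    # saturates near H = 32
approx_loss = [0.02, 0.05, 0.10, 0.20, 0.40]   # keeps growing with H

h_star = smallest_saturating_horizon(horizons, bayes_loss, delta=0.05)
h_total = argmin_total_loss(horizons, bayes_loss, approx_loss)
```

With these illustrative numbers both rules select the same horizon, 32: beyond it the Bayesian loss barely improves, while the approximation loss keeps rising and pushes the total loss back up.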
