Universal Domain Adaptation Benchmark for Time Series Data Representation
Romain Mussard, Fannia Pacheco, Maxime Berar, Gilles Gasso, Paul Honeine
TL;DR
The paper tackles the underexplored problem of Universal Domain Adaptation for time-series data by introducing a standardized benchmark and protocol to evaluate TS backbones and UniDA methods under domain shifts. It systematically compares six UniDA methods across four backbones (including CNN and Fourier-based FNO) using a Bayesian-model-selection approach guided by the H-score $(2A_C A_U)/(A_C + A_U)$. Key findings show backbone choice profoundly influences UniDA performance, with CNN and FNO delivering the strongest results, while newer architectures like TSLANet and S3 often underperform in this setting; UniJDOT emerges as the most robust method across datasets and backbones. The work provides a practical framework and insights to drive future development of TS-tailored UniDA backbones and evaluation protocols, aiding researchers and practitioners in assessing robustness and generalization for time-series adaptation tasks.
Abstract
Deep learning models have significantly improved the ability to detect novelties in time series (TS) data. This success is attributed to their strong representation capabilities. However, due to the inherent variability in TS data, these models often struggle with generalization and robustness. To address this, a common approach is to perform Unsupervised Domain Adaptation, particularly Universal Domain Adaptation (UniDA), to handle domain shifts and emerging novel classes. While extensively studied in computer vision, UniDA remains underexplored for TS data. This work provides a comprehensive implementation and comparison of state-of-the-art TS backbones in a UniDA framework. We propose a reliable protocol to evaluate their robustness and generalization across different domains. The goal is to provide practitioners with a framework that can be easily extended to incorporate future advancements in UniDA and TS architectures. Our results highlight the critical influence of backbone selection in UniDA performance and enable a robustness analysis across various datasets and architectures.
