Towards Diverse Perspective Learning with Selection over Multiple Temporal Poolings
Jihyeon Seong, Jungmin Kim, Jaesik Choi
TL;DR
This paper tackles the challenge that no single temporal pooling method universally captures the temporal structure of time series data. It proposes SoM-TP, an attention-based selection mechanism over multiple temporal poolings (GTP, STP, DTP) within a single classifier, augmented by a Diverse Perspective Learning Network (DPLN) and a perspective loss to regularize learning across pooling perspectives. The approach enables non-iterative, batch-wise pooling selection inspired by Multiple Choice Learning, and is complemented by LRP-based analysis to demonstrate diverse perspective learning. Empirical results on extensive UCR/UEA benchmarks show SoM-TP surpasses traditional pooling methods and many state-of-the-art TSC models, with robust performance and informative attribution patterns, underscoring the value of pooling-level ensemble in time series classification.
Abstract
In Time Series Classification (TSC), temporal pooling methods that consider sequential information have been proposed. However, we found that each temporal pooling has a distinct mechanism, and can perform better or worse depending on time series data. We term this fixed pooling mechanism a single perspective of temporal poolings. In this paper, we propose a novel temporal pooling method with diverse perspective learning: Selection over Multiple Temporal Poolings (SoM-TP). SoM-TP dynamically selects the optimal temporal pooling among multiple methods for each data by attention. The dynamic pooling selection is motivated by the ensemble concept of Multiple Choice Learning (MCL), which selects the best among multiple outputs. The pooling selection by SoM-TP's attention enables a non-iterative pooling ensemble within a single classifier. Additionally, we define a perspective loss and Diverse Perspective Learning Network (DPLN). The loss works as a regularizer to reflect all the pooling perspectives from DPLN. Our perspective analysis using Layer-wise Relevance Propagation (LRP) reveals the limitation of a single perspective and ultimately demonstrates diverse perspective learning of SoM-TP. We also show that SoM-TP outperforms CNN models based on other temporal poolings and state-of-the-art models in TSC with extensive UCR/UEA repositories.
