Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale

Tong Nie; Guoyang Qin; Lijun Sun; Wei Ma; Yu Mei; Jian Sun

Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale

Tong Nie, Guoyang Qin, Lijun Sun, Wei Ma, Yu Mei, Jian Sun

TL;DR

This work proposes an adapted version of MLP-Mixer, named NexuSQN, that can rival SOTA baselines when tested on several traffic benchmarks and contributes to the exploration of simple-yet-effective models for real-world STTD forecasting.

Abstract

Spatiotemporal traffic data (STTD) displays complex correlational structures. Extensive advanced techniques have been designed to capture these structures for effective forecasting. However, because STTD is often massive in scale, practitioners need to strike a balance between effectiveness and efficiency using computationally efficient models. An alternative paradigm based on multilayer perceptron (MLP) called MLP-Mixer has the potential for both simplicity and effectiveness. Taking inspiration from its success in other domains, we propose an adapted version, named NexuSQN, for STTD forecast at scale. We first identify the challenges faced when directly applying MLP-Mixers as seriesand window-wise multivaluedness. To distinguish between spatial and temporal patterns, the concept of ST-contextualization is then proposed. Our results surprisingly show that this simple-yeteffective solution can rival SOTA baselines when tested on several traffic benchmarks. Furthermore, NexuSQN has demonstrated its versatility across different domains, including energy and environment data, and has been deployed in a collaborative project with Baidu to predict congestion in megacities like Beijing and Shanghai. Our findings contribute to the exploration of simple-yet-effective models for real-world STTD forecasting.

Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale

TL;DR

Abstract

Paper Structure (58 sections, 17 equations, 8 figures, 20 tables)

This paper contains 58 sections, 17 equations, 8 figures, 20 tables.

Introduction
Preliminary and Related Work
Notations
MLP-Mixer
Related Work
ST-Contextualized MLP-Mixers
Overview of Architectural Components
Overview
Spatiotemporal Node Embedding
Flattened Dense Time-Mixers
Dimension Flattening and Projection
Time Mixing with Spatial Context
Structured Scalable Space-Mixers
Space Mixing with Temporal Context
Scalable Kernelized Contextualization
...and 43 more sections

Figures (8)

Figure 1: ST-ctx in urban traffic data: (a) If future series across locations differ but share a similar historical series, mixing time without knowing the location causes a series-wise multivaluedness. (b-c) The spatial context that distinguishes the locations is needed. (d) Even with spatial context enabled, if two windows with identical histories differ in the future, mixing space is window-wise indistinguishable. (e-f) The temporal context is necessary to disambiguate the multivaluedness.
Figure 2: Computational performance.
Figure 3: Illustration of spatial contextualization issue.
Figure 4: Temporal contextualization issue.
Figure 5: Robustness, transferability, and scalability of NexuSQN.
...and 3 more figures

Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale

TL;DR

Abstract

Contextualizing MLP-Mixers Spatiotemporally for Urban Data Forecast at Scale

Authors

TL;DR

Abstract

Table of Contents

Figures (8)