Instruction-based Time Series Editing

Jiaxing Qiu; Dongliang Guo; Brynne Sullivan; Teague R. Henry; Thomas Hartvigsen

Instruction-based Time Series Editing

Jiaxing Qiu, Dongliang Guo, Brynne Sullivan, Teague R. Henry, Thomas Hartvigsen

TL;DR

The paper tackles the rigidity of prior time series editors by introducing instruction-based editing, allowing natural-language guidance to steer edits while preserving unrelated characteristics. It proposes InstructTime, a multimodal editor that encodes time series and instructions into a shared hyperspherical space and uses interpolated decoding to control editing strength across multiple resolutions. The work demonstrates state-of-the-art editing quality, supports smooth interpolated editing, and offers few-shot tuning to adapt to unseen conditions, highlighting strong generalizability. These advances enable nuanced, hypothesis-driven edits in real-world contexts where textual notes accompany time series data, with potential applications in healthcare and beyond.

Abstract

In time series editing, we aim to modify some properties of a given time series without altering others. For example, when analyzing a hospital patient's blood pressure, we may add a sudden early drop and observe how it impacts their future while preserving other conditions. Existing diffusion-based editors rely on rigid, predefined attribute vectors as conditions and produce all-or-nothing edits through sampling. This attribute- and sampling-based approach limits flexibility in condition format and lacks customizable control over editing strength. To overcome these limitations, we introduce Instruction-based Time Series Editing, where users specify intended edits using natural language. This allows users to express a wider range of edits in a more accessible format. We then introduce InstructTime, the first instruction-based time series editor. InstructTime takes in time series and instructions, embeds them into a shared multi-modal representation space, then decodes their embeddings to generate edited time series. By learning a structured multi-modal representation space, we can easily interpolate between embeddings to achieve varying degrees of edit. To handle local and global edits together, we propose multi-resolution encoders. In our experiments, we use synthetic and real datasets and find that InstructTime is a state-of-the-art time series editor: InstructTime achieves high-quality edits with controllable strength, can generalize to unseen instructions, and can be easily adapted to unseen conditions through few-shot learning.

Instruction-based Time Series Editing

TL;DR

Abstract

Instruction-based Time Series Editing

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)