Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Benjamin Eysenbach; Vivek Myers; Ruslan Salakhutdinov; Sergey Levine

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Benjamin Eysenbach, Vivek Myers, Ruslan Salakhutdinov, Sergey Levine

TL;DR

The paper tackles probabilistic inference on high‑dimensional time series by learning temporal contrastive representations whose marginal distribution is isotropic Gaussian and whose joint dynamics form a Gauss‑Markov chain. By introducing a parametrization with a learned matrix $A$, the authors show that future representations, intermediate waypoints, and full sequences admit closed‑form Gaussian posteriors, reducing planning and prediction to low‑dimensional matrix operations and even linear interpolation in special cases. Theoretical results are complemented by numerical experiments on synthetic spirals, mazes, and high‑dimensional robotic tasks (39D and 46D), demonstrating accurate inference and substantial planning gains over baselines. The work offers a practical, scalable route to inference‑driven planning in high‑dimensional time series without reconstruction, with potential impact in robotics, control, and financial time series.

Abstract

Given time series data, how can we answer questions like "what will happen in the future?" and "how did we get here?" These sorts of probabilistic inference questions are challenging when observations are high-dimensional. In this paper, we show how these questions can have compact, closed form solutions in terms of learned representations. The key idea is to apply a variant of contrastive learning to time series data. Prior work already shows that the representations learned by contrastive learning encode a probability ratio. By extending prior work to show that the marginal distribution over representations is Gaussian, we can then prove that joint distribution of representations is also Gaussian. Taken together, these results show that representations learned via temporal contrastive learning follow a Gauss-Markov chain, a graphical model where inference (e.g., prediction, planning) over representations corresponds to inverting a low-dimensional matrix. In one special case, inferring intermediate representations will be equivalent to interpolating between the learned representations. We validate our theory using numerical simulations on tasks up to 46-dimensions.

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

TL;DR

, the authors show that future representations, intermediate waypoints, and full sequences admit closed‑form Gaussian posteriors, reducing planning and prediction to low‑dimensional matrix operations and even linear interpolation in special cases. Theoretical results are complemented by numerical experiments on synthetic spirals, mazes, and high‑dimensional robotic tasks (39D and 46D), demonstrating accurate inference and substantial planning gains over baselines. The work offers a practical, scalable route to inference‑driven planning in high‑dimensional time series without reconstruction, with potential impact in robotics, control, and financial time series.

Abstract

Paper Structure (33 sections, 6 theorems, 34 equations, 9 figures)

This paper contains 33 sections, 6 theorems, 34 equations, 9 figures.

Introduction
Related Work
Representations for time-series data.
Contrastive Learning.
Goal-oriented decision making.
Planning.
Preliminaries
Contrastive learning.
Key assumptions
Contrastive Representations Make Inference Easy
A Parametrization for Shared Encoders
Representations Encode a Predictive Model
Planning over One Intermediate State
Example 1:
Example 2:
...and 18 more sections

Key Result

lemma 1

Under the assumptions from sec:prelims, the distribution over representations of future states follows a Gaussian distribution with mean parameter given by the initial state representation:

Figures (9)

Figure 1: We apply temporal contrastive learning to observation pairs to obtain representations ($\psi(x_0), \psi(x_{t+k})$) such that $A \psi(x_0)$ is close to $\psi(x_{t+k})$. While inferring waypoints in the high-dimensional observation space is challenging, we show that the distribution over intermediate latent representations has a closed form solution corresponding to linear interpolation between the initial and final representations.
Figure 2: A parametrization for temporal contrastive learning.
Figure 3: Predicting representations of future states.
Figure 4: Numerical simulation of our analysis.(Top Left) Toy dataset of time-series data consisting of many outwardly-spiraling trajectories. We apply temporal contrastive learning to these data. (Top Right) For three initial observations ($\blacksquare$), we use the learned representations to predict the distribution over future observations. Note that these distributions correctly capture the spiral structure. (Bottom Left) For three observations ($\star$), we use the learned representations to predict the distribution over preceding observations. (Bottom Right) Given an initial and final observation, we plot the inferred posterior distribution over the waypoint (\ref{['sec:planning']}). The representations capture the shape of the distribution.
Figure 5: Using inferred paths over our contrastive representations for control boosts success rates by $4.5\times$ on the most difficult goals ($18\% \rightarrow 84\%$). Alternative representation learning techniques fail to improve performance when used for planning.
...and 4 more figures

Theorems & Definitions (12)

lemma 1
theorem 1
theorem 2
lemma 2
proof
proof
proof
proof
lemma 3
proof : Proof of \ref{['lemma:full_infonce']}
...and 2 more

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

TL;DR

Abstract

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (12)