Dual Goal Representations

Seohong Park; Deepinder Mann; Sergey Levine

TL;DR

The paper introduces dual goal representations for goal-conditioned RL (GCRL): a goal is represented by the set of temporal distances to it from all other states. The authors show that this representation depends only on the environment's intrinsic dynamics rather than on the original state representation, that it is sufficient to recover an optimal goal-reaching policy, and that it can filter out exogenous noise. A practical recipe learns a parameterized temporal distance approximator and feeds the resulting goal representation to a downstream offline GCRL algorithm, so the method can be combined with existing GCRL pipelines. On the OGBench suite, the approach improves offline goal-reaching performance across 20 state- and pixel-based tasks.

Abstract

In this work, we introduce dual goal representations for goal-conditioned reinforcement learning (GCRL). A dual goal representation characterizes a state by "the set of temporal distances from all other states"; in other words, it encodes a state through its relations to every other state, measured by temporal distance. This representation provides several appealing theoretical properties. First, it depends only on the intrinsic dynamics of the environment and is invariant to the original state representation. Second, it contains provably sufficient information to recover an optimal goal-reaching policy, while being able to filter out exogenous noise. Based on this concept, we develop a practical goal representation learning method that can be combined with any existing GCRL algorithm. Through diverse experiments on the OGBench task suite, we empirically show that dual goal representations consistently improve offline goal-reaching performance across 20 state- and pixel-based tasks.
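The core idea can be illustrated on a toy tabular problem. The sketch below is not the paper's learning method: it assumes a small deterministic environment with known transitions (the `TRANSITIONS` table and all function names are hypothetical), computes exact temporal distances by breadth-first search instead of learning an approximator, and uses the known transitions to read off a greedy policy. It only shows that the vector of distances from every state to a goal already carries enough information to reach that goal in this setting.

```python
from collections import deque

# Hypothetical 5-state deterministic environment: state -> {action: next_state}.
TRANSITIONS = {
    0: {"right": 1},
    1: {"left": 0, "right": 2},
    2: {"left": 1, "right": 3, "down": 4},
    3: {"left": 2},
    4: {"up": 2},
}


def temporal_distances_to(goal, transitions):
    """Minimum number of steps from each state to `goal` (BFS over reversed edges)."""
    reverse = {s: [] for s in transitions}
    for s, actions in transitions.items():
        for nxt in actions.values():
            reverse[nxt].append(s)
    dist = {goal: 0}
    queue = deque([goal])
    while queue:
        cur = queue.popleft()
        for prev in reverse[cur]:
            if prev not in dist:
                dist[prev] = dist[cur] + 1
                queue.append(prev)
    return dist


def dual_representation(goal, transitions):
    """Represent `goal` by its temporal distance from every state (inf if unreachable)."""
    dist = temporal_distances_to(goal, transitions)
    return [dist.get(s, float("inf")) for s in sorted(transitions)]


def greedy_action(state, goal_rep, transitions):
    """Pick the action whose next state is closest to the goal under `goal_rep`."""
    return min(transitions[state], key=lambda a: goal_rep[transitions[state][a]])


def rollout(start, goal, transitions, max_steps=10):
    """Follow the greedy policy using only the goal's dual representation."""
    goal_rep = dual_representation(goal, transitions)
    path = [start]
    state = start
    for _ in range(max_steps):
        if goal_rep[state] == 0:
            break
        state = transitions[state][greedy_action(state, goal_rep, transitions)]
        path.append(state)
    return path


if __name__ == "__main__":
    print(dual_representation(4, TRANSITIONS))
    print(rollout(0, 4, TRANSITIONS))
```

Note that the policy never looks at the goal's original label, only at its distance vector, which is why relabeling or re-encoding the states would leave the behavior unchanged in this toy setting.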
