Controlling Large Language Model with Latent Actions

Chengxing Jia; Ziniu Li; Pengyuan Wang; Yi-Chen Li; Zhenyu Hou; Yuxiao Dong; Yang Yu

Controlling Large Language Model with Latent Actions

Chengxing Jia, Ziniu Li, Pengyuan Wang, Yi-Chen Li, Zhenyu Hou, Yuxiao Dong, Yang Yu

TL;DR

CoLA introduces a latent-action framework to control large language models with a compact action space learned via an inverse dynamics model. The architecture couples a language world model with a discrete latent action codebook and a policy, enabling efficient RL and improved downstream performance while preserving the base model's capabilities. Experiments on math reasoning, agent tasks, and diverse prompts show higher semantic diversity, stronger math performance ($42.4$ on math500 vs $38.2$) and a peak $68.2$ with MCTS variants, along with improved robustness to reward hacking. The results suggest latent-action control offers a scalable path to more controllable, sample-efficient RL-based adaptation of LLMs for practical applications.

Abstract

Adapting Large Language Models (LLMs) to downstream tasks using Reinforcement Learning (RL) has proven to be an effective approach. However, LLMs do not inherently define the structure of an agent for RL training, particularly in terms of defining the action space. This paper studies learning a compact latent action space to enhance the controllability and exploration of RL for LLMs. We propose Controlling Large Language Models with Latent Actions (CoLA), a framework that integrates a latent action space into pre-trained LLMs. We apply CoLA to the Llama-3.1-8B model. Our experiments demonstrate that, compared to RL with token-level actions, CoLA's latent action enables greater semantic diversity in text generation. For enhancing downstream tasks, we show that CoLA with RL achieves a score of 42.4 on the math500 benchmark, surpassing the baseline score of 38.2, and reaches 68.2 when augmented with a Monte Carlo Tree Search variant. Furthermore, CoLA with RL consistently improves performance on agent-based tasks without degrading the pre-trained LLM's capabilities, unlike the baseline. Finally, CoLA reduces computation time by half in tasks involving enhanced thinking prompts for LLMs by RL. These results highlight CoLA's potential to advance RL-based adaptation of LLMs for downstream applications.

Controlling Large Language Model with Latent Actions

TL;DR

Abstract

Controlling Large Language Model with Latent Actions

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)