DISC: Dynamic Decomposition Improves LLM Inference Scaling

Jonathan Light; Wei Cheng; Benjamin Riviere; Wu Yue; Masafumi Oyamada; Mengdi Wang; Yisong Yue; Santiago Paternain; Haifeng Chen

DISC: Dynamic Decomposition Improves LLM Inference Scaling

Jonathan Light, Wei Cheng, Benjamin Riviere, Wu Yue, Masafumi Oyamada, Mengdi Wang, Yisong Yue, Santiago Paternain, Haifeng Chen

TL;DR

DISC addresses the inefficiency of static step sizes in LLM inference by introducing a dynamic decomposition that adaptively partitions solution steps based on real-time reward statistics. It combines a z-score based acceptance criterion with an adaptive prefix refinement to concentrate sampling on difficult regions, and is designed to be plug-and-play with greedy, beam, and Monte Carlo Tree Search. Empirically, DISC delivers consistent improvements in pass@k and token efficiency across APPS, MATH, and LiveCodeBench, including strong gains with open-source models and reasoning prompts, while maintaining negligible runtime overhead. The framework relies on minimal assumptions, requires only a scalar reward signal, and offers theoretical intuition about optimality under certain policy conditions, alongside practical analyses on temperature, partition fraction, and acceptance criteria. Overall, DISC provides a scalable, general approach to adaptive inference that can inform curriculum design, dataset augmentation, and future research in efficient reasoning for LLMs.

Abstract

Inference scaling methods for LLMs often rely on decomposing problems into steps (or groups of tokens), followed by sampling and selecting the best next steps. However, these steps and their sizes are often predetermined or manually designed based on domain knowledge. We propose dynamic decomposition, a method that adaptively and automatically partitions solution and reasoning traces into manageable steps during inference. By more effectively allocating compute -- particularly through subdividing challenging steps and prioritizing their sampling -- dynamic decomposition significantly improves inference efficiency. Experiments on benchmarks such as APPS, MATH, and LiveCodeBench demonstrate that dynamic decomposition outperforms static approaches, including token-level, sentence-level, and single-step decompositions, reducing the pass@10 error rate by 5.0%, 6.7%, and 10.5% respectively. These findings highlight the potential of dynamic decomposition to improve a wide range of inference scaling techniques.

DISC: Dynamic Decomposition Improves LLM Inference Scaling

TL;DR

Abstract

DISC: Dynamic Decomposition Improves LLM Inference Scaling

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (48)

Theorems & Definitions (11)