Qwen3-Coder-Next Technical Report

Ruisheng Cao; Mouxiang Chen; Jiawei Chen; Zeyu Cui; Yunlong Feng; Binyuan Hui; Yuheng Jing; Kaixin Li; Mingze Li; Junyang Lin; Zeyao Ma; Kashun Shum; Xuwu Wang; Jinxi Wei; Jiaxi Yang; Jiajun Zhang; Lei Zhang; Zongmeng Zhang; Wenting Zhao; Fan Zhou

Qwen3-Coder-Next Technical Report

Ruisheng Cao, Mouxiang Chen, Jiawei Chen, Zeyu Cui, Yunlong Feng, Binyuan Hui, Yuheng Jing, Kaixin Li, Mingze Li, Junyang Lin, Zeyao Ma, Kashun Shum, Xuwu Wang, Jinxi Wei, Jiaxi Yang, Jiajun Zhang, Lei Zhang, Zongmeng Zhang, Wenting Zhao, Fan Zhou

TL;DR

This work explores how far strong training recipes can push the capability limits of models with small parameter footprints and presents Qwen3-Coder-Next, an open-weight language model specialized for coding agents that activates only 3 billion parameters during inference, enabling strong coding capability with efficient inference.

Abstract

We present Qwen3-Coder-Next, an open-weight language model specialized for coding agents. Qwen3-Coder-Next is an 80-billion-parameter model that activates only 3 billion parameters during inference, enabling strong coding capability with efficient inference. In this work, we explore how far strong training recipes can push the capability limits of models with small parameter footprints. To achieve this, we perform agentic training through large-scale synthesis of verifiable coding tasks paired with executable environments, allowing learning directly from environment feedback via mid-training and reinforcement learning. Across agent-centric benchmarks including SWE-Bench and Terminal-Bench, Qwen3-Coder-Next achieves competitive performance relative to its active parameter count. We release both base and instruction-tuned open-weight versions to support research and real-world coding agent development.

Qwen3-Coder-Next Technical Report

TL;DR

Abstract

Paper Structure (61 sections, 2 equations, 8 figures, 16 tables)

This paper contains 61 sections, 2 equations, 8 figures, 16 tables.

Introduction
Scaling up Agentic Training
Task Synthesis
Creating Executable Environments from GitHub PRs.
Synthesizing Issues.
Infrastructure
Mid-training
Data
Natural Data
GitHub.
Text–code Grounding Data.
GitHub Pull Requests (PRs).
Synthetic Data
Single-turn QA.
Multi-turn Agentic Coding.
...and 46 more sections

Figures (8)

Figure 1: Comparison of Qwen3-Coder-Next to other open-weight models on SWE-Bench Verified, SWE-Bench Multilingual, SWE-Bench Pro, Terminal-Bench 2.0, and Aider.
Figure 2: Our pipeline for synthesizing bugs to scale up the number of software engineering tasks.
Figure 3: Scaling analysis of code agent pretraining on SWE tasks, SWE-Bench Verified and SWE-Bench Multilingual. Left: Within-scaffold scaling shows consistent improvement with the increase of mid-training tokens. Right: Cross-scaffold transfer reveals limited generalization across different agent frameworks.
Figure 4: Different tool chat templates, split by tool definition, tool calls, and tool response.
Figure 5: SWE-bench Verified performance vs. number of tool chat templates. Data volume and training configuration are kept identical.
...and 3 more figures

Qwen3-Coder-Next Technical Report

TL;DR

Abstract

Qwen3-Coder-Next Technical Report

Authors

TL;DR

Abstract

Table of Contents

Figures (8)