LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
Xinwu Ye, Yicheng Mao, Jia Zhang, Yimeng Liu, Li Hao, Fang Wu, Zhiwei Li, Yuxuan Liao, Zehong Wang, Zhiyuan Liu, Zhenfei Yin, Li Yuan, Philip Torr, Huan Sun, Xiangxiang Zeng, Mengdi Wang, Le Cong, Shenghua Gao, Xiangru Tang
TL;DR
The paper identifies a fundamental mismatch between discrete linguistic CoT and the continuous, structural nature of chemical reasoning. It introduces LatentChem, a latent-thinking interface that decouples chemical computation from language, enabling multi-step reasoning in continuous latent space with a dynamic perception loop powered by ChemAdapter, ChemUpdater, and LatentProjector. A four-stage training protocol culminating in a Group Relative Policy Optimization phase reveals an emergent phenomenon: models spontaneously internalize reasoning into latent space, achieving superior performance and large inference-speedups over explicit CoT baselines across multiple chemical benchmarks. This work demonstrates that chemical reasoning can be more naturally and efficiently realized through continuous latent dynamics, offering a substantial practical impact for faster, more reliable chemical AI systems and prompting future exploration of hybrid cognitive architectures that can expose latent traces when needed.
Abstract
Chemical large language models (LLMs) predominantly rely on explicit Chain-of-Thought (CoT) in natural language to perform complex reasoning. However, chemical reasoning is inherently continuous and structural, and forcing it into discrete linguistic tokens introduces a fundamental representation mismatch that constrains both efficiency and performance. We introduce LatentChem, a latent reasoning interface that decouples chemical computation from textual generation, enabling models to perform multi-step reasoning directly in continuous latent space while emitting language only for final outputs. Remarkably, we observe a consistent emergent behavior: when optimized solely for task success, models spontaneously internalize reasoning, progressively abandoning verbose textual derivations in favor of implicit latent computation. This shift is not merely stylistic but computationally advantageous. Across diverse chemical reasoning benchmarks, LatentChem achieves a 59.88\% non-tie win rate over strong CoT-based baselines on ChemCoTBench, while delivering a 10.84$\times$ average inference speedup. Our results provide empirical evidence that chemical reasoning is more naturally and effectively realized as continuous latent dynamics rather than discretized linguistic trajectories.
