Knowledge-Augmented Long-CoT Generation for Complex Biomolecular Reasoning

Tianwen Lyu; Xiang Zhuang; Keyan Ding; Xinzhe Cao; Lei Liang; Wei Zhao; Qiang Zhang; Huajun Chen

TL;DR

The paper tackles the challenge of reliable, multi-step biomolecular reasoning where LLMs struggle with grounding and long-range dependencies. It introduces Bio-KCoT, a knowledge-augmented long-CoT framework that retrieves and prunes knowledge-graph–guided reasoning paths and integrates them into supervised fine-tuning and reinforcement learning. To support rigorous evaluation, it also introduces PrimeKGQA, a diverse biomolecular QA benchmark with varying reasoning depths. Across PrimeKGQA and external datasets, Bio-KCoT achieves state-of-the-art performance on deep, multi-hop tasks and demonstrates strong generalization with smaller models, highlighting the value of structured knowledge in biology-oriented reasoning.

Abstract

Understanding complex biomolecular mechanisms requires multi-step reasoning across molecular interactions, signaling cascades, and metabolic pathways. While large language models (LLMs) show promise in such tasks, their application to biomolecular problems is hindered by logical inconsistencies and the lack of grounding in domain knowledge. Existing approaches often exacerbate these issues: reasoning steps may deviate from biological facts or fail to capture long mechanistic dependencies. To address these challenges, we propose a Knowledge-Augmented Long-CoT Reasoning framework that integrates LLMs with knowledge graph-based multi-hop reasoning chains. The framework constructs mechanistic chains via guided multi-hop traversal and pruning on the knowledge graph; these chains are then incorporated into supervised fine-tuning to improve factual grounding and further refined with reinforcement learning to enhance reasoning reliability and consistency. Furthermore, to overcome the shortcomings of existing benchmarks, which are often restricted in scale and scope and lack annotations for deep reasoning chains, we introduce PrimeKGQA, a comprehensive benchmark for biomolecular question answering. Experimental results on both PrimeKGQA and existing datasets demonstrate that although larger closed-source models still perform well on relatively simple tasks, our method demonstrates clear advantages as reasoning depth increases, achieving state-of-the-art performance on multi-hop tasks that demand traversal of structured biological knowledge. These findings highlight the effectiveness of combining structured knowledge with advanced reasoning strategies for reliable and interpretable biomolecular reasoning.
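To make "guided multi-hop traversal and pruning" concrete, the following is a minimal sketch, not the authors' implementation. It assumes a toy knowledge graph of (head, relation, tail) triples, a hypothetical per-relation score used as the pruning heuristic, and a beam search that keeps only the highest-scoring partial chains at each hop. All entity names, relation names, scores, and function names are illustrative assumptions.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]  # (head, relation, tail)

# Toy graph: entities and relations are illustrative, not taken from PrimeKG.
TOY_KG: List[Triple] = [
    ("drug_A", "inhibits", "protein_X"),
    ("drug_A", "binds", "protein_Y"),
    ("protein_X", "activates", "pathway_P"),
    ("protein_Y", "interacts_with", "protein_Z"),
    ("pathway_P", "associated_with", "disease_D"),
    ("protein_Z", "associated_with", "disease_E"),
]

# Hypothetical relation scores standing in for a learned or LLM-based guide.
RELATION_SCORE: Dict[str, float] = {
    "inhibits": 0.9,
    "activates": 0.8,
    "associated_with": 0.7,
    "binds": 0.5,
    "interacts_with": 0.4,
}


def build_index(triples: List[Triple]) -> Dict[str, List[Tuple[str, str]]]:
    """Map each head entity to its outgoing (relation, tail) edges."""
    index: Dict[str, List[Tuple[str, str]]] = {}
    for head, rel, tail in triples:
        index.setdefault(head, []).append((rel, tail))
    return index


def find_chains(
    triples: List[Triple],
    source: str,
    target: str,
    max_hops: int = 3,
    beam_width: int = 2,
) -> List[Tuple[float, List[Triple]]]:
    """Return scored chains from source to target, best first."""
    index = build_index(triples)
    # Each beam entry: (score, visited entities, chain of triples so far).
    beam = [(1.0, [source], [])]
    chains: List[Tuple[float, List[Triple]]] = []
    for _ in range(max_hops):
        candidates = []
        for score, visited, path in beam:
            head = visited[-1]
            for rel, tail in index.get(head, []):
                if tail in visited:  # avoid cycles
                    continue
                new_score = score * RELATION_SCORE.get(rel, 0.1)
                new_path = path + [(head, rel, tail)]
                if tail == target:
                    chains.append((new_score, new_path))
                else:
                    candidates.append((new_score, visited + [tail], new_path))
        # Pruning step: keep only the top-scoring partial chains.
        candidates.sort(key=lambda c: c[0], reverse=True)
        beam = candidates[:beam_width]
        if not beam:
            break
    chains.sort(key=lambda c: c[0], reverse=True)
    return chains
```

Calling `find_chains(TOY_KG, "drug_A", "disease_D")` yields a single three-hop chain through `protein_X` and `pathway_P`. With `beam_width=1`, the lower-scored branch through `protein_Y` is pruned after the first hop, so `disease_E` becomes unreachable. This illustrates the trade-off any pruning heuristic makes between chain quality and coverage.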

Paper Structure

Figures (9)