Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering
Jash Rajesh Parekh, Wonbin Kweon, Joey Chan, Rezarta Islamaj, Robert Leaman, Pengcheng Jiang, Chih-Hsuan Wei, Zhizheng Wang, Zhiyong Lu, Jiawei Han
TL;DR
This work tackles the gap in biomedical QA where clinical decisions are contingent on patient-specific factors. It introduces CondMedQA, a benchmark designed to test conditional multi-hop reasoning, and Condition-Gated Reasoning (CGR), a framework that builds condition-aware knowledge graphs and gates reasoning paths to ensure contextually valid inferences. CGR extracts condition-rich n-tuples, normalizes entities with UMLS, and performs gated graph traversal guided by a single LLM-based condition evaluation, yielding grounded, path-traced answers. Across CondMedQA and existing biomedical QA benchmarks, CGR achieves state-of-the-art performance on condition-sensitive questions while maintaining competitive results on factual tasks, highlighting the importance of explicit conditional modeling for robust medical reasoning and interpretability.
Abstract
Current biomedical question answering (QA) systems often assume that medical knowledge applies uniformly, yet real-world clinical reasoning is inherently conditional: nearly every decision depends on patient-specific factors such as comorbidities and contraindications. Existing benchmarks do not evaluate such conditional reasoning, and retrieval-augmented or graph-based methods lack explicit mechanisms to ensure that retrieved knowledge is applicable to given context. To address this gap, we propose CondMedQA, the first benchmark for conditional biomedical QA, consisting of multi-hop questions whose answers vary with patient conditions. Furthermore, we propose Condition-Gated Reasoning (CGR), a novel framework that constructs condition-aware knowledge graphs and selectively activates or prunes reasoning paths based on query conditions. Our findings show that CGR more reliably selects condition-appropriate answers while matching or exceeding state-of-the-art performance on biomedical QA benchmarks, highlighting the importance of explicitly modeling conditionality for robust medical reasoning.
