From Ambiguity to Verdict: A Semiotic-Grounded Multi-Perspective Agent for LLM Logical Reasoning
Yunyao Zhang, Xinglang Zhang, Junxi Sheng, Wenbing Li, Junqing Yu, Wei Yang, Zikai Song
TL;DR
This paper tackles the challenge of reasoning under simultaneous semantic and logical complexity by introducing LogicAgent, a semiotic-square-guided framework with three stages: semantic structuring, logical reasoning, and reflective verification. It grounds semantic oppositions in Greimas’ semiotic square, translates NL premises to FOL, and verifies conclusions through a three-tier reflection mechanism, addressing ambiguity and abstract concepts. To evaluate joint semantic and logical depth, the authors construct RepublicQA, a philosophically rich benchmark with high FKGL and systematic contrary-constructs, and demonstrate state-of-the-art performance on RepublicQA and strong generalization to ProntoQA, ProofWriter, FOLIO, and ProverQA. The work highlights the importance of multi-perspective, symbolically grounded reasoning for robust logical performance in semantically dense, abstract contexts.
Abstract
Logical reasoning is a fundamental capability of large language models. However, existing studies often overlook the interaction between logical complexity and semantic complexity, leading to systems that struggle with abstract propositions, ambiguous contexts, and conflicting stances that are central to human reasoning. We propose LogicAgent, a semiotic-square-guided framework that jointly addresses these two axes of difficulty. The semiotic square provides a principled structure for multi-perspective semantic analysis, and LogicAgent integrates automated deduction with reflective verification to manage logical complexity across deeper reasoning chains. To support evaluation under these conditions, we introduce RepublicQA, a benchmark that couples semantic complexity with logical depth. RepublicQA reaches college-level semantic difficulty (FKGL 11.94), contains philosophically grounded abstract propositions with systematically constructed contrary and contradictory forms, and offers a semantically rich setting for assessing logical reasoning in large language models. Experiments show that LogicAgent achieves state-of-the-art performance on RepublicQA with a 6.25 percent average improvement over strong baselines, and generalizes effectively to mainstream logical reasoning benchmarks including ProntoQA, ProofWriter, FOLIO, and ProverQA, achieving an additional 7.05 percent average gain. These results demonstrate the effectiveness of semiotic-grounded multi-perspective reasoning in enhancing logical performance.
