MDToC: Metacognitive Dynamic Tree of Concepts for Boosting Mathematical Problem-Solving of Large Language Models
Tung Duong Ta, Tim Oates, Thien Van Luong, Huan Vu, Tien Cuong Nguyen
TL;DR
MDToC introduces a metacognitive dynamic tree of concepts to boost mathematical problem-solving in LLMs by planning diverse concepts, monitoring calculations with evaluator/fixer loops, and reviewing results via majority voting. It constructs a depth-two concept tree, generates accuracy-verified calculations, and uses metacognitive prompts to mitigate intermediate errors. Across CHAMP, MATH, and Game-of-24 benchmarks, MDToC consistently outperforms Tree-of-Thought and Graph-of-Thought prompting across multiple backbone models, including GPT-4 variants, albeit with higher compute cost. The work highlights the promise of metacognitive calculation verification for more reliable, scalable math reasoning in LLMs and discusses practical considerations like cost and hyperparameter sensitivity.
Abstract
Despite advances in mathematical reasoning capabilities, Large Language Models (LLMs) still struggle with calculation verification when using established prompting techniques. We present MDToC (Metacognitive Dynamic Tree of Concepts), a three-phase approach that constructs a concept tree, develops accuracy-verified calculations for each concept, and employs majority voting to evaluate competing solutions. Evaluations across CHAMP, MATH, and Game-of-24 benchmarks demonstrate our MDToC's effectiveness, with GPT-4-Turbo achieving 58.1\% on CHAMP, 86.6\% on MATH, and 85\% on Game-of-24 - outperforming GoT by 5\%, 5.4\%, and 4\% on all these tasks, respectively, without hand-engineered hints. MDToC consistently surpasses existing prompting methods across all backbone models, yielding improvements of up to 7.6\% over ToT and 6.2\% over GoT, establishing metacognitive calculation verification as a promising direction for enhanced mathematical reasoning.
