Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain

Yen-Ku Liu; Yun-Cheng Tsai

Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain

Yen-Ku Liu, Yun-Cheng Tsai

TL;DR

Questions-of-Thoughts (QoT), a quality-driven inference-time scaffold that turns a user goal into an ordered sequence of engineering steps and stepwise self-questioning to verify constraints and reduce omission errors, is introduced, maintaining a lightweight reasoning record that stabilizes subsequent design decisions.

Abstract

Recent advances in large language models (LLMs) have accelerated AI-assisted software development, yet practical deployment remains constrained by incomplete implementations, weak modularization, and inconsistent security practices. We introduce Questions-of-Thoughts (QoT), a quality-driven inference-time scaffold that turns a user goal into (i) an ordered sequence of engineering steps and (ii) stepwise self-questioning to verify constraints and reduce omission errors, while maintaining a lightweight reasoning record that stabilizes subsequent design decisions. We evaluate QoT across three representative backend engineering domains: API Design, Data Communication, and File Systems. Each task requires multi-module decomposition and exposes standard failure modes in LLM-generated systems. To enable data-driven comparison, we score generated artifacts using an ISO/IEC-inspired quality rubric that measures Scalability, Completeness, Modularity, and Security. We report domain-wise gains as the change in total quality score, defined as the QoT score minus the NoQoT score. Results show capacity-dependent improvements: QoT yields consistent quality improvements for larger models and more complex domains, while smaller models may exhibit trade-offs under tight context and planning budgets. We release an open artifact with prompts, scoring guidelines, raw generations, and scripts that reproduce the reported tables and figures to support applied AI and data analytics research.

Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain

TL;DR

Abstract

Paper Structure (18 sections, 2 equations, 4 figures, 3 tables, 1 algorithm)

This paper contains 18 sections, 2 equations, 4 figures, 3 tables, 1 algorithm.

Introduction
Literature Review
LLMs for Code Generation and Software Engineering
Inference-Time Structured Reasoning and Agentic Workflows
Reliability and Quality Criteria for Generated Code
Gap and Motivation
Architecture
Sequential Process Chain
Question-Answer Chain
Reasoning Knowledge Base
Implementation and Results
Evaluation Setup
Evaluation Metrics
Experimental Results
Findings and Observations
...and 3 more sections

Figures (4)

Figure 1: Overall Structure of the Questions-of-Thoughts (QoT) Algorithm
Figure 2: Sequential QA Chain Construction in QoT
Figure 3: Overall experimental results. QoT generally improves quality scores relative to non-QoT baselines, with model- and domain-dependent effects.
Figure 4: Detailed experimental results. Domain-wise comparisons across model configurations with and without QoT highlight capacity-dependent gains and occasional trade-offs. The table summarizes percentage improvements observed in key comparisons.

Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain

TL;DR

Abstract

Quality-Driven Agentic Reasoning for LLM-Assisted Software Design: Questions-of-Thoughts (QoT) as a Time-Series Self-QA Chain

Authors

TL;DR

Abstract

Table of Contents

Figures (4)