CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models

Yifan Zhang; Xue Yang

CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models

Yifan Zhang, Xue Yang

TL;DR

CONSTRUCTA tackles the challenge of automating construction schedules in complex projects by integrating three core components: Static Retrieval-Augmented Generation for domain knowledge, Contextual Knowledge RAG for dynamic, context-sensitive input, and Construction RLHF with CPA-DPO to align outputs with expert preferences. On a proprietary dataset of 4,340 semiconductor activities, it achieves substantial improvements in missing value prediction, dependency analysis, and automated planning, outperforming baselines and highlighting strong discipline-, level-, and area-specific gains. The approach demonstrates how domain-informed retrieval, hierarchical context sampling, and human-guided optimization can yield robust, scalable, and adaptable scheduling for construction workflows, with clear path toward integration with industry tools. These results suggest significant practical impact for automated planning in large-scale construction environments and inform future research on domain-specific LLM customization and deployment strategies.

Abstract

Automating planning with LLMs presents transformative opportunities for traditional industries, yet remains underexplored. In commercial construction, the complexity of automated scheduling often requires manual intervention to ensure precision. We propose CONSTRUCTA, a novel framework leveraging LLMs to optimize construction schedules in complex projects like semiconductor fabrication. CONSTRUCTA addresses key challenges by: (1) integrating construction-specific knowledge through static RAG; (2) employing context-sampling techniques inspired by architectural expertise to provide relevant input; and (3) deploying Construction DPO to align schedules with expert preferences using RLHF. Experiments on proprietary data demonstrate performance improvements of +42.3% in missing value prediction, +79.1% in dependency analysis, and +28.9% in automated planning compared to baseline methods, showcasing its potential to revolutionize construction workflows and inspire domain-specific LLM advancements.

CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models

TL;DR

Abstract

CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)