Automated Design Optimization via Strategic Search with Large Language Models
Anthony Carreon, Vansh Sharma, Venkat Raman
TL;DR
This paper introduces AUTO, an LLM-driven, gradient-free design optimization framework that uses a Strategist and an Implementor to explore ill-defined design spaces, demonstrated on GPU code optimization tasks (chemical kinetics and dense matrix multiplication). The approach emphasizes context-curation, strategic decision-making, and constraint-driven evaluation, achieving 50-70% search efficiency versus Bayesian optimization and competitive performance relative to expert implementations. It provides a detailed analysis of solution quality, convergence behavior, and the cost implications of LLM-based optimization, revealing both promise and current limitations, such as compilation failures and limited knowledge of niche APIs. The work suggests that automated design optimization in ill-defined spaces is feasible and scalable, with potential impact across hardware-software co-design and multi-objective engineering problems, given future enhancements in knowledge integration and stopping criteria.
Abstract
Traditional optimization methods excel in well-defined search spaces but struggle with design problems where transformations and design parameters are difficult to define. Large language models (LLMs) offer a promising alternative by dynamically interpreting design spaces and leveraging encoded domain knowledge. To this end, we introduce AUTO, an LLM agent framework that treats design optimization as a gradient-free search problem guided by strategic LLM reasoning. The framework employs two collaborative agents: a Strategist that selects between exploration and exploitation strategies, and an Implementor that executes detailed designs. Applied to GPU code optimization -- a domain critical to fields from machine learning to scientific computing -- AUTO generates solutions competitive with expert implementations for chemical kinetics integration and dense matrix multiplication. The framework achieves 50-70% search efficiency relative to Bayesian optimization methodologies. It completes optimizations in approximately 8 hours at an estimated cost of up to \$159 per run, compared to an estimated cost of up to \$480 with median-wage software developers. These findings open the door to automating design optimization in ill-defined search spaces with limited prior information.
