Navigate Complex Physical Worlds via Geometrically Constrained LLM

Yongqiang Huang; Wentao Ye; Liyao Li; Junbo Zhao

Navigate Complex Physical Worlds via Geometrically Constrained LLM

Yongqiang Huang, Wentao Ye, Liyao Li, Junbo Zhao

TL;DR

This work innovatively explores the feasibility of using text-based LLMs as builders of the physical world and designs a workflow to enhance their spatial comprehension and construction capabilities.

Abstract

This study investigates the potential of Large Language Models (LLMs) for reconstructing and constructing the physical world solely based on textual knowledge. It explores the impact of model performance on spatial understanding abilities. To enhance the comprehension of geometric and spatial relationships in the complex physical world, the study introduces a set of geometric conventions and develops a workflow based on multi-layer graphs and multi-agent system frameworks. It examines how LLMs achieve multi-step and multi-objective geometric inference in a spatial environment using multi-layer graphs under unified geometric conventions. Additionally, the study employs a genetic algorithm, inspired by large-scale model knowledge, to solve geometric constraint problems. In summary, this work innovatively explores the feasibility of using text-based LLMs as physical world builders and designs a workflow to enhance their capabilities.

Navigate Complex Physical Worlds via Geometrically Constrained LLM

TL;DR

Abstract

Navigate Complex Physical Worlds via Geometrically Constrained LLM

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (18)