Large Causal Models from Large Language Models
Sridhar Mahadevan
TL;DR
DEMOCRITUS introduces a language-enabled framework to build large causal models by harvesting thousands of causal claims from LLMs and weaving them into domain-specific topos slices called large causal models. The core pipeline combines topic discovery, causal question/statement generation, relational triple extraction, and geometric refinement via a Geometric Transformer, producing navigable 2D/3D manifolds and local neighborhoods. Across economics, biology, and archaeology, the approach yields coherent macro-structures, hub-driven causal hubs, and cross-domain bridges, while highlighting limitations such as reliance on LLM content, lack of identifiability, and high compute costs that motivate active manifold building. The work lays a foundation for future dynamical and mechanistic models by linking textual causal narratives to structured, verifiable causal tools and simulations, with potential extensions to ODE-based or agent-based representations. The DEMOCRITUS framework thus offers a scalable, interpretable, and domain-spanning tool for hypothesis generation and exploratory causal reasoning, pending further validation and integration with quantitative causal inference methods.
Abstract
We introduce a new paradigm for building large causal models (LCMs) that exploits the enormous potential latent in today's large language models (LLMs). We describe our ongoing experiments with an implemented system called DEMOCRITUS (Decentralized Extraction of Manifold Ontologies of Causal Relations Integrating Topos Universal Slices) aimed at building, organizing, and visualizing LCMs that span disparate domains extracted from carefully targeted textual queries to LLMs. DEMOCRITUS is methodologically distinct from traditional narrow domain and hypothesis centered causal inference that builds causal models from experiments that produce numerical data. A high-quality LLM is used to propose topics, generate causal questions, and extract plausible causal statements from a diverse range of domains. The technical challenge is then to take these isolated, fragmented, potentially ambiguous and possibly conflicting causal claims, and weave them into a coherent whole, converting them into relational causal triples and embedding them into a LCM. Addressing this technical challenge required inventing new categorical machine learning methods, which we can only briefly summarize in this paper, as it is focused more on the systems side of building DEMOCRITUS. We describe the implementation pipeline for DEMOCRITUS comprising of six modules, examine its computational cost profile to determine where the current bottlenecks in scaling the system to larger models. We describe the results of using DEMOCRITUS over a wide range of domains, spanning archaeology, biology, climate change, economics, medicine and technology. We discuss the limitations of the current DEMOCRITUS system, and outline directions for extending its capabilities.
