Generative transformations and patterns in LLM-native approaches for software verification and falsification
Víctor A. Braberman, Flavia Bonomo-Braberman, Yiannis Charalambous, Juan G. Colonna, Lucas C. Cordeiro, Rosiane de Freitas
TL;DR
This paper introduces a transformation-centric lens to study LLM-native software engineering for verification and falsification, arguing that treating prompt-driven interactions as discrete generative transformations enables principled analysis and modular design. Through a systematic mapping of over 100 proposals, the authors extract 284 transformations across 111 studies, organize them into a hierarchical taxonomy, and identify inter-transformation patterns that resemble software design patterns. They show how transformation types align with SE problems (e.g., unit testing, fuzzing, vulnerability detection) and reveal common motifs such as Generate and Fix and Build Model, shedding light on compositional strategies and gaps. The work provides a foundational vocabulary and taxonomy to support benchmarks, transformation-specialized tooling, and ex-ante methodologies for reliable, modular LLM-native verification workflows, with implications for future engineering practices in this rapidly evolving domain.
Abstract
The emergence of prompting as the dominant paradigm for leveraging Large Language Models (LLMs) has led to a proliferation of LLM-native software, where application behavior arises from complex, stochastic data transformations. However, the engineering of such systems remains largely exploratory and ad-hoc, hampered by the absence of conceptual frameworks, ex-ante methodologies, design guidelines, and specialized benchmarks. We argue that a foundational step towards a more disciplined engineering practice is a systematic understanding of the core functional units--generative transformations--and their compositional patterns within LLM-native applications. Focusing on the rich domain of software verification and falsification, we conduct a secondary study of over 100 research proposals to address this gap. We first present a fine-grained taxonomy of generative transformations, abstracting prompt-based interactions into conceptual signatures. This taxonomy serves as a scaffolding to identify recurrent transformation relationship patterns--analogous to software design patterns--that characterize solution approaches in the literature. Our analysis not only validates the utility of the taxonomy but also surfaces strategic gaps and cross-dimensional relationships, offering a structured foundation for future research in modular and compositional LLM application design, benchmarking, and the development of reliable LLM-native systems.
