ExploreGen: Large Language Models for Envisioning the Uses and Risks of AI Technologies
Viviane Herdel, Sanja Šćepanović, Edyta Bogucka, Daniele Quercia
TL;DR
This work tackles the challenge of envisioning AI uses and regulatory risks early in development by introducing ExploreGen, an LLM-based framework that generates diverse uses of a technology and assesses risk per the EU AI Act. Using Facial Recognition and Analysis as a case, the authors demonstrate that UsesGen can produce realistic uses, including overlooked ones, and that RiskLabelling achieves high alignment with expert classifications. The framework combines a generation module, a legal-risk classification module, and a literature-mapping filter (OverlookedFilter), and is evaluated through scoping reviews and nine practitioner studies, revealing strong literature coverage and practical utility for ideation and compliance. Despite promising results, the study notes limitations in data biases and inter-rater variability and suggests avenues for broader generalization and deeper, domain-specific analyses to further support responsible AI design across technologies and contexts.
Abstract
Responsible AI design is increasingly seen as an imperative by both AI developers and AI compliance experts. One of the key tasks is envisioning AI technology uses and risks. Recent studies on the model and data cards reveal that AI practitioners struggle with this task due to its inherently challenging nature. Here, we demonstrate that leveraging a Large Language Model (LLM) can support AI practitioners in this task by enabling reflexivity, brainstorming, and deliberation, especially in the early design stages of the AI development process. We developed an LLM framework, ExploreGen, which generates realistic and varied uses of AI technology, including those overlooked by research, and classifies their risk level based on the EU AI Act regulation. We evaluated our framework using the case of Facial Recognition and Analysis technology in nine user studies with 25 AI practitioners. Our findings show that ExploreGen is helpful to both developers and compliance experts. They rated the uses as realistic and their risk classification as accurate (94.5%). Moreover, while unfamiliar with many of the uses, they rated them as having high adoption potential and transformational impact.
