LEXI: Large Language Models Experimentation Interface
Guy Laban, Tomer Laban, Hatice Gunes
TL;DR
LEXI addresses a critical gap in HAI research by providing an open-source GUI platform for deploying and controlling LLM-powered agents in social interaction experiments. It enables standardized experiment management, agent construction, and integrated data collection (logs and questionnaires), enhancing reproducibility and cross-disciplinary access. A proof-of-concept study shows empathetic agents are perceived as more social and capable, with positive effects on user mood and engagement, supporting the platform’s validity for behavioral research. The work emphasizes open science and ethical considerations, and outlines a roadmap for future features to broaden methodological control and model diversity, enabling deeper investigations into human-agent interactions.
Abstract
The recent developments in Large Language Models (LLM), mark a significant moment in the research and development of social interactions with artificial agents. These agents are widely deployed in a variety of settings, with potential impact on users. However, the study of social interactions with agents powered by LLM is still emerging, limited by access to the technology and to data, the absence of standardised interfaces, and challenges to establishing controlled experimental setups using the currently available business-oriented platforms. To answer these gaps, we developed LEXI, LLMs Experimentation Interface, an open-source tool enabling the deployment of artificial agents powered by LLM in social interaction behavioural experiments. Using a graphical interface, LEXI allows researchers to build agents, and deploy them in experimental setups along with forms and questionnaires while collecting interaction logs and self-reported data. The outcomes of usability testing indicate LEXI's broad utility, high usability and minimum mental workload requirement, with distinctive benefits observed across disciplines. A proof-of-concept study exploring the tool's efficacy in evaluating social HAIs was conducted, resulting in high-quality data. A comparison of empathetic versus neutral agents indicated that people perceive empathetic agents as more social, and write longer and more positive messages towards them.
