Designing a Dashboard for Transparency and Control of Conversational AI

Yida Chen; Aoyu Wu; Trevor DePodesta; Catherine Yeh; Kenneth Li; Nicholas Castillo Marin; Oam Patel; Jan Riecke; Shivam Raval; Olivia Seow; Martin Wattenberg; Fernanda Viégas

Designing a Dashboard for Transparency and Control of Conversational AI

Yida Chen, Aoyu Wu, Trevor DePodesta, Catherine Yeh, Kenneth Li, Nicholas Castillo Marin, Oam Patel, Jan Riecke, Shivam Raval, Olivia Seow, Martin Wattenberg, Fernanda Viégas

TL;DR

This work tackles the problem of transparency in conversational AI by exposing and enabling control over the system's internal representation of the user. It introduces TalkTuner, an end-to-end prototype that connects interpretability techniques (linear probes and activation-based control) with a user-facing dashboard to display and adjust four demographic attributes in real time. Through a design-probe methodology and a user study with 19 participants, the paper demonstrates that exposing internal states can enhance user understanding, reveal biases, and increase a sense of control, while also highlighting privacy concerns and the need for careful design. The findings offer a practical design pathway for instrumented, more transparent chatbots and propose concrete directions for expanding attributes, refining controls, and extending to other modalities.

Abstract

Conversational LLMs function as black box systems, leaving users guessing about why they see the output they do. This lack of transparency is potentially problematic, especially given concerns around bias and truthfulness. To address this issue, we present an end-to-end prototype-connecting interpretability techniques with user experience design-that seeks to make chatbots more transparent. We begin by showing evidence that a prominent open-source LLM has a "user model": examining the internal state of the system, we can extract data related to a user's age, gender, educational level, and socioeconomic status. Next, we describe the design of a dashboard that accompanies the chatbot interface, displaying this user model in real time. The dashboard can also be used to control the user model and the system's behavior. Finally, we discuss a study in which users conversed with the instrumented system. Our results suggest that users appreciate seeing internal states, which helped them expose biased behavior and increased their sense of control. Participants also made valuable suggestions that point to future directions for both design and machine learning research. The project page and video demo of our TalkTuner system are available at https://bit.ly/talktuner-project-page

Designing a Dashboard for Transparency and Control of Conversational AI

TL;DR

Abstract

Paper Structure (50 sections, 23 figures, 3 tables)

This paper contains 50 sections, 23 figures, 3 tables.

Introduction
Background and related work
Overall design methodology
Probes for identifying an internal user model
Creating the conversation dataset
Reading probe training and results
Probes for controlling the user model
Causality test results
Designing a dashboard for end users
UI components
User study design
User study results and discussion
Goal 1: Offer transparency into internal representations of users
Goal 2: Provide controls for adjusting and correcting user representations
Goal 3: Augment chat interface to enhance user experience
...and 35 more sections

Figures (23)

Figure 1: Reading probe's validation accuracy across layers.
Figure 2: Dashboard interface. (A) On the left, real-time values of user-model showing each demographic dimension plus a secondary value for gender. (B) The user modifies "Gender" dimension by pinning down "Male." (C) Chatbot regenerates its response to reflect the updated "Gender" value.
Figure 3: User-model accuracy measured by chat turn in study sessions.
Figure 4: Questionnaire responses with Wilcoxon signed rank test. See Appendix \ref{['appendix:post-task-questionnaires']} for full-length questions.
Figure 5: Effect of training data size on the reading and control probe's performance. The accuracy is measured on a held-out validation set of each attribute. Probes were trained and validated on the internal representation at 30th layer. In the plots above, the starting training size for gender is 64, for age and socioeconomic status attribute is 80, for education is 90.
...and 18 more figures

Designing a Dashboard for Transparency and Control of Conversational AI

TL;DR

Abstract

Designing a Dashboard for Transparency and Control of Conversational AI

Authors

TL;DR

Abstract

Table of Contents

Figures (23)