BeautyGuard: Designing a Multi-Agent Roundtable System for Proactive Beauty Tech Compliance through Stakeholder Collaboration
Junwei Li, Wenqing Wang, Huiliu Mao, Jiazhe Ni, Zeyu Xiong
TL;DR
BeautyGuard addresses the challenge of enterprise compliance in beauty tech by introducing a multi-agent LLM roundtable that mirrors organizational roles to surface early, explainable, and actionable guidance. The four role-specific agents—Legal Interpreter, Rule Checker, Precedent Researcher, and Risk Planner—work in streaming, progressively revealing their reasoning to support proactive mitigation rather than binary gating. A formative study with six experts informs design (DC1–DC5), and a user study (N=6) demonstrates high usability (SUS 77.5/100) and low workload across most NASA-TLX dimensions, with results showing effective knowledge preservation, preference for information augmentation, and good alignment with existing organizational structures. The work provides design principles for human–AI collaboration in regulated domains and suggests generalizability to other industries and governance processes beyond beauty tech.
Abstract
As generative AI enters enterprise workflows, ensuring compliance with legal, ethical, and reputational standards becomes a pressing challenge. In beauty tech, where biometric and personal data are central, traditional reviews are often manual, fragmented, and reactive. To examine these challenges, we conducted a formative study with six experts (four IT managers, two legal managers) at a multinational beauty company. The study revealed pain points in rule checking, precedent use, and the lack of proactive guidance. Motivated by these findings, we designed a multi-agent "roundtable" system powered by a large language model. The system assigns role-specialized agents for legal interpretation, checklist review, precedent search, and risk mitigation, synthesizing their perspectives into structured compliance advice. We evaluated the prototype with the same experts using System Usability Scale(SUS), The Official NASA Task Load Index(NASA-TLX), and interviews. Results show exceptional usability (SUS: 77.5/100) and minimal cognitive workload, with three key findings: (1) multi-agent systems can preserve tacit knowledge into standardized workflows, (2) information augmentation achieves higher acceptance than decision automation, and (3) successful enterprise AI should mirror organizational structures. This work contributes design principles for human-AI collaboration in compliance review, with broader implications for regulated industries beyond beauty tech.
