AI Governance and Accountability: An Analysis of Anthropic's Claude
Aman Priyanshu, Yash Maurya, Zuofei Hong
TL;DR
The paper addresses governance and accountability challenges for high-impact LLMs, focusing on Anthropic's Claude. It applies the NIST AI Risk Management Framework and the EU AI Act to map threats and evaluate governance controls, including Anthropic's Constitutional AI. Key findings include transparency gaps in privacy policies, risks of hallucinations and biases, and data-usage concerns in partnerships. The authors propose three mitigations (transparent data practices, rigorous benchmarking, and a remediation pipeline for data deletion and unlearning) that aim to enhance trust and guide responsible deployment.
Abstract
As AI systems become increasingly prevalent and impactful, effective AI governance and accountability measures are essential. This paper examines the AI governance landscape, focusing on Anthropic's Claude, a foundation model. We analyze Claude through the lens of the NIST AI Risk Management Framework and the EU AI Act, identifying potential threats and proposing mitigation strategies. The paper highlights the importance of transparency, rigorous benchmarking, and comprehensive data handling processes in ensuring the responsible development and deployment of AI systems. We conclude by discussing the social impact of AI governance and the ethical considerations surrounding AI accountability.
