GPT in Sheep's Clothing: The Risk of Customized GPTs
Sagiv Antebi, Noam Azulay, Edan Habler, Ben Ganon, Asaf Shabtai, Yuval Elovici
TL;DR
The paper investigates security risks posed by OpenAI's customizable GPTs, arguing that attackers can abuse these shared AI agents to conduct privacy- and security-threatening activities. It introduces a threat taxonomy spanning Vulnerability Steering, Malicious Injection, and Information Theft, and demonstrates concrete attack scenarios, including $N$-Day exploit guidance, insecure code generation, malicious libraries, and phishing via GPTs. The authors propose practical mitigations—self-checks, verification, community reputation, explicit link presentation, and API-call scrutiny—showing these defenses can detect or deter many attacks, though they acknowledge the need for more robust, systemic safeguards. The work highlights the importance of cautious use, transparent builder authentication, and community mechanisms to protect users as GPT customization evolves.
Abstract
In November 2023, OpenAI introduced a new service allowing users to create custom versions of ChatGPT (GPTs) by using specific instructions and knowledge to guide the model's behavior. We aim to raise awareness of the fact that GPTs can be used maliciously, posing privacy and security risks to their users.
