Generating Phishing Attacks using ChatGPT
Sayak Saha Roy, Krishna Vamsi Naragam, Shirin Nilizadeh
TL;DR
This paper investigates how ChatGPT can be exploited to generate phishing websites, including evasive variants, by engineering prompts that bypass safeguards. It decomposes prompt design into design, credential theft, exploit, and data transfer components to assemble functional attack code. The authors demonstrate multiple attack types (regular and evasive) targeting 50 brands, and validate feasibility by hosting on free services with minimal prompts. The work highlights the rapid deployment risk posed by LLMs and underscores the need for robust defenses and policy responses.
Abstract
The ability of ChatGPT to generate human-like responses and understand context has made it a popular tool for conversational agents, content creation, data analysis, and research and innovation. However, its effectiveness and ease of accessibility makes it a prime target for generating malicious content, such as phishing attacks, that can put users at risk. In this work, we identify several malicious prompts that can be provided to ChatGPT to generate functional phishing websites. Through an iterative approach, we find that these phishing websites can be made to imitate popular brands and emulate several evasive tactics that have been known to avoid detection by anti-phishing entities. These attacks can be generated using vanilla ChatGPT without the need of any prior adversarial exploits (jailbreaking).
