GEO: Generative Engine Optimization
Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande
TL;DR
This work formalizes Generative Engine Optimization (GEO), a black-box optimization framework that helps content creators improve their visibility in Generative Engine responses. It introduces a concrete set of impression metrics, including $Imp_{wc}$, $Imp_{pwc}$, and $SubjectiveImpression$, and defines a consumer-centric objective that balances citation relevance and presence within GE outputs. The authors validate GEO using GEO-bench, a large, diverse 10K-query benchmark across nine datasets, and demonstrate up to 40% gains in GE-visible content, with domain-dependent effectiveness and strong gains from combining GEO strategies. They further validate practicality by testing on Perplexity.ai, a deployed GE, and discuss implications for the creator economy, providing a foundation for future domain-specific optimization and broader adoption in real-world GE ecosystems.
Abstract
The advent of large language models (LLMs) has ushered in a new paradigm of search engines that use generative models to gather and summarize information to answer user queries. This emerging technology, which we formalize under the unified framework of generative engines (GEs), can generate accurate and personalized responses, rapidly replacing traditional search engines like Google and Bing. Generative Engines typically satisfy queries by synthesizing information from multiple sources and summarizing them using LLMs. While this shift significantly improves $\textit{user}$ utility and $\textit{generative search engine}$ traffic, it poses a huge challenge for the third stakeholder -- website and content creators. Given the black-box and fast-moving nature of generative engines, content creators have little to no control over $\textit{when}$ and $\textit{how}$ their content is displayed. With generative engines here to stay, we must ensure the creator economy is not disadvantaged. To address this, we introduce Generative Engine Optimization (GEO), the first novel paradigm to aid content creators in improving their content visibility in generative engine responses through a flexible black-box optimization framework for optimizing and defining visibility metrics. We facilitate systematic evaluation by introducing GEO-bench, a large-scale benchmark of diverse user queries across multiple domains, along with relevant web sources to answer these queries. Through rigorous evaluation, we demonstrate that GEO can boost visibility by up to $40\%$ in generative engine responses. Moreover, we show the efficacy of these strategies varies across domains, underscoring the need for domain-specific optimization methods. Our work opens a new frontier in information discovery systems, with profound implications for both developers of generative engines and content creators.
