Characterizing Web Search in The Age of Generative AI
Elisabeth Kirsten, Jost Grosse Perdekamp, Mihir Upadhyay, Krishna P. Gummadi, Muhammad Bilal Zafar
TL;DR
This study benchmarks traditional Google Organic search against four generative search engines across diverse domains to characterize how generative search outputs differ. It reveals that generative systems tend to draw from a broader and different mix of sources, with varying degrees of reliance on internal model knowledge versus retrieved web content, and they condense information into cohesive narratives rather than lists. While overall topic coverage is similar to organic search, the specific concepts surfaced vary across engines, affecting diversity and perspective. The findings underscore the need for new evaluation criteria that capture source diversity, grounding, synthesis quality, and temporal reliability in GenAI-powered search, especially for time-sensitive queries where retrieval-based systems hold an edge. The work highlights practical implications for search design and evaluation in the age of Generative AI, including how users encounter information and how up-to-date content is integrated into answers.
Abstract
The advent of LLMs has given rise to a new type of web search: Generative search, where LLMs retrieve web pages related to a query and generate a single, coherent text as a response. This output modality stands in stark contrast to traditional web search, where results are returned as a ranked list of independent web pages. In this paper, we ask: Along what dimensions do generative search outputs differ from traditional web search? We compare Google, a traditional web search engine, with four generative search engines from two providers (Google and OpenAI) across queries from four domains. Our analysis reveals intriguing differences. Most generative search engines cover a wider range of sources compared to web search. Generative search engines vary in the degree to which they rely on internal knowledge contained within the model parameters v.s. external knowledge retrieved from the web. Generative search engines surface varying sets of concepts, creating new opportunities for enhancing search diversity and serendipity. Our results also highlight the need for revisiting evaluation criteria for web search in the age of Generative AI.
