Auditing Google's AI Overviews and Featured Snippets: A Case Study on Baby Care and Pregnancy
Desheng Hu, Joachim Baumann, Aleksandra Urman, Elsa Lichtenegger, Robin Forsberg, Aniko Hannak, Christo Wilson
TL;DR
This study probes Google's AI Overviews (AIO) and Featured Snippets (FS) in pregnancy and baby care searches to assess information quality in a high-stakes domain. Using a 1,508-query audit with a rigorous manual evaluation framework across prevalence, consistency, relevance, safeguards, sources, and sentiment, the authors reveal substantial AIO dominance and notable inconsistencies between co-displayed AIO and FS content. Safeguard cues are rare, while health-related sources predominate yet include low- and medium-credibility domains, and FS shows greater reliance on Shopping/Business domains, raising concerns about information reliability. The findings demonstrate the importance of robust quality controls and provide a transferable framework for auditing AI-mediated health information across high-stakes domains. The work highlights implications for user trust, decision-making in pregnancy-related contexts, and policy considerations for health information dissemination via AI-assisted search interfaces.
Abstract
Google Search increasingly surfaces AI-generated content through features like AI Overviews (AIO) and Featured Snippets (FS), which users frequently rely on despite having no control over their presentation. Through a systematic algorithm audit of 1,508 real baby care and pregnancy-related queries, we evaluate the quality and consistency of these information displays. Our robust evaluation framework assesses multiple quality dimensions, including answer consistency, relevance, presence of medical safeguards, source categories, and sentiment alignment. Our results reveal concerning gaps in information consistency, with information in AIO and FS displayed on the same search result page being inconsistent with each other in 33% of cases. Despite high relevance scores, both features critically lack medical safeguards (present in just 11% of AIO and 7% of FS responses). While health and wellness websites dominate source categories for both, AIO and FS, FS also often link to commercial sources. These findings have important implications for public health information access and demonstrate the need for stronger quality controls in AI-mediated health information. Our methodology provides a transferable framework for auditing AI systems across high-stakes domains where information quality directly impacts user well-being.
