SnappyMeal: Design and Longitudinal Evaluation of a Multimodal AI Food Logging Application
Liam Bakar, Zachary Englhardt, Vidya Srinivas, Girish Narayanswamy, Dilini Nissanka, Shwetak Patel, Vikram Iyer
TL;DR
This work addresses the rigidity and inaccuracies of traditional food logging by introducing SnappyMeal, a multimodal AI-powered system for flexible dietary tracking. It combines image, text, and audio inputs with retrieval-augmented context from receipts and nutritional databases, plus goal-driven follow-up questions to fill in missing details. The approach is validated through public nutrition benchmarks and a 3-week longitudinal deployment (n>500 logs) that shows high user engagement and perceived accuracy, while highlighting trade-offs where follow-up prompts can introduce cognitive load. The study demonstrates the value of context-aware, restrained AI in self-tracking, laying groundwork for intelligent, user-centric nutrition logging tools and informing design principles for future health-domain applications.
Abstract
Food logging, both self-directed and prescribed, plays a critical role in uncovering correlations between diet, medical, fitness, and health outcomes. Through conversations with nutritional experts and individuals who practice dietary tracking, we find current logging methods, such as handwritten and app-based journaling, are inflexible and result in low adherence and potentially inaccurate nutritional summaries. These findings, corroborated by prior literature, emphasize the urgent need for improved food logging methods. In response, we propose SnappyMeal, an AI-powered dietary tracking system that leverages multimodal inputs to enable users to more flexibly log their food intake. SnappyMeal introduces goal-dependent follow-up questions to intelligently seek missing context from the user and information retrieval from user grocery receipts and nutritional databases to improve accuracy. We evaluate SnappyMeal through publicly available nutrition benchmarks and a multi-user, 3-week, in-the-wild deployment capturing over 500 logged food instances. Users strongly praised the multiple available input methods and reported a strong perceived accuracy. These insights suggest that multimodal AI systems can be leveraged to significantly improve dietary tracking flexibility and context-awareness, laying the groundwork for a new class of intelligent self-tracking applications.
