CookingSense: A Culinary Knowledgebase with Multidisciplinary Assertions
Donghee Choi, Mogan Gim, Donghyeon Park, Mujeen Sung, Hyunjae Kim, Jaewoo Kang, Jihun Choi
TL;DR
CookingSense presents a large-scale culinary knowledge base constructed from web data, scientific literature, and recipes, combined with dictionary- and LM-driven filtering to yield multidimensional culinary assertions. The authors also introduce FoodBench, a benchmark for evaluating culinary decision-support tasks and to assess knowledge augmentation for retrieval-augmented generation. The pipeline generates 34 million categorized assertions from 68 million inputs, and semantic categorization enables targeted retrieval for downstream tasks. Empirical results show that CookingSense consistently improves RAG performance across culinary QA, flavor, and cultural tasks, illustrating its potential to power practical culinary decision-support tools and future domain-specific chatbots.
Abstract
This paper introduces CookingSense, a descriptive collection of knowledge assertions in the culinary domain extracted from various sources, including web data, scientific papers, and recipes, from which knowledge covering a broad range of aspects is acquired. CookingSense is constructed through a series of dictionary-based filtering and language model-based semantic filtering techniques, which results in a rich knowledgebase of multidisciplinary food-related assertions. Additionally, we present FoodBench, a novel benchmark to evaluate culinary decision support systems. From evaluations with FoodBench, we empirically prove that CookingSense improves the performance of retrieval augmented language models. We also validate the quality and variety of assertions in CookingSense through qualitative analysis.
