Multi-Modal Zero-Shot Prediction of Color Trajectories in Food Drying
Shichen Li, Ahmadreza Eslaminia, Chenhui Shao
TL;DR
<3-5 sentence high-level summary>Addressing the challenges of predicting color-change trajectories in food drying under unseen processing conditions, the paper introduces a multi-modal, zero-shot framework that represents color evolution as a weighted sum of predefined basis functions with learned coefficients. The approach integrates DCT-based denoising, initial-state image features via multi-modal fusion, and similarity-based training data selection to enhance generalization, outperforming a baseline moment-by-moment predictor. Case studies on cookie and apple drying demonstrate strong zero-shot generalization and substantial RMSE improvements, highlighting robust performance and practical potential for in-line quality monitoring. The work advances trajectory-aware quality control in industrial drying by leveraging high-dimensional color data, process parameters, and early visual cues to generalize across varied conditions.
Abstract
Food drying is widely used to reduce moisture content, ensure safety, and extend shelf life. Color evolution of food samples is an important indicator of product quality in food drying. Although existing studies have examined color changes under different drying conditions, current approaches primarily rely on low-dimensional color features and cannot fully capture the complex, dynamic color trajectories of food samples. Moreover, existing modeling approaches lack the ability to generalize to unseen process conditions. To address these limitations, we develop a novel multi-modal color-trajectory prediction method that integrates high-dimensional temporal color information with drying process parameters to enable accurate and data-efficient color trajectory prediction. Under unseen drying conditions, the model attains RMSEs of 2.12 for cookie drying and 1.29 for apple drying, reducing errors by over 90% compared with baseline models. These experimental results demonstrate the model's superior accuracy, robustness, and broad applicability.
