Vision-Based Approach for Food Weight Estimation from 2D Images

Chathura Wimalasiri; Prasan Kumar Sahoo

Vision-Based Approach for Food Weight Estimation from 2D Images

Chathura Wimalasiri, Prasan Kumar Sahoo

TL;DR

This paper addresses estimating food weight from 2D images to enable non-invasive dietary assessment. It combines Faster R-CNN for food detection with a MobileNetV3-based regressor, trained on a 2380-image dataset covering 14 foods. Key results include $mAP=83.41\%$, $IoU=91.82\%$, and classifier accuracy $100\%$ for detection, and $RMSE=6.3204$, $MAPE=0.0640\%$, and $R^2=0.9865$ for weight estimation. Significance: demonstrates a robust, practical framework for nutrition counseling, wellness monitoring, and waste reduction by enabling accurate weight estimation from 2D imagery.

Abstract

In response to the increasing demand for efficient and non-invasive methods to estimate food weight, this paper presents a vision-based approach utilizing 2D images. The study employs a dataset of 2380 images comprising fourteen different food types in various portions, orientations, and containers. The proposed methodology integrates deep learning and computer vision techniques, specifically employing Faster R-CNN for food detection and MobileNetV3 for weight estimation. The detection model achieved a mean average precision (mAP) of 83.41\%, an average Intersection over Union (IoU) of 91.82\%, and a classification accuracy of 100\%. For weight estimation, the model demonstrated a root mean squared error (RMSE) of 6.3204, a mean absolute percentage error (MAPE) of 0.0640\%, and an R-squared value of 98.65\%. The study underscores the potential applications of this technology in healthcare for nutrition counseling, fitness and wellness for dietary intake assessment, and smart food storage solutions to reduce waste. The results indicate that the combination of Faster R-CNN and MobileNetV3 provides a robust framework for accurate food weight estimation from 2D images, showcasing the synergy of computer vision and deep learning in practical applications.

Vision-Based Approach for Food Weight Estimation from 2D Images

TL;DR

, and classifier accuracy

for detection, and

, and

for weight estimation. Significance: demonstrates a robust, practical framework for nutrition counseling, wellness monitoring, and waste reduction by enabling accurate weight estimation from 2D imagery.

Abstract

Paper Structure (5 sections, 15 equations, 6 figures, 5 tables)

This paper contains 5 sections, 15 equations, 6 figures, 5 tables.

Introduction
Related Works
Proposed Methodology
Results and Discussions
Conclusion

Figures (6)

Figure 1: The high-level architecture of the proposed solution
Figure 2: Food weight distribution of training, testing and validation data
Figure 3: The proposed weight estimation model
Figure 4: Wrong prediction results of RetinaNet
Figure 5: Faster R-CNN Food Detection and Recognition Results
...and 1 more figures

Vision-Based Approach for Food Weight Estimation from 2D Images

TL;DR

Abstract

Vision-Based Approach for Food Weight Estimation from 2D Images

Authors

TL;DR

Abstract

Table of Contents

Figures (6)