Table of Contents
Fetching ...

Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets

Luis Chuquimarca, Boris Vintimilla, Sergio Velastin

TL;DR

This work tackles automatic banana ripeness level classification by generating a large synthetic image dataset using Unreal Engine and pairing it with a smaller real dataset. A lightweight CNN, CIDIS, is first trained on synthetic data (CNN1) and then refined on real data through transfer learning (CNN2), achieving higher accuracy than several state-of-the-art architectures. Compared against InceptionV3, ResNet50, Inception-ResNetV2, and VGG19, the CIDIS approach reaches 0.917 accuracy with fast inference, demonstrating that synthetic-to-real transfer can significantly improve generalization in agricultural quality inspection tasks. The methodology offers a cost-effective path to scalable, industrial-grade banana maturity assessment across four maturity levels.

Abstract

The level of ripeness is essential in determining the quality of bananas. To correctly estimate banana maturity, the metrics of international marketing standards need to be considered. However, the process of assessing the maturity of bananas at an industrial level is still carried out using manual methods. The use of CNN models is an attractive tool to solve the problem, but there is a limitation regarding the availability of sufficient data to train these models reliably. On the other hand, in the state-of-the-art, existing CNN models and the available data have reported that the accuracy results are acceptable in identifying banana maturity. For this reason, this work presents the generation of a robust dataset that combines real and synthetic data for different levels of banana ripeness. In addition, it proposes a simple CNN architecture, which is trained with synthetic data and using the transfer learning technique, the model is improved to classify real data, managing to determine the level of maturity of the banana. The proposed CNN model is evaluated with several architectures, then hyper-parameter configurations are varied, and optimizers are used. The results show that the proposed CNN model reaches a high accuracy of 0.917 and a fast execution time.

Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets

TL;DR

This work tackles automatic banana ripeness level classification by generating a large synthetic image dataset using Unreal Engine and pairing it with a smaller real dataset. A lightweight CNN, CIDIS, is first trained on synthetic data (CNN1) and then refined on real data through transfer learning (CNN2), achieving higher accuracy than several state-of-the-art architectures. Compared against InceptionV3, ResNet50, Inception-ResNetV2, and VGG19, the CIDIS approach reaches 0.917 accuracy with fast inference, demonstrating that synthetic-to-real transfer can significantly improve generalization in agricultural quality inspection tasks. The methodology offers a cost-effective path to scalable, industrial-grade banana maturity assessment across four maturity levels.

Abstract

The level of ripeness is essential in determining the quality of bananas. To correctly estimate banana maturity, the metrics of international marketing standards need to be considered. However, the process of assessing the maturity of bananas at an industrial level is still carried out using manual methods. The use of CNN models is an attractive tool to solve the problem, but there is a limitation regarding the availability of sufficient data to train these models reliably. On the other hand, in the state-of-the-art, existing CNN models and the available data have reported that the accuracy results are acceptable in identifying banana maturity. For this reason, this work presents the generation of a robust dataset that combines real and synthetic data for different levels of banana ripeness. In addition, it proposes a simple CNN architecture, which is trained with synthetic data and using the transfer learning technique, the model is improved to classify real data, managing to determine the level of maturity of the banana. The proposed CNN model is evaluated with several architectures, then hyper-parameter configurations are varied, and optimizers are used. The results show that the proposed CNN model reaches a high accuracy of 0.917 and a fast execution time.

Paper Structure

This paper contains 9 sections, 7 figures, 5 tables.

Figures (7)

  • Figure 1: Banana maturity identification process.
  • Figure 2: Real Dataset Refinement.
  • Figure 3: Virtual scenario for the generation of the synthetic images using Unreal Engine.
  • Figure 4: Synthetic images of banana maturity levels using different backgrounds.
  • Figure 5: Variation in the amount of bananas.
  • ...and 2 more figures