Table of Contents
Fetching ...

Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach

Pronay Debnath, Usafa Akther Rifa, Busra Kamal Rafa, Ali Haider Talukder Akib, Md. Aminur Rahman

TL;DR

This paper introduces Celeb-FBI, a public benchmark dataset containing 7,211 full-body celebrity images with labeled height, weight, age, and gender to address the scarcity of multi-attribute full-body datasets. It outlines a preprocessing and modeling pipeline using CNN, ResNet-50, and VGG-16, with SMOTE applied to balance classes, and concludes that ResNet-50 provides the strongest performance across all four attributes (age 79.18%, gender 95.43%, height 85.60%, weight 81.91%). The work establishes a new resource for research in security, healthcare analytics, and fashion-related applications, and discusses validation by experts and planned dataset expansions. Future directions include dataset balancing refinements and direct BMI/BMR estimation from full-body imagery, underscoring the dataset’s potential to spur practical biometric and health-related insights.

Abstract

The scarcity of comprehensive datasets in surveillance, identification, image retrieval systems, and healthcare poses a significant challenge for researchers in exploring new methodologies and advancing knowledge in these respective fields. Furthermore, the need for full-body image datasets with detailed attributes like height, weight, age, and gender is particularly significant in areas such as fashion industry analytics, ergonomic design assessment, virtual reality avatar creation, and sports performance analysis. To address this gap, we have created the 'Celeb-FBI' dataset which contains 7,211 full-body images of individuals accompanied by detailed information on their height, age, weight, and gender. Following the dataset creation, we proceed with the preprocessing stages, including image cleaning, scaling, and the application of Synthetic Minority Oversampling Technique (SMOTE). Subsequently, utilizing this prepared dataset, we employed three deep learning approaches: Convolutional Neural Network (CNN), 50-layer ResNet, and 16-layer VGG, which are used for estimating height, weight, age, and gender from human full-body images. From the results obtained, ResNet-50 performed best for the system with an accuracy rate of 79.18% for age, 95.43% for gender, 85.60% for height and 81.91% for weight.

Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach

TL;DR

This paper introduces Celeb-FBI, a public benchmark dataset containing 7,211 full-body celebrity images with labeled height, weight, age, and gender to address the scarcity of multi-attribute full-body datasets. It outlines a preprocessing and modeling pipeline using CNN, ResNet-50, and VGG-16, with SMOTE applied to balance classes, and concludes that ResNet-50 provides the strongest performance across all four attributes (age 79.18%, gender 95.43%, height 85.60%, weight 81.91%). The work establishes a new resource for research in security, healthcare analytics, and fashion-related applications, and discusses validation by experts and planned dataset expansions. Future directions include dataset balancing refinements and direct BMI/BMR estimation from full-body imagery, underscoring the dataset’s potential to spur practical biometric and health-related insights.

Abstract

The scarcity of comprehensive datasets in surveillance, identification, image retrieval systems, and healthcare poses a significant challenge for researchers in exploring new methodologies and advancing knowledge in these respective fields. Furthermore, the need for full-body image datasets with detailed attributes like height, weight, age, and gender is particularly significant in areas such as fashion industry analytics, ergonomic design assessment, virtual reality avatar creation, and sports performance analysis. To address this gap, we have created the 'Celeb-FBI' dataset which contains 7,211 full-body images of individuals accompanied by detailed information on their height, age, weight, and gender. Following the dataset creation, we proceed with the preprocessing stages, including image cleaning, scaling, and the application of Synthetic Minority Oversampling Technique (SMOTE). Subsequently, utilizing this prepared dataset, we employed three deep learning approaches: Convolutional Neural Network (CNN), 50-layer ResNet, and 16-layer VGG, which are used for estimating height, weight, age, and gender from human full-body images. From the results obtained, ResNet-50 performed best for the system with an accuracy rate of 79.18% for age, 95.43% for gender, 85.60% for height and 81.91% for weight.
Paper Structure (25 sections, 4 equations, 10 figures, 3 tables)

This paper contains 25 sections, 4 equations, 10 figures, 3 tables.

Figures (10)

  • Figure 1: Sample of collected images
  • Figure 2: Dataset (Age)
  • Figure 3: Dataset (Gender)
  • Figure 4: Dataset (Height)
  • Figure 5: Dataset (Weight)
  • ...and 5 more figures