Generalizable Blood Cell Detection via Unified Dataset and Faster R-CNN

Siddharth Sahay

Generalizable Blood Cell Detection via Unified Dataset and Faster R-CNN

Siddharth Sahay

TL;DR

The paper addresses automated peripheral blood cell detection amid heterogeneous data sources by constructing a unified multi-source dataset and evaluating Faster R-CNN with a ResNet-50-FPN backbone. It demonstrates that transfer learning from COCO accelerates convergence and improves most detection metrics compared to random initialization. Per-class analysis reveals strong performance on common cell types but critical data scarcity for rare classes, underscoring the need for targeted data augmentation and synthetic data or few-shot approaches. The study provides a robust, generalizable pipeline for deployable hematology diagnostics, with implications for scalable automated screening in clinical settings.

Abstract

This paper presents a comprehensive methodology and comparative performance analysis for the automated classification and object detection of peripheral blood cells (PBCs) in microscopic images. Addressing the critical challenge of data scarcity and heterogeneity, robust data pipeline was first developed to standardize and merge four public datasets (PBC, BCCD, Chula, Sickle Cell) into a unified resource. Then employed a state-of-the-art Faster R-CNN object detection framework, leveraging a ResNet-50-FPN backbone. Comparative training rigorously evaluated a randomly initialized baseline model (Regimen 1) against a Transfer Learning Regimen (Regimen 2), initialized with weights pre-trained on the Microsoft COCO dataset. The results demonstrate that the Transfer Learning approach achieved significantly faster convergence and superior stability, culminating in a final validation loss of 0.08666, a substantial improvement over the baseline. This validated methodology establishes a robust foundation for building high-accuracy, deployable systems for automated hematological diagnosis.

Generalizable Blood Cell Detection via Unified Dataset and Faster R-CNN

TL;DR

Abstract

Generalizable Blood Cell Detection via Unified Dataset and Faster R-CNN

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)