Table of Contents
Fetching ...

Optimizing CNN Architectures for Advanced Thoracic Disease Classification

Tejas Mirthipati

TL;DR

The paper addresses thoracic disease classification from chest X-ray images under challenges of dataset imbalance, multi-label outputs, and imaging variations. It evaluates binary, multi-label, and ResNet50 CNN architectures, augmented with PCA-based image compression and a class-weighted loss to mitigate bias. Key findings show 66% accuracy and an AUC of 0.708 for binary classification, while multi-label classification remains challenging due to rare diseases and visually similar conditions, though ResNet50 offers better generalization under computational constraints. The work suggests future directions including data augmentation, synthetic data generation, transfer learning, and incorporating patient demographics and imaging conditions to improve robustness and clinical applicability.

Abstract

Machine learning, particularly convolutional neural networks (CNNs), has shown promise in medical image analysis, especially for thoracic disease detection using chest X-ray images. In this study, we evaluate various CNN architectures, including binary classification, multi-label classification, and ResNet50 models, to address challenges like dataset imbalance, variations in image quality, and hidden biases. We introduce advanced preprocessing techniques such as principal component analysis (PCA) for image compression and propose a novel class-weighted loss function to mitigate imbalance issues. Our results highlight the potential of CNNs in medical imaging but emphasize that issues like unbalanced datasets and variations in image acquisition methods must be addressed for optimal model performance.

Optimizing CNN Architectures for Advanced Thoracic Disease Classification

TL;DR

The paper addresses thoracic disease classification from chest X-ray images under challenges of dataset imbalance, multi-label outputs, and imaging variations. It evaluates binary, multi-label, and ResNet50 CNN architectures, augmented with PCA-based image compression and a class-weighted loss to mitigate bias. Key findings show 66% accuracy and an AUC of 0.708 for binary classification, while multi-label classification remains challenging due to rare diseases and visually similar conditions, though ResNet50 offers better generalization under computational constraints. The work suggests future directions including data augmentation, synthetic data generation, transfer learning, and incorporating patient demographics and imaging conditions to improve robustness and clinical applicability.

Abstract

Machine learning, particularly convolutional neural networks (CNNs), has shown promise in medical image analysis, especially for thoracic disease detection using chest X-ray images. In this study, we evaluate various CNN architectures, including binary classification, multi-label classification, and ResNet50 models, to address challenges like dataset imbalance, variations in image quality, and hidden biases. We introduce advanced preprocessing techniques such as principal component analysis (PCA) for image compression and propose a novel class-weighted loss function to mitigate imbalance issues. Our results highlight the potential of CNNs in medical imaging but emphasize that issues like unbalanced datasets and variations in image acquisition methods must be addressed for optimal model performance.

Paper Structure

This paper contains 13 sections, 2 equations, 6 figures.

Figures (6)

  • Figure 1: Patient Demographics Analysis
  • Figure 2: PCA Compression Analysis for Variance Retention
  • Figure 3: ROC Curve for Baseline Binary Classification Model
  • Figure 4: ROC Curve for Optimized Binary Classification Model
  • Figure 5: Example X-ray Image: Cardiomegaly
  • ...and 1 more figures