Table of Contents
Fetching ...

A Novel Multi-branch ConvNeXt Architecture for Identifying Subtle Pathological Features in CT Scans

Irash Perera, Uthayasanker Thayasivam

TL;DR

The paper tackles the challenge of diagnosing subtle pathological features in CT scans, using COVID-19 as the representative task. It introduces a novel multi-branch ConvNeXt architecture that fuses global, salient, and attention-weighted features, built on a robust end-to-end pipeline including CLAHE-based enhancement, lung ROI cropping, and balanced data augmentation. A two-phase transfer-learning strategy—initially training a frozen base, then fine-tuning part of the base—yields a ROC-AUC of 0.9937 and an F1-score of 0.9825 on a 701-slice validation set, outperforming several state-of-the-art baselines on the same datasets. The results demonstrate the approach’s high discriminative power and practical potential for robust medical diagnostics, with broader applicability to other CT-based pathology classification tasks.

Abstract

Intelligent analysis of medical imaging plays a crucial role in assisting clinical diagnosis, especially for identifying subtle pathological features. This paper introduces a novel multi-branch ConvNeXt architecture designed specifically for the nuanced challenges of medical image analysis. While applied here to the specific problem of COVID-19 diagnosis, the methodology offers a generalizable framework for classifying a wide range of pathologies from CT scans. The proposed model incorporates a rigorous end-to-end pipeline, from meticulous data preprocessing and augmentation to a disciplined two-phase training strategy that leverages transfer learning effectively. The architecture uniquely integrates features extracted from three parallel branches: Global Average Pooling, Global Max Pooling, and a new Attention-weighted Pooling mechanism. The model was trained and validated on a combined dataset of 2,609 CT slices derived from two distinct datasets. Experimental results demonstrate a superior performance on the validation set, achieving a final ROC-AUC of 0.9937, a validation accuracy of 0.9757, and an F1-score of 0.9825 for COVID-19 cases, outperforming all previously reported models on this dataset. These findings indicate that a modern, multi-branch architecture, coupled with careful data handling, can achieve performance comparable to or exceeding contemporary state-of-the-art models, thereby proving the efficacy of advanced deep learning techniques for robust medical diagnostics.

A Novel Multi-branch ConvNeXt Architecture for Identifying Subtle Pathological Features in CT Scans

TL;DR

The paper tackles the challenge of diagnosing subtle pathological features in CT scans, using COVID-19 as the representative task. It introduces a novel multi-branch ConvNeXt architecture that fuses global, salient, and attention-weighted features, built on a robust end-to-end pipeline including CLAHE-based enhancement, lung ROI cropping, and balanced data augmentation. A two-phase transfer-learning strategy—initially training a frozen base, then fine-tuning part of the base—yields a ROC-AUC of 0.9937 and an F1-score of 0.9825 on a 701-slice validation set, outperforming several state-of-the-art baselines on the same datasets. The results demonstrate the approach’s high discriminative power and practical potential for robust medical diagnostics, with broader applicability to other CT-based pathology classification tasks.

Abstract

Intelligent analysis of medical imaging plays a crucial role in assisting clinical diagnosis, especially for identifying subtle pathological features. This paper introduces a novel multi-branch ConvNeXt architecture designed specifically for the nuanced challenges of medical image analysis. While applied here to the specific problem of COVID-19 diagnosis, the methodology offers a generalizable framework for classifying a wide range of pathologies from CT scans. The proposed model incorporates a rigorous end-to-end pipeline, from meticulous data preprocessing and augmentation to a disciplined two-phase training strategy that leverages transfer learning effectively. The architecture uniquely integrates features extracted from three parallel branches: Global Average Pooling, Global Max Pooling, and a new Attention-weighted Pooling mechanism. The model was trained and validated on a combined dataset of 2,609 CT slices derived from two distinct datasets. Experimental results demonstrate a superior performance on the validation set, achieving a final ROC-AUC of 0.9937, a validation accuracy of 0.9757, and an F1-score of 0.9825 for COVID-19 cases, outperforming all previously reported models on this dataset. These findings indicate that a modern, multi-branch architecture, coupled with careful data handling, can achieve performance comparable to or exceeding contemporary state-of-the-art models, thereby proving the efficacy of advanced deep learning techniques for robust medical diagnostics.

Paper Structure

This paper contains 17 sections, 11 figures, 2 tables.

Figures (11)

  • Figure 1: Examples of CT scans, first two rows contain images from healthy subjects, whereas the last two rows contain images from COVID-19 patients.b4
  • Figure 2: Comparison of original and CLAHE-enhanced CT scans with corresponding histograms
  • Figure 3: CT Scans and Cropped Lung Regions of COVID-19 CT segmentation dataset
  • Figure 4: Pre-processing steps done for the CT scans
  • Figure 5: Generated CT scans from data augmentation
  • ...and 6 more figures