Table of Contents
Fetching ...

Efficient and Accurate Tuberculosis Diagnosis: Attention Residual U-Net and Vision Transformer Based Detection Framework

Greeshma K, Vishnukumar S

TL;DR

The paper addresses automated TB diagnosis from bright-field sputum smear images by proposing a two-stage pipeline: segmentation of bacilli ROIs using an Attention Residual U-Net and subsequent classification with a TBViT Vision Transformer. The approach relies on contour-based ROI extraction and balance-aware transformer training with focal loss to handle class imbalance, achieving state-of-the-art performance across multiple datasets, including a new DCA-CUSAT dataset. Quantitative results show high segmentation fidelity ($J$ and $D$ metrics) and superior bacilli detection metrics ( Accuracy, Precision, Recall, F1) on DCA-CUSAT, Costa, and ZNSM-iDB datasets. This work advances automated TB detection by delivering higher automation, improved accuracy, and robust cross-dataset generalization, with a strong dataset contribution to the community.

Abstract

Tuberculosis (TB), an infectious disease caused by Mycobacterium tuberculosis, continues to be a major global health threat despite being preventable and curable. This burden is particularly high in low and middle income countries. Microscopy remains essential for diagnosing TB by enabling direct visualization of Mycobacterium tuberculosis in sputum smear samples, offering a cost effective approach for early detection and effective treatment. Given the labour-intensive nature of microscopy, automating the detection of bacilli in microscopic images is crucial to improve both the expediency and reliability of TB diagnosis. The current methodologies for detecting tuberculosis bacilli in bright field microscopic sputum smear images are hindered by limited automation capabilities, inconsistent segmentation quality, and constrained classification precision. This paper proposes a twostage deep learning methodology for tuberculosis bacilli detection, comprising bacilli segmentation followed by classification. In the initial phase, an advanced U-Net model employing attention blocks and residual connections is proposed to segment microscopic sputum smear images, enabling the extraction of Regions of Interest (ROIs). The extracted ROIs are then classified using a Vision Transformer, which we specifically customized as TBViT to enhance the precise detection of bacilli within the images. For the experiments, a newly developed dataset of microscopic sputum smear images derived from Ziehl-Neelsen-stained slides is used in conjunction with existing public datasets. The qualitative and quantitative evaluation of the experiments using various metrics demonstrates that the proposed model achieves significantly improved segmentation performance, higher classification accuracy, and a greater level of automation, surpassing existing methods.

Efficient and Accurate Tuberculosis Diagnosis: Attention Residual U-Net and Vision Transformer Based Detection Framework

TL;DR

The paper addresses automated TB diagnosis from bright-field sputum smear images by proposing a two-stage pipeline: segmentation of bacilli ROIs using an Attention Residual U-Net and subsequent classification with a TBViT Vision Transformer. The approach relies on contour-based ROI extraction and balance-aware transformer training with focal loss to handle class imbalance, achieving state-of-the-art performance across multiple datasets, including a new DCA-CUSAT dataset. Quantitative results show high segmentation fidelity ( and metrics) and superior bacilli detection metrics ( Accuracy, Precision, Recall, F1) on DCA-CUSAT, Costa, and ZNSM-iDB datasets. This work advances automated TB detection by delivering higher automation, improved accuracy, and robust cross-dataset generalization, with a strong dataset contribution to the community.

Abstract

Tuberculosis (TB), an infectious disease caused by Mycobacterium tuberculosis, continues to be a major global health threat despite being preventable and curable. This burden is particularly high in low and middle income countries. Microscopy remains essential for diagnosing TB by enabling direct visualization of Mycobacterium tuberculosis in sputum smear samples, offering a cost effective approach for early detection and effective treatment. Given the labour-intensive nature of microscopy, automating the detection of bacilli in microscopic images is crucial to improve both the expediency and reliability of TB diagnosis. The current methodologies for detecting tuberculosis bacilli in bright field microscopic sputum smear images are hindered by limited automation capabilities, inconsistent segmentation quality, and constrained classification precision. This paper proposes a twostage deep learning methodology for tuberculosis bacilli detection, comprising bacilli segmentation followed by classification. In the initial phase, an advanced U-Net model employing attention blocks and residual connections is proposed to segment microscopic sputum smear images, enabling the extraction of Regions of Interest (ROIs). The extracted ROIs are then classified using a Vision Transformer, which we specifically customized as TBViT to enhance the precise detection of bacilli within the images. For the experiments, a newly developed dataset of microscopic sputum smear images derived from Ziehl-Neelsen-stained slides is used in conjunction with existing public datasets. The qualitative and quantitative evaluation of the experiments using various metrics demonstrates that the proposed model achieves significantly improved segmentation performance, higher classification accuracy, and a greater level of automation, surpassing existing methods.
Paper Structure (9 sections, 6 equations, 13 figures, 4 tables)

This paper contains 9 sections, 6 equations, 13 figures, 4 tables.

Figures (13)

  • Figure 1: Proposed System Architecture.
  • Figure 2: Proposed Attention Residual U-Net for segmentation.
  • Figure 3: Architecture of the Residual Convolution.
  • Figure 4: Architecture of Vision Transformer
  • Figure 5: Nikon Ti2-u Eclipse microscope, integrated with the NIS-elements software package at Department of Biotechnology, Cochin University of Science and Technology.
  • ...and 8 more figures