Table of Contents
Fetching ...

Hough-CNN: Deep Learning for Segmentation of Deep Brain Regions in MRI and Ultrasound

Fausto Milletari, Seyed-Ahmad Ahmadi, Christine Kroll, Annika Plate, Verena Rozanski, Juliana Maiostre, Johannes Levin, Olaf Dietrich, Birgit Ertl-Wagner, Kai Bötzel, Nassir Navab

TL;DR

This work introduces Hough-CNN, a segmentation framework that combines CNN-based voxel classification with a generalized Hough voting scheme to localize and segment multiple deep brain structures in MRI and transcranial ultrasound. By leveraging intermediate CNN features and a patch-wise database of segmentation patches, the method achieves robust, registration-free, multi-region segmentation across 2D, 2.5D, and 3D inputs, often using far fewer training patches than voxel-wise approaches. A comprehensive study across six architectures and two modalities demonstrates that Hough-CNN outperforms standard voxel-wise segmentation, with 3D data enhancing MRI performance and deep networks providing gains in ultrasound. The approach offers practical, scalable segmentation suitable for clinical workflows, with potential for transfer learning and broader disease parameter analyses.

Abstract

In this work we propose a novel approach to perform segmentation by leveraging the abstraction capabilities of convolutional neural networks (CNNs). Our method is based on Hough voting, a strategy that allows for fully automatic localisation and segmentation of the anatomies of interest. This approach does not only use the CNN classification outcomes, but it also implements voting by exploiting the features produced by the deepest portion of the network. We show that this learning-based segmentation method is robust, multi-region, flexible and can be easily adapted to different modalities. In the attempt to show the capabilities and the behaviour of CNNs when they are applied to medical image analysis, we perform a systematic study of the performances of six different network architectures, conceived according to state-of-the-art criteria, in various situations. We evaluate the impact of both different amount of training data and different data dimensionality (2D, 2.5D and 3D) on the final results. We show results on both MRI and transcranial US volumes depicting respectively 26 regions of the basal ganglia and the midbrain.

Hough-CNN: Deep Learning for Segmentation of Deep Brain Regions in MRI and Ultrasound

TL;DR

This work introduces Hough-CNN, a segmentation framework that combines CNN-based voxel classification with a generalized Hough voting scheme to localize and segment multiple deep brain structures in MRI and transcranial ultrasound. By leveraging intermediate CNN features and a patch-wise database of segmentation patches, the method achieves robust, registration-free, multi-region segmentation across 2D, 2.5D, and 3D inputs, often using far fewer training patches than voxel-wise approaches. A comprehensive study across six architectures and two modalities demonstrates that Hough-CNN outperforms standard voxel-wise segmentation, with 3D data enhancing MRI performance and deep networks providing gains in ultrasound. The approach offers practical, scalable segmentation suitable for clinical workflows, with potential for transfer learning and broader disease parameter analyses.

Abstract

In this work we propose a novel approach to perform segmentation by leveraging the abstraction capabilities of convolutional neural networks (CNNs). Our method is based on Hough voting, a strategy that allows for fully automatic localisation and segmentation of the anatomies of interest. This approach does not only use the CNN classification outcomes, but it also implements voting by exploiting the features produced by the deepest portion of the network. We show that this learning-based segmentation method is robust, multi-region, flexible and can be easily adapted to different modalities. In the attempt to show the capabilities and the behaviour of CNNs when they are applied to medical image analysis, we perform a systematic study of the performances of six different network architectures, conceived according to state-of-the-art criteria, in various situations. We evaluate the impact of both different amount of training data and different data dimensionality (2D, 2.5D and 3D) on the final results. We show results on both MRI and transcranial US volumes depicting respectively 26 regions of the basal ganglia and the midbrain.

Paper Structure

This paper contains 14 sections, 6 equations, 9 figures, 2 tables.

Figures (9)

  • Figure 1: Example of MRI and ultrasound slices (left) and their respective segmentations (right) as estimated by Hough-CNN. Anatomies shown include midbrain in US (red) and in MRI (yellow). Further, in upper half of MRI slice: hippocampus (pink), thalamus (green), red nucleus (red), substantia nigra (green/red stripes within midbrain) and amygdala (cyan)
  • Figure 2: Schematic representation in 2D of the Hough-CNN segmentation approach. a) The volume is interpreted patch-wise and classified using the CNN. b) Every pixel of the foreground (red) casts one or multiple votes in order to localise the anatomy centroid. c) The votes accumulate in a vote-map, represented here in jet colormap, and the object centroid is found at the location of maximum vote accumulation. d) All the votes that accumulated close to the detected anatomy centroid contribute to the final contour by projecting a binary segmentation patch (here shown in red and white to indicate foreground and background respectively) at the location they were cast from. e) A contour confidence map is constructed by accumulating all the contributions associated to the votes. f) The resulting contour, depicted in purple, is retrieved by thresholding the confidence map.
  • Figure 3: Visual comparison of semantic segmentation results (top) and Hough-CNN results (bottom) on the same ultrasound data using the best-performing CNN. Red areas represent ground truth annotation. Red contours represent segmentation outputs. Best viewed in digital format.
  • Figure 4: Visual comparison of semantic segmentation results (top two rows) and Hough-CNN results (bottom two rows) on same MRI volumes using the same trained CNN. Coloured areas represent ground truth annotation. Coloured contours represent segmentation outputs. Best viewed in digital format.
  • Figure 5: Midbrain segmentation results in 114 previously unseen TCUS volumes, using Hough-CNN with variations of architectures (single rows), patch dimensionalities (column blocks) and training set sizes (row blocks). The best result for each architecture (across the data dimensionalities) are highlighted by using bold typeface. The best results for each dimensionality (across the architectures) are underlined.
  • ...and 4 more figures