HoughToRadon Transform: New Neural Network Layer for Features Improvement in Projection Space

Alexandra Zhabitskaya; Alexander Sheshkus; Vladimir L. Arlazarov

TL;DR

The paper addresses inefficiencies in Hough Transform (HT)-based neural networks for segmentation by introducing the HoughToRadon Transform (HRT), a fixed layer that converts the Hough space ($s,t$) to a Radon-like space ($\rho,\varphi$); the RadonToHough Transform (RHT) performs the conversion back. Two parameters, $n$ and $scaleX$, control the number of angles and the width of the transformed feature map, respectively. Together they allow a significant reduction in intermediate feature-map size while preserving or improving segmentation accuracy. On the MIDV-500 dataset, MIoU reaches up to $97.7\%$ with substantial time savings over prior HT-based methods. The approach is implemented inside the HoughEncoder architecture and shows that the inner convolutions can operate on smaller feature maps whose coordinates are linear in angle and shift, which accelerates training and inference. Overall, the work provides a practical, tunable method to accelerate HT-enabled neural networks for document segmentation and highlights the value of coordinate-space linearization in deep feature processing.

Abstract

In this paper, we introduce the HoughToRadon Transform layer, a novel layer designed to improve the speed of neural networks that incorporate the Hough Transform to solve semantic image segmentation problems. When it is placed after a Hough Transform layer, the "inner" convolutions receive modified feature maps with new beneficial properties, such as a smaller area of processed images and a parameter space that is linear in angle and shift. These properties are not present in the Hough Transform alone. Furthermore, the HoughToRadon Transform layer allows us to adjust the size of intermediate feature maps using two new parameters, and thus to balance the speed and quality of the resulting neural network. Our experiments on the open MIDV-500 dataset show that this new approach leads to time savings in document segmentation tasks and achieves state-of-the-art 97.7% accuracy, outperforming HoughEncoder, which has larger computational complexity.
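To illustrate why linearity in angle is not automatic in a slope-and-shift parameterization, the following minimal sketch converts a line given by an intercept and a pixel shift into normal form $(\rho,\varphi)$. The parameterization $y = s + \frac{t}{w-1}x$, the function name `slope_shift_to_normal`, and the image width are assumptions made for illustration only; they are not taken from the paper and may differ from the layer's actual definition.

```python
import math


def slope_shift_to_normal(s, t, width):
    """Convert the line y = s + (t / (width - 1)) * x to normal form.

    Assumed parameterization (hypothetical, for illustration):
      s: intercept of the line at x = 0, in pixels
      t: vertical shift of the line across the image width, in pixels
    Returns (rho, phi) such that x*cos(phi) + y*sin(phi) = rho.
    """
    k = t / (width - 1)           # slope of the line
    theta = math.atan(k)          # direction angle of the line
    rho = s * math.cos(theta)     # signed distance from the origin
    phi = theta + math.pi / 2     # angle of the line's normal
    return rho, phi


# Equal steps in the shift t do not give equal steps in angle.
width = 101
angles = [math.degrees(math.atan(t / (width - 1))) for t in range(0, 101, 25)]
steps = [b - a for a, b in zip(angles, angles[1:])]

for t, angle in zip(range(0, 101, 25), angles):
    print(f"t = {t:3d}  ->  angle = {angle:6.2f} degrees")
print("angle steps:", [round(step, 2) for step in steps])
```

Under these assumptions the angle steps shrink as $t$ grows, so a grid that is uniform in $t$ is not uniform in angle. Resampling onto a grid that is uniform in $\varphi$ is one way to read the "linearity by angle" property described above.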

Paper Structure (9 sections, 5 equations, 4 figures, 2 tables)

Introduction
Hough Transform
HoughToRadon Transform
Experiments
Implementing in the HoughEncoder architecture
Dataset description
Evaluation metrics
Results
Conclusion and future work

Figures (4)

Figure 1: Two pairs of lines with the same (a, b) change correspond to different angles.
Figure 2: Visual interpretation of the new NN.
Figure 3: Examples of 'inner' convolution feature maps from FHT (left) and FHT + HoughToRadon transform (right) layers at a consistent scale.
Figure 4: Examples of input and output images of the NN with HoughToRadon transform.
