Table of Contents
Fetching ...

Deep Learning, Machine Learning -- Digital Signal and Image Processing: From Theory to Application

Weiche Hsieh, Ziqian Bi, Junyu Liu, Benji Peng, Sen Zhang, Xuanhe Pan, Jiawei Xu, Jinlang Wang, Keyu Chen, Caitlyn Heqi Yin, Pohsun Feng, Yizhu Wen, Tianyang Wang, Ming Li, Jintao Ren, Xinyuan Song, Qian Niu, Silin Chen, Ming Liu

TL;DR

<3-5 sentence high-level summary>This paper surveys core digital signal and image processing techniques with emphasis on classic transforms (DTFT, DFT/FFT, Z-transform) and their application to filtering, sampling-rate conversion, and multirate systems, then extends these concepts to digital image processing (acquisition, quantization, color spaces, spatial/frequency-domain enhancement, restoration, segmentation, and edge detection). It presents practical Python-based examples for plotting signals, computing transforms, designing FIR/IIR filters, and performing image processing tasks such as histogram equalization, thresholding, and frequency-domain filtering. The work highlights concrete methods for analyzing and improving signals and images, including stability/causality considerations, polyphase multirate structures, and standard restoration techniques (inverse/Wiener/regularization), underscoring how these tools support real-time, AI-assisted computer vision and multimedia applications. Overall, it provides a practical, theory-grounded toolkit for engineers and researchers to implement robust DSP and DIP pipelines using Python, with clear pathways from fundamental theory to application.

Abstract

Digital Signal Processing (DSP) and Digital Image Processing (DIP) with Machine Learning (ML) and Deep Learning (DL) are popular research areas in Computer Vision and related fields. We highlight transformative applications in image enhancement, filtering techniques, and pattern recognition. By integrating frameworks like the Discrete Fourier Transform (DFT), Z-Transform, and Fourier Transform methods, we enable robust data manipulation and feature extraction essential for AI-driven tasks. Using Python, we implement algorithms that optimize real-time data processing, forming a foundation for scalable, high-performance solutions in computer vision. This work illustrates the potential of ML and DL to advance DSP and DIP methodologies, contributing to artificial intelligence, automated feature extraction, and applications across diverse domains.

Deep Learning, Machine Learning -- Digital Signal and Image Processing: From Theory to Application

TL;DR

<3-5 sentence high-level summary>This paper surveys core digital signal and image processing techniques with emphasis on classic transforms (DTFT, DFT/FFT, Z-transform) and their application to filtering, sampling-rate conversion, and multirate systems, then extends these concepts to digital image processing (acquisition, quantization, color spaces, spatial/frequency-domain enhancement, restoration, segmentation, and edge detection). It presents practical Python-based examples for plotting signals, computing transforms, designing FIR/IIR filters, and performing image processing tasks such as histogram equalization, thresholding, and frequency-domain filtering. The work highlights concrete methods for analyzing and improving signals and images, including stability/causality considerations, polyphase multirate structures, and standard restoration techniques (inverse/Wiener/regularization), underscoring how these tools support real-time, AI-assisted computer vision and multimedia applications. Overall, it provides a practical, theory-grounded toolkit for engineers and researchers to implement robust DSP and DIP pipelines using Python, with clear pathways from fundamental theory to application.

Abstract

Digital Signal Processing (DSP) and Digital Image Processing (DIP) with Machine Learning (ML) and Deep Learning (DL) are popular research areas in Computer Vision and related fields. We highlight transformative applications in image enhancement, filtering techniques, and pattern recognition. By integrating frameworks like the Discrete Fourier Transform (DFT), Z-Transform, and Fourier Transform methods, we enable robust data manipulation and feature extraction essential for AI-driven tasks. Using Python, we implement algorithms that optimize real-time data processing, forming a foundation for scalable, high-performance solutions in computer vision. This work illustrates the potential of ML and DL to advance DSP and DIP methodologies, contributing to artificial intelligence, automated feature extraction, and applications across diverse domains.

Paper Structure

This paper contains 624 sections, 139 equations.