Resource-efficient medical image classification for edge devices
Mahsa Lavaei, Zahra Abadi, Salar Beigzad, Alireza Maleki
TL;DR
The paper tackles the challenge of deploying medical image classifiers on resource-constrained edge devices by combining quantization-aware training (QAT) with Parameterized Clipping Activation (PACT) and Saliency-Guided Training (SGT) to maintain diagnostic accuracy and robust interpretability. Using the Kvasir GI endoscopy dataset, it demonstrates substantial reductions in model size and inference cost while preserving clinically acceptable accuracy and stable saliency maps. The approach shows improvements over a saliency-guided baseline and validates the practicality of edge deployment for real-time AI-based GI diagnostics in remote settings. Overall, it offers a practical, interpretable pathway for deploying efficient AI-assisted healthcare tools on low-resource hardware.
Abstract
Medical image classification is a critical task in healthcare, enabling accurate and timely diagnosis. However, deploying deep learning models on resource-constrained edge devices presents significant challenges due to computational and memory limitations. This research investigates a resource-efficient approach to medical image classification by employing model quantization techniques. Quantization reduces the precision of model parameters and activations, significantly lowering computational overhead and memory requirements without sacrificing classification accuracy. The study focuses on the optimization of quantization-aware training (QAT) and post-training quantization (PTQ) methods tailored for edge devices, analyzing their impact on model performance across medical imaging datasets. Experimental results demonstrate that quantized models achieve substantial reductions in model size and inference latency, enabling real-time processing on edge hardware while maintaining clinically acceptable diagnostic accuracy. This work provides a practical pathway for deploying AI-driven medical diagnostics in remote and resource-limited settings, enhancing the accessibility and scalability of healthcare technologies.
