MambaLiteSR: Image Super-Resolution with Low-Rank Mamba using Knowledge Distillation

Romina Aalishah; Mozhgan Navardi; Tinoosh Mohsenin

TL;DR

This work addresses the challenge of deploying image super-resolution on resource-constrained edge devices by proposing MambaLiteSR, a lightweight Vision Mamba-based model augmented with low-rank Mamba and knowledge distillation from a larger teacher. The method optimizes embedding dimension, employs a low-rank factorization to reduce computations, and trains a compact student model that closely matches a stronger teacher’s SR performance. Experimental results demonstrate a 15% parameter reduction with competitive PSNR/SSIM and up to 58% power savings, plus significant training energy reductions via low-rank design, validated on NVIDIA Jetson Orin Nano. The approach offers a practical pathway for real-time, energy-efficient SR on edge hardware while maintaining accuracy comparable to state-of-the-art edge models.
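To make the low-rank idea concrete, the sketch below counts parameters for a dense projection versus a rank-r factorization. It assumes the factorization replaces a dense weight matrix of shape (d_in, d_out) with the product of two thinner matrices of shapes (d_in, r) and (r, d_out). The function names and the dimensions used are hypothetical illustrations, not values taken from the paper.

```python
def dense_params(d_in: int, d_out: int) -> int:
    """Parameter count of a dense projection W with shape (d_in, d_out)."""
    return d_in * d_out


def low_rank_params(d_in: int, d_out: int, rank: int) -> int:
    """Parameter count when W is approximated as A @ B,
    with A of shape (d_in, rank) and B of shape (rank, d_out)."""
    return rank * (d_in + d_out)


def reduction(d_in: int, d_out: int, rank: int) -> float:
    """Fraction of parameters saved by the rank-`rank` factorization."""
    return 1.0 - low_rank_params(d_in, d_out, rank) / dense_params(d_in, d_out)


if __name__ == "__main__":
    # Illustrative sizes only (assumed, not from the paper).
    d_in, d_out, rank = 64, 128, 8
    print(dense_params(d_in, d_out))           # dense parameter count
    print(low_rank_params(d_in, d_out, rank))  # factorized parameter count
    print(reduction(d_in, d_out, rank))        # fraction saved
```

The factorization only saves parameters when the rank is smaller than d_in * d_out / (d_in + d_out); above that threshold the two thin matrices are larger than the dense one.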

Abstract

Generative Artificial Intelligence (AI) has gained significant attention in recent years, with applications across many industries. Among these, advanced vision models for image super-resolution are in high demand, particularly for deployment on edge devices where real-time processing is crucial. However, deploying such models on edge devices is challenging due to limited computing power and memory. In this paper, we present MambaLiteSR, a novel lightweight image Super-Resolution (SR) model built on the Vision Mamba architecture. It integrates State Space Blocks and a reconstruction module for efficient feature extraction. To improve efficiency without substantially affecting performance, MambaLiteSR employs knowledge distillation to transfer knowledge from a larger Mamba-based teacher model to a smaller student model whose hyperparameters are tuned. Through mathematical analysis of model parameters and their impact on PSNR, we identify key factors and adjust them accordingly. Our comprehensive evaluation shows that MambaLiteSR outperforms state-of-the-art edge SR methods in power consumption while maintaining competitive PSNR and SSIM scores across benchmark datasets. Low-rank approximation also reduces power usage during training. Moreover, MambaLiteSR reduces the parameter count with minimal performance loss, enabling efficient deployment of generative AI models on resource-constrained devices. Deployment on the embedded NVIDIA Jetson Orin Nano confirms that MambaLiteSR offers a superior balance of size, latency, and efficiency. Experiments show that MambaLiteSR achieves performance comparable to both the baseline and other edge models while using 15% fewer parameters. It also reduces power consumption by up to 58% compared to state-of-the-art SR edge models, all while maintaining low energy use during training.
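As a rough illustration of the two quantities the abstract relies on, the sketch below shows a generic output-level distillation loss and the standard PSNR formula. The abstract does not specify the paper's loss, so the L1 form, the `alpha` weighting, and all function names here are assumptions made for illustration. Images are represented as flat lists of pixel values in [0, 1].

```python
import math


def l1(a, b):
    """Mean absolute error between two equally sized pixel lists."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def distillation_loss(student, teacher, target, alpha=0.5):
    """Assumed form: weighted sum of the student's error against the
    ground-truth HR image and its error against the teacher's output."""
    return alpha * l1(student, target) + (1.0 - alpha) * l1(student, teacher)


def psnr(pred, target, max_val=1.0):
    """Peak signal-to-noise ratio in dB: 10 * log10(max_val^2 / MSE)."""
    mse = sum((x - y) ** 2 for x, y in zip(pred, target)) / len(pred)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_val ** 2 / mse)


if __name__ == "__main__":
    # Toy two-pixel "images" (hypothetical values).
    target = [0.0, 1.0]
    teacher = [0.1, 0.9]
    student = [0.2, 0.8]
    print(distillation_loss(student, teacher, target))
    print(psnr(student, target))
```

With `alpha=1.0` the loss reduces to ordinary supervised training against the ground truth; lowering `alpha` shifts weight toward matching the teacher.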
