Binarized Mamba-Transformer for Lightweight Quad Bayer HybridEVS Demosaicing
Shiyang Zhou, Haijin Zeng, Yunfan Lu, Tong Shao, Ke Tang, Yongyong Chen, Jie Liu, Jingyong Su
TL;DR
Quad Bayer HybridEVS demosaicing on mobile devices faces high computational costs when leveraging global dependency models. The authors introduce BMTNet, a lightweight binarized Mamba-Transformer that fuses a Bi-Mamba-Transformer core with a binarized global visual encoder, preserving Selective Scan in full precision to maintain accuracy while dramatically reducing parameters and FLOPs. The approach achieves PSNR improvements over other BNNs and competitive results against full-precision models across diverse datasets, enabling practical edge deployment for HybridEVS. This work broadens the applicability of binarized networks and space-models in vision tasks, offering a scalable solution for real-world demosaicing on resource-constrained devices.
Abstract
Quad Bayer demosaicing is the central challenge for enabling the widespread application of Hybrid Event-based Vision Sensors (HybridEVS). Although existing learning-based methods that leverage long-range dependency modeling have achieved promising results, their complexity severely limits deployment on mobile devices for real-world applications. To address these limitations, we propose a lightweight Mamba-based binary neural network designed for efficient and high-performing demosaicing of HybridEVS RAW images. First, to effectively capture both global and local dependencies, we introduce a hybrid Binarized Mamba-Transformer architecture that combines the strengths of the Mamba and Swin Transformer architectures. Next, to significantly reduce computational complexity, we propose a binarized Mamba (Bi-Mamba), which binarizes all projections while retaining the core Selective Scan in full precision. Bi-Mamba also incorporates additional global visual information to enhance global context and mitigate precision loss. We conduct quantitative and qualitative experiments to demonstrate the effectiveness of BMTNet in both performance and computational efficiency, providing a lightweight demosaicing solution suited for real-world edge devices. Our codes and models are available at https://github.com/Clausy9/BMTNet.
