Attention-Aware Laparoscopic Image Desmoking Network with Lightness Embedding and Hybrid Guided Embedding
Ziteng Liu, Jiahua Zhu, Bainan Liu, Hao Liu, Wenpeng Gao, Yili Fu
TL;DR
This work tackles the problem of surgical smoke degrading laparoscopic image quality by proposing L-SAHGNet, a two-stage framework that first estimates smoke distribution with a Smoke Attention Estimator and then reconstructs a desmoked image through Hybrid Guided Embedding guided by both the original image and a predicted smoke mask. The design integrates a Lightness Embedding Module to inject channel-aware priors and a Restricted Field Transformation to stabilize learning of weight terms, with an explicit reconstruction equation $J(x)= I(x) + K(x)I(x) + B(x)I_s(x) + \varepsilon$, where $K(x)=\frac{1}{t(x)}-1$ and $B(x)=-\frac{1}{t(x)}$. Quantitative results show a $2.79\%$ PSNR improvement over state-of-the-art methods and a $38.2\%$ reduction in runtime, achieving real-time performance on real MIS data, while ablation studies confirm the contributions of LE, SPE, and RFT. The approach offers a transparent, ASM-consistent framework that balances smoke removal quality with computational efficiency, making it well-suited for real-time surgical visualization and downstream image analysis. The work provides practical impact by enabling clearer intraoperative views and robust downstream processing in MIS settings.
Abstract
This paper presents a novel method of smoke removal from the laparoscopic images. Due to the heterogeneous nature of surgical smoke, a two-stage network is proposed to estimate the smoke distribution and reconstruct a clear, smoke-free surgical scene. The utilization of the lightness channel plays a pivotal role in providing vital information pertaining to smoke density. The reconstruction of smoke-free image is guided by a hybrid embedding, which combines the estimated smoke mask with the initial image. Experimental results demonstrate that the proposed method boasts a Peak Signal to Noise Ratio that is $2.79\%$ higher than the state-of-the-art methods, while also exhibits a remarkable $38.2\%$ reduction in run-time. Overall, the proposed method offers comparable or even superior performance in terms of both smoke removal quality and computational efficiency when compared to existing state-of-the-art methods. This work will be publicly available on http://homepage.hit.edu.cn/wpgao
