An Interpretable X-ray Style Transfer via Trainable Local Laplacian Filter
Dominik Eckert, Ludwig Ritschl, Christopher Syben, Christian Hümmer, Julia Wicklein, Marcel Beister, Steffen Kappler, Sebastian Stober
TL;DR
This work tackles automatic, interpretable X-ray style transfer by introducing a trainable Local Laplacian Filter (LLF). The method replaces the fixed, three-parameter remap with a learnable multimodal mapping $m(\cdot)$ and adds a trainable normalization to better match target styles while preserving interpretability through monotonicity of the remap. On mammographic data, the trainable LLF achieves a peak $SSIM$ of $0.944$ and a low $MSE$ around $0.0105$, outperforming the baseline $LLF$ approach ($SSIM \approx 0.817$, $MSE \approx 0.0738$). The approach remains differentiable and interpretable, enabling integration as an operator in neural networks and offering potential extensions to other imaging modalities with style-aware metrics.
Abstract
Radiologists have preferred visual impressions or 'styles' of X-ray images that are manually adjusted to their needs to support their diagnostic performance. In this work, we propose an automatic and interpretable X-ray style transfer by introducing a trainable version of the Local Laplacian Filter (LLF). From the shape of the LLF's optimized remap function, the characteristics of the style transfer can be inferred and reliability of the algorithm can be ensured. Moreover, we enable the LLF to capture complex X-ray style features by replacing the remap function with a Multi-Layer Perceptron (MLP) and adding a trainable normalization layer. We demonstrate the effectiveness of the proposed method by transforming unprocessed mammographic X-ray images into images that match the style of target mammograms and achieve a Structural Similarity Index (SSIM) of 0.94 compared to 0.82 of the baseline LLF style transfer method from Aubry et al.
