Distilling Knowledge for Designing Computational Imaging Systems
Leon Suarez-Rodriguez, Roman Jacome, Henry Arguello
TL;DR
The paper tackles the problem that end-to-end optimization of computational imaging systems is hampered by physical encoder constraints and gradient issues. It introduces a knowledge distillation (KD) framework in which a less-constrained, pretrained teacher guides a constrained student across three stages: teacher creation via relaxation, teacher optimization, and knowledge transfer through two losses that align encoders and decoders. Across MRI, SPC, and SD-CASSI, the KD approach yields higher reconstruction quality (up to ~1.53 dB PSNR gains) and improved encoder designs (lower condition numbers, reduced Gram matrix coherence, and better spectral sampling) compared to E2E baselines, with robustness to noise. The method is generalizable to other CI modalities and tasks, offering a practical alternative to traditional E2E design while maintaining feasible inference. The KD framework thus provides a principled, flexible pathway to design physically feasible, high-performance CI systems.
Abstract
Designing the physical encoder is crucial for accurate image reconstruction in computational imaging (CI) systems. Currently, these systems are designed via end-to-end (E2E) optimization, where the encoder is modeled as a neural network layer and is jointly optimized with the decoder. However, the performance of E2E optimization is significantly reduced by the physical constraints imposed on the encoder. Also, since the E2E learns the parameters of the encoder by backpropagating the reconstruction error, it does not promote optimal intermediate outputs and suffers from gradient vanishing. To address these limitations, we reinterpret the concept of knowledge distillation (KD) for designing a physically constrained CI system by transferring the knowledge of a pretrained, less-constrained CI system. Our approach involves three steps: (1) Given the original CI system (student), a teacher system is created by relaxing the constraints on the student's encoder. (2) The teacher is optimized to solve a less-constrained version of the student's problem. (3) The teacher guides the training of the student through two proposed knowledge transfer functions, targeting both the encoder and the decoder feature space. The proposed method can be employed to any imaging modality since the relaxation scheme and the loss functions can be adapted according to the physical acquisition and the employed decoder. This approach was validated on three representative CI modalities: magnetic resonance, single-pixel, and compressive spectral imaging. Simulations show that a teacher system with an encoder that has a structure similar to that of the student encoder provides effective guidance. Our approach achieves significantly improved reconstruction performance and encoder design, outperforming both E2E optimization and traditional non-data-driven encoder designs.
