CuNeRF: Cube-Based Neural Radiance Field for Zero-Shot Medical Image Arbitrary-Scale Super Resolution
Zixuan Chen, Jian-Huang Lai, Lingxiao Yang, Xiaohua Xie
TL;DR
CuNeRF tackles medical image arbitrary-scale super-resolution without HR references by learning a continuous coordinate-intensity representation from a single LR volume. It introduces cube-based sampling, isotropic volume rendering, and cube-based hierarchical rendering to address hole artifacts inherent to NeRF in medical data, enabling free-viewpoint and arbitrary-scale slice synthesis. The approach yields state-of-the-art or competitive results on MRI and CT across 3D MISR and volumetric MISR tasks, with robust ablations affirming each component's contribution. By eliminating the need for LR-HR training pairs, CuNeRF offers a flexible, zero-shot solution with practical clinical utility, and code is publicly released.
Abstract
Medical image arbitrary-scale super-resolution (MIASSR) has recently gained widespread attention, aiming to super sample medical volumes at arbitrary scales via a single model. However, existing MIASSR methods face two major limitations: (i) reliance on high-resolution (HR) volumes and (ii) limited generalization ability, which restricts their application in various scenarios. To overcome these limitations, we propose Cube-based Neural Radiance Field (CuNeRF), a zero-shot MIASSR framework that can yield medical images at arbitrary scales and viewpoints in a continuous domain. Unlike existing MIASSR methods that fit the mapping between low-resolution (LR) and HR volumes, CuNeRF focuses on building a coordinate-intensity continuous representation from LR volumes without the need for HR references. This is achieved by the proposed differentiable modules: including cube-based sampling, isotropic volume rendering, and cube-based hierarchical rendering. Through extensive experiments on magnetic resource imaging (MRI) and computed tomography (CT) modalities, we demonstrate that CuNeRF outperforms state-of-the-art MIASSR methods. CuNeRF yields better visual verisimilitude and reduces aliasing artifacts at various upsampling factors. Moreover, our CuNeRF does not need any LR-HR training pairs, which is more flexible and easier to be used than others. Our code is released at https://github.com/NarcissusEx/CuNeRF.
