Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis

Myeongseok Nam; Wongi Park; Minsol Kim; Hyejin Hur; Soomok Lee

Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis

Myeongseok Nam, Wongi Park, Minsol Kim, Hyejin Hur, Soomok Lee

TL;DR

Veta-GS addresses the challenge of robust thermal infrared novel-view synthesis by introducing a view-dependent deformation field that uses camera pose $x$ and view direction $v$ to modulate 3D Gaussian primitives, along with a Thermal Feature Extractor (TFE) and a MonoSSIM loss to capture appearance, edge, and frequency information. A frustum-based masking strategy confines deformation to Gaussians inside the camera frustum, accelerating training. Evaluated on TI-NSD, Veta-GS outperforms state-of-the-art methods across indoor, outdoor, and UAV scenes, demonstrating improved PSNR, SSIM, and LPIPS while reducing artifacts such as floaters and blur. The approach combines explicit 3D Gaussian splatting with view-conditioned deformation and multi-branch perceptual losses, enabling robust, real-time-friendly thermal NVIS. Future work suggests extending to dynamic TIR scenes and further optimizing the computational cost of the Thermal Feature Extractor.

Abstract

Recently, 3D Gaussian Splatting (3D-GS) based on Thermal Infrared (TIR) imaging has gained attention in novel-view synthesis, showing real-time rendering. However, novel-view synthesis with thermal infrared images suffers from transmission effects, emissivity, and low resolution, leading to floaters and blur effects in rendered images. To address these problems, we introduce Veta-GS, which leverages a view-dependent deformation field and a Thermal Feature Extractor (TFE) to precisely capture subtle thermal variations and maintain robustness. Specifically, we design view-dependent deformation field that leverages camera position and viewing direction, which capture thermal variations. Furthermore, we introduce the Thermal Feature Extractor (TFE) and MonoSSIM loss, which consider appearance, edge, and frequency to maintain robustness. Extensive experiments on the TI-NSD benchmark show that our method achieves better performance over existing methods.

Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis

TL;DR

Veta-GS addresses the challenge of robust thermal infrared novel-view synthesis by introducing a view-dependent deformation field that uses camera pose

and view direction

to modulate 3D Gaussian primitives, along with a Thermal Feature Extractor (TFE) and a MonoSSIM loss to capture appearance, edge, and frequency information. A frustum-based masking strategy confines deformation to Gaussians inside the camera frustum, accelerating training. Evaluated on TI-NSD, Veta-GS outperforms state-of-the-art methods across indoor, outdoor, and UAV scenes, demonstrating improved PSNR, SSIM, and LPIPS while reducing artifacts such as floaters and blur. The approach combines explicit 3D Gaussian splatting with view-conditioned deformation and multi-branch perceptual losses, enabling robust, real-time-friendly thermal NVIS. Future work suggests extending to dynamic TIR scenes and further optimizing the computational cost of the Thermal Feature Extractor.

Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis

TL;DR

Abstract

Veta-GS: View-dependent deformable 3D Gaussian Splatting for thermal infrared Novel-view Synthesis

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)