GeoScaler: Geometry and Rendering-Aware Downsampling of 3D Mesh Textures
Sai Karthikey Pentapati, Anshul Rai, Arkady Ten, Chaitanya Atluru, Alan Bovik
TL;DR
GeoScaler tackles the problem of downsampling high-resolution texture maps for 3D meshes by making the downsampling geometry- and UV-aware and optimizing for rendered view fidelity rather than UV-space fidelity. It introduces GeoCoding and UVWarper modules within a DNN that encodes geometry, fuses it with texture information, and locally warps the UV layout, all guided by a differentiable rendering loss. The approach yields sizable perceptual improvements (PSNR/SSIM) over traditional resampling methods at 4x and 8x downsampling across multiple real-world datasets, while enabling more memory-efficient rendering suitable for devices with limited budgets. While effective, the method depends on high-resolution input textures and incurs additional computation time, suggesting future work in faster per-mesh optimization and extending the framework to other texture-like maps.
Abstract
High-resolution texture maps are necessary for representing real-world objects accurately with 3D meshes. The large sizes of textures can bottleneck the real-time rendering of high-quality virtual 3D scenes on devices having low computational budgets and limited memory. Downsampling the texture maps directly addresses the issue, albeit at the cost of visual fidelity. Traditionally, downsampling of texture maps is performed using methods like bicubic interpolation and the Lanczos algorithm. These methods ignore the geometric layout of the mesh and its UV parametrization and also do not account for the rendering process used to obtain the final visualization that the users will experience. Towards filling these gaps, we introduce GeoScaler, which is a method of downsampling texture maps of 3D meshes while incorporating geometric cues, and by maximizing the visual fidelity of the rendered views of the textured meshes. We show that the textures generated by GeoScaler deliver significantly better quality rendered images compared to those generated by traditional downsampling methods
