Seeing through Satellite Images at Street Views
Ming Qian, Bin Tan, Qiuyu Wang, Xianwei Zheng, Hanjiang Xiong, Gui-Song Xia, Yujun Shen, Nan Xue
TL;DR
Sat2Density++ tackles SatStreet-view synthesis by learning an illumination-adaptive neural radiance field conditioned on a satellite image, using a tri-plane 3D representation and a dedicated sky-illumination pathway to render photorealistic street-view panoramas and videos. The method jointly learns geometry and appearance, with a sky branch, histogram-based illumination features, and adversarial and reconstruction losses to ensure multi-view consistency and fidelity to the satellite input. It demonstrates state-of-the-art performance on suburban CVUSA/CVACT and urban VIGOR datasets, showing improved video quality, depth-like cues, and illumination controllability, while generalizing to unseen locations (Seattle) without 3D annotations. The approach enables practical applications in navigation, urban planning, and virtual environment generation by enabling illumination-controlled, satellite-ground aligned street-view synthesis from a single satellite image and camera trajectory.
Abstract
This paper studies the task of SatStreet-view synthesis, which aims to render photorealistic street-view panorama images and videos given any satellite image and specified camera positions or trajectories. We formulate to learn neural radiance field from paired images captured from satellite and street viewpoints, which comes to be a challenging learning problem due to the sparse-view natural and the extremely-large viewpoint changes between satellite and street-view images. We tackle the challenges based on a task-specific observation that street-view specific elements, including the sky and illumination effects are only visible in street-view panoramas, and present a novel approach Sat2Density++ to accomplish the goal of photo-realistic street-view panoramas rendering by modeling these street-view specific in neural networks. In the experiments, our method is testified on both urban and suburban scene datasets, demonstrating that Sat2Density++ is capable of rendering photorealistic street-view panoramas that are consistent across multiple views and faithful to the satellite image.
