FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
Ulas Gunes, Matias Turkulainen, Xuqian Ren, Arno Solin, Juho Kannala, Esa Rahtu
TL;DR
FIORD tackles the limitations of narrow FoV datasets by introducing a high-resolution 360° fisheye dataset captured with dual $200^{\circ}$ lenses, paired with dense $Faro\ Focus\ 3D$ LiDAR ground truth for accurate geometry benchmarking. The dataset comprises ten scenes (five indoor, five outdoor), each with SfM-sparse clouds from COLMAP and dense LiDAR scans, plus rectified images for compatibility with Gaussian Splatting and Nerfacto baselines. A careful calibration, data collection, and alignment pipeline (including CloudCompare-based registration and image rectification) enables robust evaluation of reconstruction and novel view synthesis under challenging conditions such as occlusions and reflections. Baseline experiments show that Gaussian Splatting often yields stronger quantitative metrics than Nerfacto on fisheye data, and that incorporating dense LiDAR data for initialization can improve rendering quality, highlighting FIORD’s potential as a versatile benchmark for future wide-FOV 3D reconstruction methods.
Abstract
The development of large-scale 3D scene reconstruction and novel view synthesis methods mostly rely on datasets comprising perspective images with narrow fields of view (FoV). While effective for small-scale scenes, these datasets require large image sets and extensive structure-from-motion (SfM) processing, limiting scalability. To address this, we introduce a fisheye image dataset tailored for scene reconstruction tasks. Using dual 200-degree fisheye lenses, our dataset provides full 360-degree coverage of 5 indoor and 5 outdoor scenes. Each scene has sparse SfM point clouds and precise LIDAR-derived dense point clouds that can be used as geometric ground-truth, enabling robust benchmarking under challenging conditions such as occlusions and reflections. While the baseline experiments focus on vanilla Gaussian Splatting and NeRF based Nerfacto methods, the dataset supports diverse approaches for scene reconstruction, novel view synthesis, and image-based rendering.
