MSD: A Benchmark Dataset for Floor Plan Generation of Building Complexes
Casper van Engelenburg, Fatemeh Mostafavi, Emanuel Kuhn, Yuntae Jeon, Michael Franzen, Matthias Standfest, Jan van Gemert, Seyran Khademi
TL;DR
This work introduces MSD, a large-scale European floor-plan dataset that targets realistic, multi-apartment building complexes to benchmark floor-plan generation. It formalizes a multi-modal generation task conditioned on building structure and a zoning graph, and provides both image and graph representations along with structural annotations. Two baselines, Modified HouseDiffusion (MHD) and Graph-informed U-Net (UN), expose substantial gaps between current state-of-the-art methods and the complexity of MSD, highlighting the need for graph- and boundary-aware approaches that can handle irregular geometries and inter-apartment connectivity. MSD, with its rich annotations and diverse topologies, enables rigorous evaluation via MIoU and graph compatibility, and is poised to drive future research in realistic floor-plan understanding and generation across European-building contexts.
Abstract
Diverse and realistic floor plan data are essential for the development of useful computer-aided methods in architectural design. Today's large-scale floor plan datasets predominantly feature simple floor plan layouts, typically representing single-apartment dwellings only. To compensate for the mismatch between current datasets and the real world, we develop \textbf{Modified Swiss Dwellings} (MSD) -- the first large-scale floor plan dataset that contains a significant share of layouts of multi-apartment dwellings. MSD features over 5.3K floor plans of medium- to large-scale building complexes, covering over 18.9K distinct apartments. We validate that existing approaches for floor plan generation, while effective in simpler scenarios, cannot yet seamlessly address the challenges posed by MSD. Our benchmark calls for new research in floor plan machine understanding. Code and data are open.
