MMLSv2: A Multimodal Dataset for Martian Landslide Detection in Remote Sensing Imagery
Sidike Paheding, Abel Reyes-Angulo, Leo Thomas Ramos, Angel D. Sappa, Rajaneesh A., Hiral P. B., Sajin Kumar K. S., Thomas Oommen
TL;DR
MMLSv2 tackles the scarcity and generalization gap in Martian landslide segmentation by introducing a seven-channel multimodal dataset (RGB, DEM, slope, thermal inertia, grayscale) across 664 images plus an isolated 276-image test set for spatial generalization. The authors implement a careful data fusion and labeling workflow, co-registration, and a patch-based, spatially independent partitioning strategy to provide robust, reproducible benchmarks. Across multiple segmentation architectures, the baseline performance is competitive ($mIoU$ around $0.81$–$0.83$ on the standard split) but drops on the isolated test set, highlighting regional generalization challenges; adding modalities consistently improves performance, with the full seven-band configuration achieving the strongest delineation of landslide boundaries. This dataset and evaluation protocol advance reproducible research in planetary geomorphology by enabling robust, multimodal landslide mapping and fair assessment of generalization under distribution shifts.
Abstract
We present MMLSv2, a dataset for landslide segmentation on Martian surfaces. MMLSv2 consists of multimodal imagery with seven bands: RGB, digital elevation model, slope, thermal inertia, and grayscale channels. MMLSv2 comprises 664 images distributed across training, validation, and test splits. In addition, an isolated test set of 276 images from a geographically disjoint region from the base dataset is released to evaluate spatial generalization. Experiments conducted with multiple segmentation models show that the dataset supports stable training and achieves competitive performance, while still posing challenges in fragmented, elongated, and small-scale landslide regions. Evaluation on the isolated test set leads to a noticeable performance drop, indicating increased difficulty and highlighting its value for assessing model robustness and generalization beyond standard in-distribution settings. Dataset will be available at: https://github.com/MAIN-Lab/MMLS_v2
