HyTAS: A Hyperspectral Image Transformer Architecture Search Benchmark and Analysis
Fangqin Zhou, Mert Kilickaya, Joaquin Vanschoren, Ran Piao
TL;DR
HyTAS introduces the first hyperspectral transformer architecture search benchmark, enabling systematic evaluation of training-free proxies for HSI classification. The authors construct a search space of 2000 hyperspectral transformer subnetworks and benchmark 12 proxies across 5 datasets, demonstrating that proxies can often identify architectures that surpass a human-crafted baseline while tending toward larger models. They propose ZiCo++ as an enhanced proxy with superior correlation to true performance and analyze factors influencing both model performance and proxy scores. Additionally, HyTAS shows proxies can complement other search methods, notably by enabling predictive modeling (e.g., Random Forest) to forecast network performance with low training cost, guiding more efficient TAS in hyperspectral domains.
Abstract
Hyperspectral Imaging (HSI) plays an increasingly critical role in precise vision tasks within remote sensing, capturing a wide spectrum of visual data. Transformer architectures have significantly enhanced HSI task performance, while advancements in Transformer Architecture Search (TAS) have improved model discovery. To harness these advancements for HSI classification, we make the following contributions: i) We propose HyTAS, the first benchmark on transformer architecture search for Hyperspectral imaging, ii) We comprehensively evaluate 12 different methods to identify the optimal transformer over 5 different datasets, iii) We perform an extensive factor analysis on the Hyperspectral transformer search performance, greatly motivating future research in this direction. All benchmark materials are available at HyTAS.
