FOS: A Large-Scale Temporal Graph Benchmark for Scientific Interdisciplinary Link Prediction
Kiyan Rezaee, Morteza Ziabakhsh, Niloofar Nikfarjam, Mohammad M. Ghassemi, Yazdan Rezaee Jouryabi, Sadegh Eskandari, Reza Lashgari
TL;DR
FOS introduces a large-scale, time-aware benchmark for scientific interdisciplinarity by modeling yearly co-occurrence graphs of 65,027 fields across 19 domains, with semantic node embeddings to capture field meaning. Forecasting is formulated as a temporal link-prediction task to identify first-time field pairings, and a reproducible pipeline with multiple negative-sampling regimes is provided to evaluate state-of-the-art temporal GNNs. Through comprehensive experiments on the Art+Business subset, the study shows that no single model dominates across all regimes and that long historical context and rich semantic features, especially description embeddings, are crucial for predicting novel interdisciplinary links. The results align high-scoring predictions with later real-world publications, underscoring FOS's practical utility for surfacing emerging scientific directions; the dataset, splits, and code are released to advance research in forecasting scientific frontiers.
Abstract
Interdisciplinary scientific breakthroughs mostly emerge unexpectedly, and forecasting the formation of novel research fields remains a major challenge. We introduce FOS (Future Of Science), a comprehensive time-aware graph-based benchmark that reconstructs annual co-occurrence graphs of 65,027 research sub-fields (spanning 19 general domains) over the period 1827-2024. In these graphs, edges denote the co-occurrence of two fields in a single publication and are timestamped with the corresponding publication year. Nodes are enriched with semantic embeddings, and edges are characterized by temporal and topological descriptors. We formulate the prediction of new field-pair linkages as a temporal link-prediction task, emphasizing the "first-time" connections that signify pioneering interdisciplinary directions. Through extensive experiments, we evaluate a suite of state-of-the-art temporal graph architectures under multiple negative-sampling regimes and show that (i) embedding long-form textual descriptions of fields significantly boosts prediction accuracy, and (ii) distinct model classes excel under different evaluation settings. Case analyses show that top-ranked link predictions on FOS align with field pairings that emerge in subsequent years of academic publications. We publicly release FOS, along with its temporal data splits and evaluation code, to establish a reproducible benchmark for advancing research in predicting scientific frontiers.
