Enhancing Information Retrieval in Digital Libraries through Unit Harmonisation in Scholarly Knowledge Graphs
Golsa Heidari, Markus Stocker, Sören Auer
TL;DR
The paper tackles the problem of cross-study, content-level retrieval in digital libraries by introducing a unit-aware faceted search framework built on the Open Research Knowledge Graph (ORKG). It leverages the QUDT ontology and UCUM-based unit conversions to semantically align measured data across heterogeneous units, enabling dynamic, unit-driven filtering and comparison. The approach is implemented within ORKG, demonstrates real-world examples (e.g., sea level projections), and supports federation with remote knowledge graphs to broaden coverage. This work advances digital libraries toward intelligent, quantitative, and cross-domain scholarly discovery by transforming measured data into harmonized, searchable facets.
Abstract
Scientists have always used the studies and research of other researchers to achieve new objectives and perspectives. In particular, employing and operating the measured data in previous studies is so practical. Searching the content of other scientists' articles is a challenge that researchers have always struggled with. Nowadays, the use of knowledge graphs as a semantic database has helped a lot in saving and retrieving scholarly knowledge. Such technologies are crucial to upgrading traditional search systems to smart knowledge retrieval, which is crucial to getting the most relevant answers for a user query, especially in information and knowledge management. However, in most cases, only the metadata of a paper is searchable, and it is still cumbersome for scientists to have access to the content of the papers. In this paper, we present a novel method of faceted search \emph{structured content} for comparing and filtering measured data in scholarly knowledge graphs while different units of measurement are used in different studies. This search system proposes applicable units as facets to the user and would dynamically integrate content from further remote knowledge graphs to materialize the scholarly knowledge graph and achieve a higher order of exploration usability on scholarly content, which can be filtered to better satisfy the user's information needs. The state of the art is that, by using our faceted search system, users can not only search the contents of scientific articles, but also compare and filter heterogeneous data.
