Navigating Taxonomic Expansions of Entity Sets Driven by Knowledge Bases
Pietro Cofone, Giovanni Amendola, Marco Manna, Aldo Ricioppo
TL;DR
The paper extends entity set expansion by introducing taxonomic expansion graphs derived from knowledge bases to reveal hierarchical generalizations beyond linear expansion. It formalizes three local navigation primitives (SIM, PREC, INC) and three core tasks (can, core, ess) within the NCF/SKB framework, enabling efficient local exploration without full graph materialization. A rigorous complexity landscape is developed under broad, medium, and handy assumptions, showing intractability in broad settings but tractability in the handy setting, with concrete reductions and algorithms linking canonical/core/essential expansions to navigation tasks. These results provide a principled foundation for practical, real-time navigation of taxonomic expansion graphs and point to future work on approximation, alternative explanation languages, and richer summaries.
Abstract
Recognizing similarities among entities is central to both human cognition and computational intelligence. Within this broader landscape, Entity Set Expansion is one prominent task aimed at taking an initial set of (tuples of) entities and identifying additional ones that share relevant semantic properties with the former -- potentially repeating the process to form increasingly broader sets. However, this ``linear'' approach does not unveil the richer ``taxonomic'' structures present in knowledge resources. A recent logic-based framework introduces the notion of an expansion graph: a rooted directed acyclic graph where each node represents a semantic generalization labeled by a logical formula, and edges encode strict semantic inclusion. This structure supports taxonomic expansions of entity sets driven by knowledge bases. Yet, the potentially large size of such graphs may make full materialization impractical in real-world scenarios. To overcome this, we formalize reasoning tasks that check whether two tuples belong to comparable, incomparable, or the same nodes in the graph. Our results show that, under realistic assumptions -- such as bounding the input or limiting entity descriptions -- these tasks can be implemented efficiently. This enables local, incremental navigation of expansion graphs, supporting practical applications without requiring full graph construction.
