COTET: Cross-view Optimal Transport for Knowledge Graph Entity Typing
Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan
TL;DR
COTET tackles knowledge graph entity typing by jointly leveraging three perspectives of knowledge: fine-grained entity-type, coarse-grained entity-cluster, and type-cluster type relations. It introduces three modules—multi-view generation/encoding, cross-view optimal transport to align heterogeneous view embeddings, and a pooling-based neighbor aggregator with a Beta-distribution loss to reduce false negatives. Empirical results on FB15kET and YAGO43kET show state-of-the-art performance, especially in hard and sparse settings, validating the benefits of cross-view alignment and cluster information. The approach offers a practical, scalable path to more complete KG typing and opens avenues for inductive KGET and integration of textual context with structural cues.
Abstract
Knowledge graph entity typing (KGET) aims to infer missing entity type instances in knowledge graphs. Previous research has predominantly centered around leveraging contextual information associated with entities, which provides valuable clues for inference. However, they have long ignored the dual nature of information inherent in entities, encompassing both high-level coarse-grained cluster knowledge and fine-grained type knowledge. This paper introduces Cross-view Optimal Transport for knowledge graph Entity Typing (COTET), a method that effectively incorporates the information on how types are clustered into the representation of entities and types. COTET comprises three modules: i) Multi-view Generation and Encoder, which captures structured knowledge at different levels of granularity through entity-type, entity-cluster, and type-cluster-type perspectives; ii) Cross-view Optimal Transport, transporting view-specific embeddings to a unified space by minimizing the Wasserstein distance from a distributional alignment perspective; iii) Pooling-based Entity Typing Prediction, employing a mixture pooling mechanism to aggregate prediction scores from diverse neighbors of an entity. Additionally, we introduce a distribution-based loss function to mitigate the occurrence of false negatives during training. Extensive experiments demonstrate the effectiveness of COTET when compared to existing baselines.
