Table of Contents
Fetching ...

Geometric Relational Embeddings

Bo Xiong

TL;DR

This work introduces Geometric Relational Embeddings, a framework that maps relational data to geometric objects (points, balls, boxes, cones) and non-Euclidean manifolds to faithfully capture discrete, symbolic structures such as hierarchies, cycles, and logical constraints. It develops several coordinated models, including pseudo-Riemannian Graph Convolutional Networks (Q-GCN), UltraE (ultrahyperbolic KG embeddings), BoxEL for EL++ ontologies, and Hyperbolic-structured approaches for structured ML constraints (HMI), along with high-order extensions ShrinkE and FactE for hyper-relational and nested facts. The methods provide soundness guarantees (BoxEL) and scalable inference (Manhattan-like distances, diffeomorphic mappings) while achieving strong empirical performance on graph reconstruction, link prediction, ontology reasoning, and structured multi-label classification tasks. Collectively, the contributions offer a unified geometric toolkit to model complex relational phenomena—hierarchies, cycles, logical rules, and high-order relations—across knowledge graphs, ontologies, and structured ML settings, with demonstrated advantages in low-dimensional representations. The work lays groundwork for integrating symbolic knowledge with geometric embeddings, enabling more reliable reasoning and interpretability in KG-based AI systems and downstream applications in biology and knowledge management.

Abstract

Relational representation learning transforms relational data into continuous and low-dimensional vector representations. However, vector-based representations fall short in capturing crucial properties of relational data that are complex and symbolic. We propose geometric relational embeddings, a paradigm of relational embeddings that respect the underlying symbolic structures. Specifically, this dissertation introduces various geometric relational embedding models capable of capturing: 1) complex structured patterns like hierarchies and cycles in networks and knowledge graphs; 2) logical structures in ontologies and logical constraints applicable for constraining machine learning model outputs; and 3) high-order structures between entities and relations. Our results obtained from benchmark and real-world datasets demonstrate the efficacy of geometric relational embeddings in adeptly capturing these discrete, symbolic, and structured properties inherent in relational data.

Geometric Relational Embeddings

TL;DR

This work introduces Geometric Relational Embeddings, a framework that maps relational data to geometric objects (points, balls, boxes, cones) and non-Euclidean manifolds to faithfully capture discrete, symbolic structures such as hierarchies, cycles, and logical constraints. It develops several coordinated models, including pseudo-Riemannian Graph Convolutional Networks (Q-GCN), UltraE (ultrahyperbolic KG embeddings), BoxEL for EL++ ontologies, and Hyperbolic-structured approaches for structured ML constraints (HMI), along with high-order extensions ShrinkE and FactE for hyper-relational and nested facts. The methods provide soundness guarantees (BoxEL) and scalable inference (Manhattan-like distances, diffeomorphic mappings) while achieving strong empirical performance on graph reconstruction, link prediction, ontology reasoning, and structured multi-label classification tasks. Collectively, the contributions offer a unified geometric toolkit to model complex relational phenomena—hierarchies, cycles, logical rules, and high-order relations—across knowledge graphs, ontologies, and structured ML settings, with demonstrated advantages in low-dimensional representations. The work lays groundwork for integrating symbolic knowledge with geometric embeddings, enabling more reliable reasoning and interpretability in KG-based AI systems and downstream applications in biology and knowledge management.

Abstract

Relational representation learning transforms relational data into continuous and low-dimensional vector representations. However, vector-based representations fall short in capturing crucial properties of relational data that are complex and symbolic. We propose geometric relational embeddings, a paradigm of relational embeddings that respect the underlying symbolic structures. Specifically, this dissertation introduces various geometric relational embedding models capable of capturing: 1) complex structured patterns like hierarchies and cycles in networks and knowledge graphs; 2) logical structures in ontologies and logical constraints applicable for constraining machine learning model outputs; and 3) high-order structures between entities and relations. Our results obtained from benchmark and real-world datasets demonstrate the efficacy of geometric relational embeddings in adeptly capturing these discrete, symbolic, and structured properties inherent in relational data.
Paper Structure (114 sections, 63 theorems, 116 equations, 25 figures, 36 tables)

This paper contains 114 sections, 63 theorems, 116 equations, 25 figures, 36 tables.

Key Result

Theorem 1

For any point $\mathbf{x} \in \mathcal{Q}_{\beta}^{s, t}$, there exists a diffeomorphism $\psi: \mathcal{Q}_{\beta}^{s, t} \rightarrow \mathbb{S}_{1}^{t} \times \mathbb{R}^{s}$ that maps $\mathbf{x}$ into the product manifolds of an unit sphere and the Euclidean space.

Figures (25)

  • Figure 1.1: A schematic illustration of a symbolic knowledge base. It consists of a ABox describing the factual knowledge over entities and a TBox describing the conceptual knowledge over concepts.
  • Figure 1.2: A schematic illustration of the discrete properties in relational data. (a) Structural patterns include hierarchical and cyclic structures in graphs; (b) Relational patterns are logical rules/implication over relations; (c) A logical/set-theoretical structure described by logical or set operators (inclusion and exclusion). (d) A high-order structure described by multi-fold relations among multiple entities.
  • Figure 3.1: The different submanifolds of a four-dimensional pseudo-hyperboloid of curvature $-1$ with two time dimensions. By fixing one time dimension $x_0$, the induced submanifolds include (a) an one-sheet hyperboloid, (b) the double cone, and (c) a two-sheet hyperboloid.
  • Figure 3.2: The histograms of sectional curvature for all used datasets.
  • Figure 3.3: The mAP of graph reconstruction with varying number of time dimensions.
  • ...and 20 more figures

Theorems & Definitions (114)

  • Definition 1: Directed edge-labeled graph hogan2021knowledge
  • Definition 2: Heterogeneous graph hogan2021knowledge
  • Definition 3: Property graph hogan2021knowledge
  • Definition 4: Homogeneous graph
  • Definition 5: Single-relational graph
  • Definition 6: Knowledge graph
  • Definition 7: Hyper-relational knowledge graph
  • Definition 8: Hyper-relational fact
  • Definition 9: Partial fact DBLP:conf/acl/GuanJGWC20
  • Definition 10: Qualifier monotonicity
  • ...and 104 more