A fuzzy loss for ontology classification

Simon Flügel; Martin Glauer; Till Mossakowski; Fabian Neuhaus

A fuzzy loss for ontology classification

Simon Flügel, Martin Glauer, Till Mossakowski, Fabian Neuhaus

TL;DR

The paper tackles the problem of ensuring logical consistency in deep learning for ontology classification by introducing a fuzzy loss that penalizes subsumption and disjointness violations. It defines the loss using product and Lukasiewicz t-norms, with a balanced variant to counteract class-imbalance, and demonstrates its application on the CHEBI ontology with CHEBI_100 data, leveraging PubChem pretraining and semi-supervised learning. Empirical results show the fuzzy loss markedly reduces consistency violations (by about two orders of magnitude) while preserving or only modestly affecting classification performance, and data from unlabelled sources further improves out-of-distribution consistency. The work situates its approach relative to semantic loss methods and discusses trade-offs due to hierarchical data imbalance, offering directions for extension to other ontology axioms and integration with broader neuro-symbolic frameworks. All mathematical notation used for the ontology constraints and loss terms is expressed with $...$ delimiters to maintain clarity and reproducibility.

Abstract

Deep learning models are often unaware of the inherent constraints of the task they are applied to. However, many downstream tasks require logical consistency. For ontology classification tasks, such constraints include subsumption and disjointness relations between classes. In order to increase the consistency of deep learning models, we propose a fuzzy loss that combines label-based loss with terms penalising subsumption- or disjointness-violations. Our evaluation on the ChEBI ontology shows that the fuzzy loss is able to decrease the number of consistency violations by several orders of magnitude without decreasing the classification performance. In addition, we use the fuzzy loss for unsupervised learning. We show that this can further improve consistency on data from a

A fuzzy loss for ontology classification

TL;DR

delimiters to maintain clarity and reproducibility.

Abstract

Paper Structure (12 sections, 14 equations, 2 figures, 7 tables)

This paper contains 12 sections, 14 equations, 2 figures, 7 tables.

Introduction
Related Work
Fuzzy Loss
Balanced implication loss
Experimental setup
Datasets
Loss function
Violation metrics
Results
Discussion
Conclusion
Result for individual runs

Figures (2)

Figure 1: Performance of the evaluated models regarding implication violations classification performance. In both figures, the standard deviation is indicated by a black line for each bar.
Figure 2: Value of the fuzzy loss variants $L_{prod}$, $L_{luka}$, $L_{Xu}$ and $L^B_{prod}$ with $k=2$ for a subsumption relation $A \sqsubseteq B$ with different values of $h_A(x)$ and $h_B(x)$. $L_{Xu}$ has been cut off at $L_{Xu} = 1$ since $\lim_{p_a\to0, p_b\to1} L_{Xu}(A \sqsubseteq B, p) = \infty$

A fuzzy loss for ontology classification

TL;DR

Abstract

A fuzzy loss for ontology classification

Authors

TL;DR

Abstract

Table of Contents

Figures (2)