Table of Contents
Fetching ...

Disjointness Violations in Wikidata

Ege Atacan Doğan, Peter F. Patel-Schneider

TL;DR

The current modeling of disjointness on Wikidata is analyzed, patterns that cause disjointness violations are identified, and how disjointness information could be better modeled and expanded in Wikidata in the future is discussed.

Abstract

Disjointness checks are among the most important constraint checks in a knowledge base and can be used to help detect and correct incorrect statements and internal contradictions. Wikidata is a very large, community-managed knowledge base. Because of both its size and construction, Wikidata contains many incorrect statements and internal contradictions. We analyze the current modeling of disjointness on Wikidata, identify patterns that cause these disjointness violations and categorize them. We use SPARQL queries to identify each ``culprit'' causing a disjointness violation and lay out formulas to identify and fix conflicting information. We finally discuss how disjointness information could be better modeled and expanded in Wikidata in the future.

Disjointness Violations in Wikidata

TL;DR

The current modeling of disjointness on Wikidata is analyzed, patterns that cause disjointness violations are identified, and how disjointness information could be better modeled and expanded in Wikidata in the future is discussed.

Abstract

Disjointness checks are among the most important constraint checks in a knowledge base and can be used to help detect and correct incorrect statements and internal contradictions. Wikidata is a very large, community-managed knowledge base. Because of both its size and construction, Wikidata contains many incorrect statements and internal contradictions. We analyze the current modeling of disjointness on Wikidata, identify patterns that cause these disjointness violations and categorize them. We use SPARQL queries to identify each ``culprit'' causing a disjointness violation and lay out formulas to identify and fix conflicting information. We finally discuss how disjointness information could be better modeled and expanded in Wikidata in the future.

Paper Structure

This paper contains 28 sections, 3 equations, 3 figures.

Figures (3)

  • Figure 1: Class Violations per Disjointness Statement (log scale)
  • Figure 2: Instance Violations per Disjointness Statement (log scale)
  • Figure 3: Culprits per Disjointness Statement (log scale)