Annotation Guidelines for Corpus Novelties: Part 1 -- Named Entity Recognition

Arthur Amalvy, Vincent Labatut

TL;DR

This document describes the guidelines applied during the annotation of the Novelties corpus, together with a number of examples drawn from the annotated novels that illustrate which expressions should be marked as entities and which should not.

Abstract

The Novelties corpus is a collection of novels (and parts of novels) annotated for Named Entity Recognition (NER), among other tasks. This document describes the guidelines applied during its annotation. It contains the instructions used by the annotators, together with a number of examples drawn from the annotated novels that illustrate which expressions should be marked as entities and which should not.
