NLP for The Greek Language: A Longer Survey
Katerina Papantoniou, Yannis Tzitzikas
TL;DR
This survey comprehensively maps three decades of Greek NLP research across Ancient, Modern, and dialectal varieties by organizing works along processing layers such as OCR, morphology, POS/parsing, embeddings, semantics, pragmatics, and MT. It highlights a notable rise in neural approaches, abundant sentiment analysis work, and growing dialogue-system interest, while noting relatively fewer efforts in QA, summarization, and coreference. The authors compile extensive resources, corpora, datasets, tools, and web interfaces, underscoring the increasing availability of Greek NLP resources (with Ancient Greek research often led by institutions outside Greece). Overall, the paper provides a practical framework and reference catalog for researchers and educators to advance Greek language processing and related knowledge-management tasks.
Abstract
English language is in the spotlight of the Natural Language Processing (NLP) community with other languages, like Greek, lagging behind in terms of offered methods, tools and resources. Due to the increasing interest in NLP, in this paper we try to condense research efforts for the automatic processing of Greek language covering the last three decades. In particular, we list and briefly discuss related works, resources and tools, categorized according to various processing layers and contexts. We are not restricted to the modern form of Greek language but also cover Ancient Greek and various Greek dialects. This survey can be useful for researchers and students interested in NLP tasks, Information Retrieval and Knowledge Management for the Greek language.
