Table of Contents
Fetching ...

OpenAutoNLU: Open Source AutoML Library for NLU

Grigory Arshinov, Aleksandr Boriskin, Sergey Senichev, Ayaz Zaripov, Daria Galimzianova, Daniil Karpov, Leonid Sanochkin

TL;DR

OpenAutoNLU is an open-source automated machine learning library for natural language understanding (NLU) tasks, covering both text classification and named entity recognition, that introduces data-aware training regime selection that requires no manual configuration from the user.

Abstract

OpenAutoNLU is an open-source automated machine learning library for natural language understanding (NLU) tasks, covering both text classification and named entity recognition (NER). Unlike existing solutions, we introduce data-aware training regime selection that requires no manual configuration from the user. The library also provides integrated data quality diagnostics, configurable out-of-distribution (OOD) detection, and large language model (LLM) features, all within a minimal lowcode API. The demo app is accessible here https://openautonlu.dev.

OpenAutoNLU: Open Source AutoML Library for NLU

TL;DR

OpenAutoNLU is an open-source automated machine learning library for natural language understanding (NLU) tasks, covering both text classification and named entity recognition, that introduces data-aware training regime selection that requires no manual configuration from the user.

Abstract

OpenAutoNLU is an open-source automated machine learning library for natural language understanding (NLU) tasks, covering both text classification and named entity recognition (NER). Unlike existing solutions, we introduce data-aware training regime selection that requires no manual configuration from the user. The library also provides integrated data quality diagnostics, configurable out-of-distribution (OOD) detection, and large language model (LLM) features, all within a minimal lowcode API. The demo app is accessible here https://openautonlu.dev.
Paper Structure (25 sections, 2 figures, 6 tables)

This paper contains 25 sections, 2 figures, 6 tables.

Figures (2)

  • Figure 1: Text Classification Training Pipeline and Inference Pipeline flow
  • Figure 2: This graph illustrates the ratio between performance in macro F1 on classification tasks and time in seconds took to train the solution. Measures are averaged between four different text classification datasets.