Table of Contents
Fetching ...

Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech

Menglin Li, Kwan Hui Lim

TL;DR

The paper tackles financial relation extraction (FinRE) by augmenting a RoBERTa-based model with linguistic signals from Named Entity Recognition (NER) and Part-of-Speech (POS) tagging. The authors propose replacing textual content with NER tokens and concatenating POS tokens (TrNP) to enrich input representations, and conduct extensive ablations on the REFinD dataset. Their results show state-of-the-art Micro-F1 and Macro-F1 scores, with Micro-F1 of 0.7721 and Macro-F1 of 0.5507, and demonstrate that NER replacement yields larger gains than POS. This approach offers a simple yet effective way to boost FinRE performance and suggests broad applicability to domain-specific relation extraction tasks.

Abstract

The Financial Relation Extraction (FinRE) task involves identifying the entities and their relation, given a piece of financial statement/text. To solve this FinRE problem, we propose a simple but effective strategy that improves the performance of pre-trained language models by augmenting them with Named Entity Recognition (NER) and Part-Of-Speech (POS), as well as different approaches to combine these information. Experiments on a financial relations dataset show promising results and highlights the benefits of incorporating NER and POS in existing models. Our dataset and codes are available at https://github.com/kwanhui/FinRelExtract.

Enhancing Language Models for Financial Relation Extraction with Named Entities and Part-of-Speech

TL;DR

The paper tackles financial relation extraction (FinRE) by augmenting a RoBERTa-based model with linguistic signals from Named Entity Recognition (NER) and Part-of-Speech (POS) tagging. The authors propose replacing textual content with NER tokens and concatenating POS tokens (TrNP) to enrich input representations, and conduct extensive ablations on the REFinD dataset. Their results show state-of-the-art Micro-F1 and Macro-F1 scores, with Micro-F1 of 0.7721 and Macro-F1 of 0.5507, and demonstrate that NER replacement yields larger gains than POS. This approach offers a simple yet effective way to boost FinRE performance and suggests broad applicability to domain-specific relation extraction tasks.

Abstract

The Financial Relation Extraction (FinRE) task involves identifying the entities and their relation, given a piece of financial statement/text. To solve this FinRE problem, we propose a simple but effective strategy that improves the performance of pre-trained language models by augmenting them with Named Entity Recognition (NER) and Part-Of-Speech (POS), as well as different approaches to combine these information. Experiments on a financial relations dataset show promising results and highlights the benefits of incorporating NER and POS in existing models. Our dataset and codes are available at https://github.com/kwanhui/FinRelExtract.
Paper Structure (9 sections, 2 figures, 2 tables)

This paper contains 9 sections, 2 figures, 2 tables.

Figures (2)

  • Figure 1: An example of the Financial Relation Extraction (FinRE) problem, with the relation org:org:acquired_by extracted from this financial statement.
  • Figure 2: Architecture of Our Proposed Model for the FinRE Task.