Interpretable Recognition of Cognitive Distortions in Natural Language Texts

Anton Kolonin; Anna Arinicheva

Interpretable Recognition of Cognitive Distortions in Natural Language Texts

Anton Kolonin, Anna Arinicheva

TL;DR

The paper tackles automated recognition of cognitive distortions in therapeutic texts while prioritizing interpretability and efficiency. It introduces an interpretable multiclass classifier built on heterarchical N-grams and a convolution-based recognition algorithm that respects pattern hierarchy. The approach achieves state-of-the-art F1 scores on two public datasets and releases open-source code and models for community use, demonstrating real-time applicability. The work advances practical AI-assisted psychological care by providing auditable linguistic patterns and a path toward broader linguistic generalization and dataset expansion.

Abstract

We propose a new approach to multi-factor classification of natural language texts based on weighted structured patterns such as N-grams, taking into account the heterarchical relationships between them, applied to solve such a socially impactful problem as the automation of detection of specific cognitive distortions in psychological care, relying on an interpretable, robust and transparent artificial intelligence model. The proposed recognition and learning algorithms improve the current state of the art in this field. The improvement is tested on two publicly available datasets, with significant improvements over literature-known F1 scores for the task, with optimal hyper-parameters determined, having code and models available for future use by the community.

Interpretable Recognition of Cognitive Distortions in Natural Language Texts

TL;DR

Abstract

Interpretable Recognition of Cognitive Distortions in Natural Language Texts

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)