Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI

Tanjim Taharat Aurpa

Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI

Tanjim Taharat Aurpa

TL;DR

This study formulates mathematical problem interpretation as a Mathematical Entity Relation Extraction task, where operands are treated as entities and operators as their relationships, and reveals how specific textual and mathematical features influence relation prediction.

Abstract

Mathematical text understanding is a challenging task due to the presence of specialized entities and complex relationships between them. This study formulates mathematical problem interpretation as a Mathematical Entity Relation Extraction (MERE) task, where operands are treated as entities and operators as their relationships. Transformer-based models are applied to automatically extract these relations from mathematical text, with Bidirectional Encoder Representations from Transformers (BERT) achieving the best performance, reaching an accuracy of 99.39%. To enhance transparency and trust in the model's predictions, Explainable Artificial Intelligence (XAI) is incorporated using Shapley Additive Explanations (SHAP). The explainability analysis reveals how specific textual and mathematical features influence relation prediction, providing insights into feature importance and model behavior. By combining transformer-based learning, a task-specific dataset, and explainable modeling, this work offers an effective and interpretable framework for MERE, supporting future applications in automated problem solving, knowledge graph construction, and intelligent educational systems.

Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI

TL;DR

Abstract

Paper Structure (21 sections, 7 equations, 12 figures, 3 tables)

This paper contains 21 sections, 7 equations, 12 figures, 3 tables.

Introduction
Related Work
Proposed Methodology
Data Collection and Preprocessing
Model Building, Training and Evaluation
Proposed BERT Model
Hyperparameter Tuning
Model Evaluation
Model Explainability
Proposed Approach
Code Availability
Results
Discussion
Limitation
Conclusion and Future Work
...and 6 more sections

Figures (12)

Figure 1: Step-by-step pipeline for constructing the Mathematical Entity–Entity Relation Dataset, illustrating text collection from Bangla_MER and Somikoron datasets, duplicate handling, entity extraction, equation analysis, and manual relation labeling.
Figure 2: Donut chart showing the percentage distribution of mathematical relation classes in the constructed dataset, highlighting the dominance of subtle Square Root operations and the balanced representation of Addition, Subtraction, Multiplication, and Division.
Figure 3: Illustration of the data preprocessing workflow, demonstrating raw text normalization through character cleaning, stop-word removal, lemmatization, and stemming to generate model-ready mathematical text.
Figure 4: Proposed BERT Model
Figure 5: The working process of SHAP
...and 7 more figures

Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI

TL;DR

Abstract

Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI

Authors

TL;DR

Abstract

Table of Contents

Figures (12)