Table of Contents
Fetching ...

Mathematical Derivation Graphs: A Relation Extraction Task in STEM Manuscripts

Vishesh Prasad, Brian Kim, Nickvash Kani

TL;DR

This work defines a new relation extraction task focused on inter-equation dependencies in STEM manuscripts by introducing the Mathematical Derivation Graphs Dataset (MDGD) derived from 107 arXiv papers. It formalizes derivation graphs as directed acyclic graphs where nodes are key equations and edges encode derivational dependencies, and evaluates a spectrum of analytical, ML, and LLM-based methods to extract these edges. The study finds that baseline methods and zero-shot LLMs achieve $F_1$ around $0.45$–$0.52$, with targeted fixes offering incremental improvements but no decisive gains, highlighting the challenge of mathematical relation extraction. The results motivate hybrid analytic-LLM pipelines and task-specific modeling as promising directions for improving machine understanding and reconstruction of mathematical derivations in scholarly texts.

Abstract

Recent advances in natural language processing (NLP), particularly with the emergence of large language models (LLMs), have significantly enhanced the field of textual analysis. However, while these developments have yielded substantial progress in analyzing natural language text, applying analysis to mathematical equations and their relationships within texts has produced mixed results. This paper takes the initial steps in expanding the problem of relation extraction towards understanding the dependency relationships between mathematical expressions in STEM articles. The authors construct the Mathematical Derivation Graphs Dataset (MDGD), sourced from a random sampling of the arXiv corpus, containing an analysis of $107$ published STEM manuscripts with over $2000$ manually labeled inter-equation dependency relationships, resulting in a new object referred to as a derivation graph that summarizes the mathematical content of the manuscript. The authors exhaustively evaluate analytical and machine learning (ML) based models to assess their capability to identify and extract the derivation relationships for each article and compare the results with the ground truth. The authors show that the best tested LLMs achieve $F_1$ scores of $\sim45\%-52\%$, and attempt to improve their performance by combining them with analytic algorithms and other methods.

Mathematical Derivation Graphs: A Relation Extraction Task in STEM Manuscripts

TL;DR

This work defines a new relation extraction task focused on inter-equation dependencies in STEM manuscripts by introducing the Mathematical Derivation Graphs Dataset (MDGD) derived from 107 arXiv papers. It formalizes derivation graphs as directed acyclic graphs where nodes are key equations and edges encode derivational dependencies, and evaluates a spectrum of analytical, ML, and LLM-based methods to extract these edges. The study finds that baseline methods and zero-shot LLMs achieve around , with targeted fixes offering incremental improvements but no decisive gains, highlighting the challenge of mathematical relation extraction. The results motivate hybrid analytic-LLM pipelines and task-specific modeling as promising directions for improving machine understanding and reconstruction of mathematical derivations in scholarly texts.

Abstract

Recent advances in natural language processing (NLP), particularly with the emergence of large language models (LLMs), have significantly enhanced the field of textual analysis. However, while these developments have yielded substantial progress in analyzing natural language text, applying analysis to mathematical equations and their relationships within texts has produced mixed results. This paper takes the initial steps in expanding the problem of relation extraction towards understanding the dependency relationships between mathematical expressions in STEM articles. The authors construct the Mathematical Derivation Graphs Dataset (MDGD), sourced from a random sampling of the arXiv corpus, containing an analysis of published STEM manuscripts with over manually labeled inter-equation dependency relationships, resulting in a new object referred to as a derivation graph that summarizes the mathematical content of the manuscript. The authors exhaustively evaluate analytical and machine learning (ML) based models to assess their capability to identify and extract the derivation relationships for each article and compare the results with the ground truth. The authors show that the best tested LLMs achieve scores of , and attempt to improve their performance by combining them with analytic algorithms and other methods.

Paper Structure

This paper contains 27 sections, 5 equations, 8 figures, 4 tables.

Figures (8)

  • Figure 1: Ground truth derivation graph from article 0907.2648 bretzel2009barnett. Each node represents a key equation in the article, and edges represent derivation relationships.
  • Figure 2: Primary category count breakdown for all articles in the MDGD math_derivation_graphs based on the arXiv category taxonomy (n=107).
  • Figure 3: A brute force derivative edge from Eq. (4) to Eq. (8) in article 1701.00003 zhu2017local. Eq. (8) is implied as being key since it is explicitly labeled with a reference number. The definite edge shown in the text before the equation represents a derivation relationship from Eq. (4), pointing to Eq. (8).
  • Figure 4: Example of derivative relation found via token similarity between (3)and(6) in article 0907.3505, and confirmed by brute force edgefernández2010smallamplitudeapproximationdifferentialequation.
  • Figure 5: Best performing prompt template for zero-shot extracting equation derivations from an article.
  • ...and 3 more figures