Table of Contents
Fetching ...

Joint Extraction of Uyghur Medicine Knowledge with Edge Computing

Fan Lu, Quan Qi, Huaibin Qin

TL;DR

The paper addresses privacy and latency challenges in extracting Uyghur traditional medical knowledge by proposing CoEx-Bert, a joint, parameter-sharing extraction model deployed on edge devices. By coupling two sub-models for entity and relation extraction with shared hidden representations and a unified loss, CoEx-Bert reduces error propagation and leverages contextual interdependence to improve accuracy. Empirical results on a Uyghur medical dataset show superior precision, recall, and F1 (0.9065, 0.9245, 0.9154 respectively) over strong baselines, with favorable edge-device performance (65 ms latency, 100 MB bandwidth). The approach also demonstrates generalization to related languages (e.g., Mongolian texts) and highlights practical benefits for real-time, privacy-preserving medical knowledge extraction, with future work aimed at enhancing named entity recognition via integration with high-performing NER models.

Abstract

Medical knowledge extraction methods based on edge computing deploy deep learning models on edge devices to achieve localized entity and relation extraction. This approach avoids transferring substantial sensitive data to cloud data centers, effectively safeguarding the privacy of healthcare services. However, existing relation extraction methods mainly employ a sequential pipeline approach, which classifies relations between determined entities after entity recognition. This mode faces challenges such as error propagation between tasks, insufficient consideration of dependencies between the two subtasks, and the neglect of interrelations between different relations within a sentence. To address these challenges, a joint extraction model with parameter sharing in edge computing is proposed, named CoEx-Bert. This model leverages shared parameterization between two models to jointly extract entities and relations. Specifically, CoEx-Bert employs two models, each separately sharing hidden layer parameters, and combines these two loss functions for joint backpropagation to optimize the model parameters. Additionally, it effectively resolves the issue of entity overlapping when extracting knowledge from unstructured Uyghur medical texts by considering contextual relations. Finally, this model is deployed on edge devices for real-time extraction and inference of Uyghur medical knowledge. Experimental results demonstrate that CoEx-Bert outperforms existing state-of-the-art methods, achieving accuracy, recall, and F1 scores of 90.65\%, 92.45\%, and 91.54\%, respectively, in the Uyghur traditional medical literature dataset. These improvements represent a 6.45\% increase in accuracy, a 9.45\% increase in recall, and a 7.95\% increase in F1 score compared to the baseline.

Joint Extraction of Uyghur Medicine Knowledge with Edge Computing

TL;DR

The paper addresses privacy and latency challenges in extracting Uyghur traditional medical knowledge by proposing CoEx-Bert, a joint, parameter-sharing extraction model deployed on edge devices. By coupling two sub-models for entity and relation extraction with shared hidden representations and a unified loss, CoEx-Bert reduces error propagation and leverages contextual interdependence to improve accuracy. Empirical results on a Uyghur medical dataset show superior precision, recall, and F1 (0.9065, 0.9245, 0.9154 respectively) over strong baselines, with favorable edge-device performance (65 ms latency, 100 MB bandwidth). The approach also demonstrates generalization to related languages (e.g., Mongolian texts) and highlights practical benefits for real-time, privacy-preserving medical knowledge extraction, with future work aimed at enhancing named entity recognition via integration with high-performing NER models.

Abstract

Medical knowledge extraction methods based on edge computing deploy deep learning models on edge devices to achieve localized entity and relation extraction. This approach avoids transferring substantial sensitive data to cloud data centers, effectively safeguarding the privacy of healthcare services. However, existing relation extraction methods mainly employ a sequential pipeline approach, which classifies relations between determined entities after entity recognition. This mode faces challenges such as error propagation between tasks, insufficient consideration of dependencies between the two subtasks, and the neglect of interrelations between different relations within a sentence. To address these challenges, a joint extraction model with parameter sharing in edge computing is proposed, named CoEx-Bert. This model leverages shared parameterization between two models to jointly extract entities and relations. Specifically, CoEx-Bert employs two models, each separately sharing hidden layer parameters, and combines these two loss functions for joint backpropagation to optimize the model parameters. Additionally, it effectively resolves the issue of entity overlapping when extracting knowledge from unstructured Uyghur medical texts by considering contextual relations. Finally, this model is deployed on edge devices for real-time extraction and inference of Uyghur medical knowledge. Experimental results demonstrate that CoEx-Bert outperforms existing state-of-the-art methods, achieving accuracy, recall, and F1 scores of 90.65\%, 92.45\%, and 91.54\%, respectively, in the Uyghur traditional medical literature dataset. These improvements represent a 6.45\% increase in accuracy, a 9.45\% increase in recall, and a 7.95\% increase in F1 score compared to the baseline.
Paper Structure (19 sections, 8 equations, 6 figures, 5 tables)

This paper contains 19 sections, 8 equations, 6 figures, 5 tables.

Figures (6)

  • Figure 1: BERT's Multi-Layer Encoder Structure.
  • Figure 2: Mainstream Methods of Entity Relation Extraction.
  • Figure 3: Overall structure diagram of CoEx-BERT.
  • Figure 4: Extracting Textual Information with CoEx-Bert.
  • Figure 5: Input and Three Embeddings in CoEx-Bert Model.
  • ...and 1 more figures