Table of Contents
Fetching ...

OPBench: A Graph Benchmark to Combat the Opioid Crisis

Tianyi Ma, Yiyang Li, Yiyue Qian, Zheyuan Zhang, Zehong Wang, Chuxu Zhang, Yanfang Ye

TL;DR

The paper addresses the lack of a standardized benchmark for graph learning in the opioid crisis. It proposes OPBench, a first-of-its-kind benchmark with five datasets spanning healthcare, social networks, and dietary data, incorporating heterogeneous graphs and hypergraphs. The authors provide a unified evaluation framework with predefined splits, reproducible baselines, and comprehensive experiments that show higher-order and heterogeneous models outperform homogeneous projections, and data-imbalance handling is crucial. The work includes open-source code and datasets to foster reproducible, fair comparisons and accelerate graph-based interventions in public health.

Abstract

The opioid epidemic continues to ravage communities worldwide, straining healthcare systems, disrupting families, and demanding urgent computational solutions. To combat this lethal opioid crisis, graph learning methods have emerged as a promising paradigm for modeling complex drug-related phenomena. However, a significant gap remains: there is no comprehensive benchmark for systematically evaluating these methods across real-world opioid crisis scenarios. To bridge this gap, we introduce OPBench, the first comprehensive opioid benchmark comprising five datasets across three critical application domains: opioid overdose detection from healthcare claims, illicit drug trafficking detection from digital platforms, and drug misuse prediction from dietary patterns. Specifically, OPBench incorporates diverse graph structures, including heterogeneous graphs and hypergraphs, to preserve the rich and complex relational information among drug-related data. To address data scarcity, we collaborate with domain experts and authoritative institutions to curate and annotate datasets while adhering to privacy and ethical guidelines. Furthermore, we establish a unified evaluation framework with standardized protocols, predefined data splits, and reproducible baselines to facilitate fair and systematic comparison among graph learning methods. Through extensive experiments, we analyze the strengths and limitations of existing graph learning methods, thereby providing actionable insights for future research in combating the opioid crisis. Our source code and datasets are available at https://github.com/Tianyi-Billy-Ma/OPBench.

OPBench: A Graph Benchmark to Combat the Opioid Crisis

TL;DR

The paper addresses the lack of a standardized benchmark for graph learning in the opioid crisis. It proposes OPBench, a first-of-its-kind benchmark with five datasets spanning healthcare, social networks, and dietary data, incorporating heterogeneous graphs and hypergraphs. The authors provide a unified evaluation framework with predefined splits, reproducible baselines, and comprehensive experiments that show higher-order and heterogeneous models outperform homogeneous projections, and data-imbalance handling is crucial. The work includes open-source code and datasets to foster reproducible, fair comparisons and accelerate graph-based interventions in public health.

Abstract

The opioid epidemic continues to ravage communities worldwide, straining healthcare systems, disrupting families, and demanding urgent computational solutions. To combat this lethal opioid crisis, graph learning methods have emerged as a promising paradigm for modeling complex drug-related phenomena. However, a significant gap remains: there is no comprehensive benchmark for systematically evaluating these methods across real-world opioid crisis scenarios. To bridge this gap, we introduce OPBench, the first comprehensive opioid benchmark comprising five datasets across three critical application domains: opioid overdose detection from healthcare claims, illicit drug trafficking detection from digital platforms, and drug misuse prediction from dietary patterns. Specifically, OPBench incorporates diverse graph structures, including heterogeneous graphs and hypergraphs, to preserve the rich and complex relational information among drug-related data. To address data scarcity, we collaborate with domain experts and authoritative institutions to curate and annotate datasets while adhering to privacy and ethical guidelines. Furthermore, we establish a unified evaluation framework with standardized protocols, predefined data splits, and reproducible baselines to facilitate fair and systematic comparison among graph learning methods. Through extensive experiments, we analyze the strengths and limitations of existing graph learning methods, thereby providing actionable insights for future research in combating the opioid crisis. Our source code and datasets are available at https://github.com/Tianyi-Billy-Ma/OPBench.
Paper Structure (50 sections, 2 figures, 10 tables)

This paper contains 50 sections, 2 figures, 10 tables.

Figures (2)

  • Figure 1: Dataset construction process for applications in Opbench. (a) Opioid Overdose Detection: We leverage the PDMP dataset from the State of Ohio to construct a heterogeneous graph with four node types to detect patients at risk of opioid overdose. (b) Illicit Online Drug Trafficking: We collect metadata from X/Twitter and construct a hypergraph and a multi-relation graph to capture group-wise interactions and diverse relation types among users for community detection and role classification tasks. (c) Opioid Misuse Detection: We employ the NHANES data to construct a heterogeneous graph over five node types to predict opioid misuse through dietary patterns.
  • Figure 2: Inference time (ms) across hidden dimensions (64, 128, 256, 512). Left and middle panels show hypergraph neural networks on X-HyDrug-Comm and X-HyDrug-Role; right panel shows heterogeneous graph neural networks on NHANES-Diet.

Theorems & Definitions (3)

  • definition 1
  • definition 2
  • definition 3