Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

Rajan; Ishaan Gupta

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

Rajan, Ishaan Gupta

TL;DR

A systematic benchmarking of four different GNN architectures across a diverse domain of datasets and a hierarchical fusion (GNN+FP) framework for target prediction, which consistently outperforms or matches the performance of standalone GNN (RMSE improvement>$7\%$) and baseline models.

Abstract

Molecules are commonly represented as SMILES strings, which can be readily converted to fixed-size molecular fingerprints. These fingerprints serve as feature vectors to train ML/DL models for molecular property prediction tasks in the field of computational chemistry, drug discovery, biochemistry, and materials science. Recent research has demonstrated that SMILES can be used to construct molecular graphs where atoms are nodes ($V$) and bonds are edges ($E$). These graphs can subsequently be used to train geometric DL models like GNN. GNN learns the inherent structural relationships within a molecule rather than depending on fixed-size fingerprints. Although GNN are powerful aggregators, their efficacy on smaller datasets and inductive biases across different architectures is less studied. In our present study, we performed a systematic benchmarking of four different GNN architectures across a diverse domain of datasets (physical chemistry, biological, and analytical). Additionally, we have also implemented a hierarchical fusion (GNN+FP) framework for target prediction. We observed that the fusion framework consistently outperforms or matches the performance of standalone GNN (RMSE improvement > $7\%$) and baseline models. Further, we investigated the representational similarity using centered kernel alignment (CKA) between GNN and fingerprint embeddings and found that they occupy highly independent latent spaces (CKA $\le0.46$). The cross-architectural CKA score suggests a high convergence between isotopic models like GCN, GraphSAGE and GIN (CKA $\geq0.88$), with GAT learning moderately independent representation (CKA $0.55-0.80$).

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

TL;DR

) and baseline models.

Abstract

) and bonds are edges (

). These graphs can subsequently be used to train geometric DL models like GNN. GNN learns the inherent structural relationships within a molecule rather than depending on fixed-size fingerprints. Although GNN are powerful aggregators, their efficacy on smaller datasets and inductive biases across different architectures is less studied. In our present study, we performed a systematic benchmarking of four different GNN architectures across a diverse domain of datasets (physical chemistry, biological, and analytical). Additionally, we have also implemented a hierarchical fusion (GNN+FP) framework for target prediction. We observed that the fusion framework consistently outperforms or matches the performance of standalone GNN (RMSE improvement >

) and baseline models. Further, we investigated the representational similarity using centered kernel alignment (CKA) between GNN and fingerprint embeddings and found that they occupy highly independent latent spaces (CKA

). The cross-architectural CKA score suggests a high convergence between isotopic models like GCN, GraphSAGE and GIN (CKA

), with GAT learning moderately independent representation (CKA

Paper Structure (10 sections, 3 equations, 5 figures, 2 tables)

This paper contains 10 sections, 3 equations, 5 figures, 2 tables.

Introduction
Methodology
Datasets and Preprocessing
Baseline Models
GNN Models
CKA with RBF Kernel
Experiments
Evaluation Metrics
Results and Discussion
Conclusion

Figures (5)

Figure 1: Violin plots showing the distributions and sample counts of each dataset.
Figure 2: GNN and Hybrid (GNN+FP) Model Architecture.
Figure 3: Barplot showing performance comparison (RMSE) across different models and datasets
Figure 4: Heatmap showing representational similarity between GNN and FP embeddings.
Figure 5: Heatmap showing cross-architecture GNN representational similarity.

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

TL;DR

Abstract

Benchmarking GNN Models on Molecular Regression Tasks with CKA-Based Representation Analysis

Authors

TL;DR

Abstract

Table of Contents

Figures (5)