Efficient Environmental Claim Detection with Hyperbolic Graph Neural Networks
Darpan Aswal, Manjira Sinha
TL;DR
This paper tackles environmental claim detection by moving away from heavyweight transformers to a graph-based framework that treats sentences as dependency graphs. It uses word2vec-based node features augmented with learnable POS embeddings, and encodes syntactic relations as edge features, with a weighted loss to handle data imbalance; classification is performed by GNNs and Hyperbolic GNNs in Euclidean and non-Euclidean spaces. The key finding is that hyperbolic graphs, especially in the Poincaré space with POS augmentation, achieve competitive or superior performance while using roughly 12M parameters (about 30x fewer than large transformers like RoBERTalarge), underscoring the value of explicit hierarchical modeling for NLP tasks. The results demonstrate the practical impact of hyperbolic geometry in NLP, offering a scalable, interpretable, and efficient alternative for specialized detection tasks and motivating broader exploration of non-Euclidean representations in language processing.
Abstract
Transformer based models, especially large language models (LLMs) dominate the field of NLP with their mass adoption in tasks such as text generation, summarization and fake news detection. These models offer ease of deployment and reliability for most applications, however, they require significant amounts of computational power for training as well as inference. This poses challenges in their adoption in resource-constrained applications, especially in the open-source community where compute availability is usually scarce. This work proposes a graph-based approach for Environmental Claim Detection, exploring Graph Neural Networks (GNNs) and Hyperbolic Graph Neural Networks (HGNNs) as lightweight yet effective alternatives to transformer-based models. Re-framing the task as a graph classification problem, we transform claim sentences into dependency parsing graphs, utilizing a combination of word2vec \& learnable part-of-speech (POS) tag embeddings for the node features and encoding syntactic dependencies in the edge relations. Our results show that our graph-based models, particularly HGNNs in the poincaré space (P-HGNNs), achieve performance superior to the state-of-the-art on environmental claim detection while using up to \textbf{30x fewer parameters}. We also demonstrate that HGNNs benefit vastly from explicitly modeling data in hierarchical (tree-like) structures, enabling them to significantly improve over their euclidean counterparts.
