PE: A Poincare Explanation Method for Fast Text Hierarchy Generation
Qian Chen, Dongyang Li, Xiaofeng He, Hongzhao Li, Hongyu Yi
TL;DR
PE addresses the opacity of deep NLP models by modeling feature interactions through hyperbolic geometry on the Poincaré ball with curvature $c=1$. It projects word embeddings into two hyperbolic spaces to capture semantic and syntactic structure, assigns feature contributions via a direct Shapley-style score, and decodes a text hierarchy as a minimum spanning tree in the projected space. The approach yields fast, non-contiguous interaction-aware HA with strong empirical support across three datasets, outperforming baselines in AOPC and construction time. This creates a scalable, linguistically informed explanation framework for large NLP models.
Abstract
The black-box nature of deep learning models in NLP hinders their widespread application. The research focus has shifted to Hierarchical Attribution (HA) for its ability to model feature interactions. Recent works model non-contiguous combinations with a time-costly greedy search in Eculidean spaces, neglecting underlying linguistic information in feature representations. In this work, we introduce a novel method, namely Poincare Explanation (PE), for modeling feature interactions with hyperbolic spaces in a time efficient manner. Specifically, we take building text hierarchies as finding spanning trees in hyperbolic spaces. First we project the embeddings into hyperbolic spaces to elicit inherit semantic and syntax hierarchical structures. Then we propose a simple yet effective strategy to calculate Shapley score. Finally we build the the hierarchy with proving the constructing process in the projected space could be viewed as building a minimum spanning tree and introduce a time efficient building algorithm. Experimental results demonstrate the effectiveness of our approach.
