Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks

Yingji Zhang; Danilo S. Carvalho; André Freitas

Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks

Yingji Zhang, Danilo S. Carvalho, André Freitas

TL;DR

The paper tackles the challenge of disentangling sentence semantics to enable localized and controllable generation. It introduces a flow-based invertible neural network (INN) added to a frozen transformer-based Autoencoder to map sentence representations into a smooth Gaussian latent space, with two training regimes: unsupervised and cluster-supervised around semantic role-content clusters derived from AST. Geometric data augmentation is used to reinforce separability, enabling precise interpolation and retrieval of role-content while preserving predicate-argument structure. Empirical results show that cluster-supervised INN yields superior disentanglement and more controllable generation than prior language-variational models, with high invertibility and smoother latent traversals. This work bridges distributional and formal semantics in NLP and opens avenues for safer, interpretable, and semantically controllable generation in explanation-centric tasks.

Abstract

Disentangled latent spaces usually have better semantic separability and geometrical properties, which leads to better interpretability and more controllable data generation. While this has been well investigated in Computer Vision, in tasks such as image disentanglement, in the NLP domain sentence disentanglement is still comparatively under-investigated. Most previous work have concentrated on disentangling task-specific generative factors, such as sentiment, within the context of style transfer. In this work, we focus on a more general form of sentence disentanglement, targeting the localised modification and control of more general sentence semantic features. To achieve this, we contribute to a novel notion of sentence semantic disentanglement and introduce a flow-based invertible neural network (INN) mechanism integrated with a transformer-based language Autoencoder (AE) in order to deliver latent spaces with better separability properties. Experimental results demonstrate that the model can conform the distributed latent space into a better semantically disentangled sentence space, leading to improved language interpretability and controlled generation when compared to the recent state-of-the-art language VAE models.

Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks

TL;DR

Abstract

Paper Structure (44 sections, 11 equations, 15 figures, 24 tables, 1 algorithm)

This paper contains 44 sections, 11 equations, 15 figures, 24 tables, 1 algorithm.

Introduction
Preliminaries
Controllability and interpretation in formal semantics.
Sentence semantic disentanglement.
Invertible Neural Networks (INNs).
Proposed Approach
Training Strategy
Unsupervised INN.
Cluster-supervised INN.
Geometrical Data Augmentation
Experiments
Disentanglement Encoding Evaluation
Disentanglement between ARG0 clusters.
Disentanglement between PRED clusters.
Disentanglement between ARG0,1,2 clusters.
...and 29 more sections

Figures (15)

Figure 1: Top: attribute space geometry. Bottom: general semantic geometry, where left: distributional semantic space of Optimus li2020optimus, right: our compositionality-induced semantic space where the geometrical location of sentence vectors can be located by the intersection of role-content clusters.
Figure 2: Transforming the representations of explanatory sentences from a language autoencoder (BERT-GPT2), into asemantically separable latent space with the support of the INN mechanism, where a sentence representation can be decomposed into a predicate-argument-level semantics (role-content).
Figure 3: ARG0: t-SNE plot, different colour represents different content regions (blue: animal, green: human, red: plant, purple: something) (left: Optimus, middle: unsupervised, right: cluster supervised), same order for remaining visualizations. We also provide the PCA plot in Figure \ref{['fig:a0_pca']}, both visualization shows that supervised embeddings concentrate on the respective cluster center.
Figure 4: PRED: t-SNE plot (blue: are, green: cause, red: is, purple: require). PCA plot is in Figure \ref{['fig:verb_pca']}.
Figure 5: Animal: t-SNE plot (blue: ARG0-animal, green: ARG1-animal, red: ARG2-animal), PCA plot is provided in Figure \ref{['fig:animal_pca']}.
...and 10 more figures

Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks

TL;DR

Abstract

Learning Disentangled Semantic Spaces of Explanations via Invertible Neural Networks

Authors

TL;DR

Abstract

Table of Contents

Figures (15)