MorphNLI: A Stepwise Approach to Natural Language Inference Using Text Morphing
Vlad Andrei Negru, Robert Vacareanu, Camelia Lemnaru, Mihai Surdeanu, Rodica Potolea
TL;DR
We address semantic reasoning and dataset artifact challenges in natural language inference by proposing MorphNLI, a modular approach that morphs the premise toward the hypothesis via a sequence of atomic edits $M=(M_0,\dots,M_k)$, applies a standard NLI classifier to each adjacent pair, and aggregates the per-step labels to a final decision. The morphism generator is trained with a teacher–student setup using synthetic data generated with in-context learning and a filtering stage to prune low-quality morphs, enabling efficient fine-tuning of a smaller morphism model for inference. Cross-domain evaluation on MNLI and SICK shows MorphNLI outperforms vanilla NLI baselines in OOD settings and yields more faithful, interpretable explanations through the morphing chain, though results vary with dataset and potential contamination in LLM explanations. Overall, MorphNLI provides robust cross-domain NLI performance with transparent, stepwise reasoning, offering a practical route toward trustworthy and explainable inference in real-world applications.
Abstract
We introduce MorphNLI, a modular step-by-step approach to natural language inference (NLI). When classifying the premise-hypothesis pairs into {entailment, contradiction, neutral}, we use a language model to generate the necessary edits to incrementally transform (i.e., morph) the premise into the hypothesis. Then, using an off-the-shelf NLI model we track how the entailment progresses with these atomic changes, aggregating these intermediate labels into a final output. We demonstrate the advantages of our proposed method particularly in realistic cross-domain settings, where our method always outperforms strong baselines with improvements up to 12.6% (relative). Further, our proposed approach is explainable as the atomic edits can be used to understand the overall NLI label.
