QA-Noun: Representing Nominal Semantics via Natural Language Question-Answer Pairs
Maria Tseytlin, Paul Roit, Omri Abend, Ido Dagan, Ayal Klein
TL;DR
QA-Noun extends QA-based semantic representation to nominal semantics by introducing nine interpretable templates that elicit noun-centered predicate–argument relations. The authors release a dataset of over 2,000 noun mentions and a trained model integrated with QA-SRL to produce a unified, fine-grained decomposition of sentence meaning. Empirical results show QA-Noun nearly covers AMR noun arguments and, when combined with QA-SRL, yields substantially higher granularity than prior fact-based decomposition methods, improving cross-text alignment. The work also demonstrates the practicality of open-weight model pipelines (ICL and LoRA FT) for scalable noun-focused semantic annotation, and provides guidelines, tools, and a foundation for future cross-domain semantic decomposition research.
Abstract
Decomposing sentences into fine-grained meaning units is increasingly used to model semantic alignment. While QA-based semantic approaches have shown effectiveness for representing predicate-argument relations, they have so far left noun-centered semantics largely unaddressed. We introduce QA-Noun, a QA-based framework for capturing noun-centered semantic relations. QA-Noun defines nine question templates that cover both explicit syntactical and implicit contextual roles for nouns, producing interpretable QA pairs that complement verbal QA-SRL. We release detailed guidelines, a dataset of over 2,000 annotated noun mentions, and a trained model integrated with QA-SRL to yield a unified decomposition of sentence meaning into individual, highly fine-grained, facts. Evaluation shows that QA-Noun achieves near-complete coverage of AMR's noun arguments while surfacing additional contextually implied relations, and that combining QA-Noun with QA-SRL yields over 130\% higher granularity than recent fact-based decomposition methods such as FactScore and DecompScore. QA-Noun thus complements the broader QA-based semantic framework, forming a comprehensive and scalable approach to fine-grained semantic decomposition for cross-text alignment.
