RexUniNLU: Recursive Method with Explicit Schema Instructor for Universal NLU
Chengyuan Liu, Shihang Wang, Fubang Zhao, Kun Kuang, Yangyang Kang, Weiming Lu, Changlong Sun, Fei Wu
TL;DR
RexUniNLU presents a unified encoder-based approach to universal NLU by redefining UIE and introducing an Explicit Schema Instructor to constrain extractions. It recursively queries all schema types with three token-linking operations, enabling extraction of complex schemas such as quadruples and quintuples across IE, CLS, and cross-modality tasks. The framework demonstrates strong performance in both high- and low-resource settings, and its cross-language and multi-modal variants show substantial robustness and competitive results. The work highlights the value of explicit schema constraints for accurate, scalable NLU and provides open-source resources to support broader adoption.
Abstract
Information Extraction (IE) and Text Classification (CLS) serve as the fundamental pillars of NLU, with both disciplines relying on analyzing input sequences to categorize outputs into pre-established schemas. However, there is no existing encoder-based model that can unify IE and CLS tasks from this perspective. To fully explore the foundation shared within NLU tasks, we have proposed a Recursive Method with Explicit Schema Instructor for Universal NLU. Specifically, we firstly redefine the true universal information extraction (UIE) with a formal formulation that covers almost all extraction schemas, including quadruples and quintuples which remain unsolved for previous UIE models. Then, we expands the formulation to all CLS and multi-modal NLU tasks. Based on that, we introduce RexUniNLU, an universal NLU solution that employs explicit schema constraints for IE and CLS, which encompasses all IE and CLS tasks and prevent incorrect connections between schema and input sequence. To avoid interference between different schemas, we reset the position ids and attention mask matrices. Extensive experiments are conducted on IE, CLS in both English and Chinese, and multi-modality, revealing the effectiveness and superiority. Our codes are publicly released.
