SLFNet: Generating Semantic Logic Forms from Natural Language Using Semantic Probability Graphs
Hao Wu, Fan Xu
TL;DR
The paper tackles the NL-to-SLF parsing problem, which is hampered by order-sensitivity in traditional seq2seq models. It introduces SLFNet, a model that fuses dependent syntactic information via a Dependency-Fused BiLSTM, and uses Semantic Probability Graphs to encode local dependencies among predictor variables, enabling Seq-to-Slots predictions with Multi-Head SLF Attention. The task is decomposed into predicting the number of SLF groups, filling slot values, and performing a final logical inference to assemble the SLFs. Empirical results show state-of-the-art performance on ChineseQCI-TS and Okapi, with competitive results on ATIS/WikiSQL, highlighting improved NL-to-SLF generation and potential for robust natural language interfaces in data-driven systems.
Abstract
Building natural language interfaces typically uses a semantic parser to parse the user's natural language and convert it into structured \textbf{S}emantic \textbf{L}ogic \textbf{F}orms (SLFs). The mainstream approach is to adopt a sequence-to-sequence framework, which requires that natural language commands and SLFs must be represented serially. Since a single natural language may have multiple SLFs or multiple natural language commands may have the same SLF, training a sequence-to-sequence model is sensitive to the choice among them, a phenomenon recorded as "order matters". To solve this problem, we propose a novel neural network, SLFNet, which firstly incorporates dependent syntactic information as prior knowledge and can capture the long-range interactions between contextual information and words. Secondly construct semantic probability graphs to obtain local dependencies between predictor variables. Finally we propose the Multi-Head SLF Attention mechanism to synthesize SLFs from natural language commands based on Sequence-to-Slots. Experiments show that SLFNet achieves state-of-the-art performance on the ChineseQCI-TS and Okapi datasets, and competitive performance on the ATIS dataset.
