Decomposed Prompting: Probing Multilingual Linguistic Structure Knowledge in Large Language Models
Ercong Nie, Shuzhou Yuan, Bolei Ma, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze
TL;DR
This work tackles probing multilingual linguistic structure knowledge in large language models by reframing sequence labeling as a set of per-token prompts. It introduces decomposed prompting, which generates individual prompts for each token and optionally leverages few-shot demonstrations, to improve both accuracy and inference efficiency over iterative prompting, evaluated on UDPOS across 38 languages with English-centric and multilingual LLMs. The results show that decomposed prompting yields superior performance in zero- and few-shot settings, with notable speed-ups, and reveal insights into how English-centric models transfer linguistic knowledge across languages depending on proximity and script familiarity. The findings offer a practical, training-free method to assess multilingual linguistic competence in LLMs and lay groundwork for extending such probing to more complex linguistic tasks.
Abstract
Probing the multilingual knowledge of linguistic structure in LLMs, often characterized as sequence labeling, faces challenges with maintaining output templates in current text-to-text prompting strategies. To solve this, we introduce a decomposed prompting approach for sequence labeling tasks. Diverging from the single text-to-text prompt, our prompt method generates for each token of the input sentence an individual prompt which asks for its linguistic label. We test our method on the Universal Dependencies part-of-speech tagging dataset for 38 languages, using both English-centric and multilingual LLMs. Our findings show that decomposed prompting surpasses the iterative prompting baseline in efficacy and efficiency under zero- and few-shot settings. Moreover, our analysis of multilingual performance of English-centric LLMs yields insights into the transferability of linguistic knowledge via multilingual prompting.
