Language Models are Few-shot Multilingual Learners
Genta Indra Winata, Andrea Madotto, Zhaojiang Lin, Rosanne Liu, Jason Yosinski, Pascale Fung
TL;DR
This paper demonstrates that large pre-trained language models can perform few-shot, multilingual intent classification without gradient updates by using carefully crafted prompts in an in-context learning setup. It introduces a Maximum Confidence Prediction mechanism to select among multiple label options and shows that English-context prompts can transfer to non-English test inputs across MTOP and MultiNLU datasets. Results indicate that model size substantially boosts performance, with GPT variants generally outperforming T5 and cross-lingual predictions improving as prompts scale, though non-English outputs lag English somewhat. The findings highlight the practical potential of zero-shot and few-shot cross-lingual inference for low-resource languages and suggest directions for future work on broader language coverage and prompt optimization.
Abstract
General-purpose language models have demonstrated impressive capabilities, performing on par with state-of-the-art approaches on a range of downstream natural language processing (NLP) tasks and benchmarks when inferring instructions from very few examples. Here, we evaluate the multilingual skills of the GPT and T5 models in conducting multi-class classification on non-English languages without any parameter updates. We show that, given a few English examples as context, pre-trained language models can predict not only English test samples but also non-English ones. Finally, we find the in-context few-shot cross-lingual prediction results of language models are significantly better than random prediction, and they are competitive compared to the existing state-of-the-art cross-lingual models.
