Table of Contents
Fetching ...

Comparative Study of Zero-Shot Cross-Lingual Transfer for Bodo POS and NER Tagging Using Gemini 2.0 Flash Thinking Experimental Model

Sanjib Narzary, Bihung Brahma, Haradip Mahilary, Mahananda Brahma, Bidisha Som, Sukumar Nandi

TL;DR

This study tackles POS and NER tagging for Bodo, a low-resource language, by evaluating zero-shot cross-lingual transfer using the Gemini 2.0 Flash Thinking Experimental model. It compares two pipelines: translation-based tag transfer and prompt-based tag transfer with parallel English–Bodo pairs, outputting Bodo-tagged sequences in CONLL-2003 and assessed through qualitative expert evaluation. The results show that prompt-based transfer yields superior NER performance while POS tagging remains moderate in both approaches, with translation quality and cross-lingual divergences as key limiting factors. The work demonstrates the potential of large-language-model–driven bootstrapping for LRL NLP and outlines a path toward improved few-shot strategies, hybrid approaches, and community-driven resource development to reach higher accuracy.

Abstract

Named Entity Recognition (NER) and Part-of-Speech (POS) tagging are critical tasks for Natural Language Processing (NLP), yet their availability for low-resource languages (LRLs) like Bodo remains limited. This article presents a comparative empirical study investigating the effectiveness of Google's Gemini 2.0 Flash Thinking Experiment model for zero-shot cross-lingual transfer of POS and NER tagging to Bodo. We explore two distinct methodologies: (1) direct translation of English sentences to Bodo followed by tag transfer, and (2) prompt-based tag transfer on parallel English-Bodo sentence pairs. Both methods leverage the machine translation and cross-lingual understanding capabilities of Gemini 2.0 Flash Thinking Experiment to project English POS and NER annotations onto Bodo text in CONLL-2003 format. Our findings reveal the capabilities and limitations of each approach, demonstrating that while both methods show promise for bootstrapping Bodo NLP, prompt-based transfer exhibits superior performance, particularly for NER. We provide a detailed analysis of the results, highlighting the impact of translation quality, grammatical divergences, and the inherent challenges of zero-shot cross-lingual transfer. The article concludes by discussing future research directions, emphasizing the need for hybrid approaches, few-shot fine-tuning, and the development of dedicated Bodo NLP resources to achieve high-accuracy POS and NER tagging for this low-resource language.

Comparative Study of Zero-Shot Cross-Lingual Transfer for Bodo POS and NER Tagging Using Gemini 2.0 Flash Thinking Experimental Model

TL;DR

This study tackles POS and NER tagging for Bodo, a low-resource language, by evaluating zero-shot cross-lingual transfer using the Gemini 2.0 Flash Thinking Experimental model. It compares two pipelines: translation-based tag transfer and prompt-based tag transfer with parallel English–Bodo pairs, outputting Bodo-tagged sequences in CONLL-2003 and assessed through qualitative expert evaluation. The results show that prompt-based transfer yields superior NER performance while POS tagging remains moderate in both approaches, with translation quality and cross-lingual divergences as key limiting factors. The work demonstrates the potential of large-language-model–driven bootstrapping for LRL NLP and outlines a path toward improved few-shot strategies, hybrid approaches, and community-driven resource development to reach higher accuracy.

Abstract

Named Entity Recognition (NER) and Part-of-Speech (POS) tagging are critical tasks for Natural Language Processing (NLP), yet their availability for low-resource languages (LRLs) like Bodo remains limited. This article presents a comparative empirical study investigating the effectiveness of Google's Gemini 2.0 Flash Thinking Experiment model for zero-shot cross-lingual transfer of POS and NER tagging to Bodo. We explore two distinct methodologies: (1) direct translation of English sentences to Bodo followed by tag transfer, and (2) prompt-based tag transfer on parallel English-Bodo sentence pairs. Both methods leverage the machine translation and cross-lingual understanding capabilities of Gemini 2.0 Flash Thinking Experiment to project English POS and NER annotations onto Bodo text in CONLL-2003 format. Our findings reveal the capabilities and limitations of each approach, demonstrating that while both methods show promise for bootstrapping Bodo NLP, prompt-based transfer exhibits superior performance, particularly for NER. We provide a detailed analysis of the results, highlighting the impact of translation quality, grammatical divergences, and the inherent challenges of zero-shot cross-lingual transfer. The article concludes by discussing future research directions, emphasizing the need for hybrid approaches, few-shot fine-tuning, and the development of dedicated Bodo NLP resources to achieve high-accuracy POS and NER tagging for this low-resource language.

Paper Structure

This paper contains 19 sections, 4 figures, 29 tables.

Figures (4)

  • Figure 1: Abstract view of Google Gemini Architecture. Image Credit
  • Figure 2: Prompt with translation based tag transfer architecture.
  • Figure 3: Parallel sentence based tag transfer architecture.
  • Figure 4: Comparison of NER results for Bodo and English (Part 1).