ULTRA:Urdu Language Transformer-based Recommendation Architecture
Alishbah Bashir, Fatima Qaiser, Ijaz Hussain
TL;DR
This work tackles the challenge of semantic content recommendation for Urdu, a low-resource language, by introducing ULTRA, a dual-embedding Transformer-based architecture with query-length routing. Short queries are matched against headline embeddings using CLS pooling, while long queries leverage mean-pooled content embeddings reduced via PCA, enabling scalable, high-fidelity semantic retrieval. Empirical evaluation on a large Urdu news corpus shows precision improvements over baselines, with Precision@15 of approximately 0.9435 for short queries and 0.9853 for long queries, and PCA identified as the effective dimensionality reduction technique across both pathways. The approach delivers production-grade performance, demonstrates generalizable insights for pooling and dimensionality reduction, and outlines practical directions for deployment and extension to other languages and modalities. Key contributions include the adaptive routing threshold, optimized pooling strategies, and a robust dual-path retrieval architecture that significantly advances semantic retrieval in low-resource settings.
Abstract
Urdu, as a low-resource language, lacks effective semantic content recommendation systems, particularly in the domain of personalized news retrieval. Existing approaches largely rely on lexical matching or language-agnostic techniques, which struggle to capture semantic intent and perform poorly under varying query lengths and information needs. This limitation results in reduced relevance and adaptability in Urdu content recommendation. We propose ULTRA (Urdu Language Transformer-based Recommendation Architecture),an adaptive semantic recommendation framework designed to address these challenges. ULTRA introduces a dual-embedding architecture with a query-length aware routing mechanism that dynamically distinguishes between short, intent-focused queries and longer, context-rich queries. Based on a threshold-driven decision process, user queries are routed to specialized semantic pipelines optimized for either title/headline-level or full-content/document level representations, ensuring appropriate semantic granularity during retrieval. The proposed system leverages transformer-based embeddings and optimized pooling strategies to move beyond surface-level keyword matching and enable context-aware similarity search. Extensive experiments conducted on a large-scale Urdu news corpus demonstrate that the proposed architecture consistently improves recommendation relevance across diverse query types. Results show gains in precision above 90% compared to single-pipeline baselines, highlighting the effectiveness of query-adaptive semantic alignment for low-resource languages. The findings establish ULTRA as a robust and generalizable content recommendation architecture, offering practical design insights for semantic retrieval systems in low-resource language settings.
