Research on Tibetan Tourism Viewpoints information generation system based on LLM
Jinhu Qi, Shuai Yan, Wentao Zhang, Yibo Zhang, Zirui Liu, Ke Wang
TL;DR
This paper tackles the challenge of providing high-quality, structured tourist viewpoint information for Tibet by addressing data sparsity and the need for domain-tailored LLM systems. It introduces the DualGen Bridge AI framework, which fuses a location-keyword extractor with a bridge-based nearest-viewpoint recommender and a generation module, all trained with SFT and LoRA-based fine-tuning, enhanced by ORPO. A novel multi-structure evaluation framework is proposed to assess structured outputs, complemented by empirical validation across dataset construction, keyword extraction, and generation experiments, establishing effective configurations and demonstrating significant improvements in information quality and relevance. The work advances practical, LLM-driven smart tourism for Tibet and lays groundwork for broader, multi-modal, domain-specific tourism information systems. It also offers concrete guidance on parameter-efficient fine-tuning strategies and evaluation methodologies for large-scale language models in resource-constrained, real-world settings.
Abstract
Tibet, ensconced within China's territorial expanse, is distinguished by its labyrinthine and heterogeneous topography, a testament to its profound historical heritage, and the cradle of a unique religious ethos. The very essence of these attributes, however, has impeded the advancement of Tibet's tourism service infrastructure, rendering existing smart tourism services inadequate for the region's visitors. This study delves into the ramifications of informational disparities at tourist sites on Tibetan tourism and addresses the challenge of establishing the Large Language Model (LLM) evaluation criteria. It introduces an innovative approach, the DualGen Bridge AI system, employing supervised fine-tuning techniques to bolster model functionality and enhance optimization processes. Furthermore, it pioneers a multi-structured generative results assessment framework. Empirical validation confirms the efficacy of this framework. The study also explores the application of the supervised fine-tuning method within the proprietary DualGen Bridge AI, aimed at refining the generation of tourist site information. The study's findings offer valuable insights for optimizing system performance and provide support and inspiration for the application of LLM technology in Tibet's tourism services and beyond, potentially revolutionizing the smart tourism industry with advanced, tailored information generation capabilities.
