From Questions to Insightful Answers: Building an Informed Chatbot for University Resources
Subash Neupane, Elias Hossain, Jason Keith, Himanshu Tripathi, Farbod Ghiasi, Noorbakhsh Amiri Golilarz, Amin Amirlatifi, Sudip Mittal, Shahram Rahimi
TL;DR
This work introduces BarkPlug v.2, an LLM-based chatbot for university resources that uses a Retrieval Augmented Generation pipeline to ground answers in a university-wide external corpus. The system vectorizes 42 MSU campus resources with text embeddings, stores them in Chroma DB, and retrieves relevant chunks via LangChain before generating responses with GPT-3.5-turbo, enabling domain-specific, interactive information access. Evaluation with the RAGAS framework yields high grounding and relevance (average ~0.97 across categories), while a System Usability Scale (SUS) assessment reports a practical mean of 67.75 from 50 students, indicating solid usability with room for improvements. The work discusses implementation details, dataset construction, limitations (e.g., no ASR, multilingual support) and future directions (multilingual support, ASR integration, improved retrieval strategies), highlighting BarkPlug v.2’s potential for broader deployment across universities and sectors that require structured, searchable campus information.
Abstract
This paper presents BARKPLUG V.2, a Large Language Model (LLM)-based chatbot system built using Retrieval Augmented Generation (RAG) pipelines to enhance the user experience and access to information within academic settings.The objective of BARKPLUG V.2 is to provide information to users about various campus resources, including academic departments, programs, campus facilities, and student resources at a university setting in an interactive fashion. Our system leverages university data as an external data corpus and ingests it into our RAG pipelines for domain-specific question-answering tasks. We evaluate the effectiveness of our system in generating accurate and pertinent responses for Mississippi State University, as a case study, using quantitative measures, employing frameworks such as Retrieval Augmented Generation Assessment(RAGAS). Furthermore, we evaluate the usability of this system via subjective satisfaction surveys using the System Usability Scale (SUS). Our system demonstrates impressive quantitative performance, with a mean RAGAS score of 0.96, and experience, as validated by usability assessments.
