Table of Contents
Fetching ...

SPRINT: Scalable and Predictive Intent Refinement for LLM-Enhanced Session-based Recommendation

Gyuseok Lee, Wonbin Kweon, Zhenrui Yue, Yaokun Liu, Yifan Liu, Susik Yoon, Dong Wang, SeongKu Kang

TL;DR

This work tackles the dual challenges of context scarcity and scalability in applying LLM-based profiling to session-based recommendation. It introduces SPRINT, a two-stage framework that first derives reliable session intents using uncertainty-aware LLM invocation and a global intent pool, then leverages a lightweight intent predictor and collaborative enrichment to integrate multi-intent signals into SBR without LLMs at inference. The approach significantly improves recommendation accuracy while reducing training and inference costs, outperforming state-of-the-art baselines across three real-world datasets. The results demonstrate improved explainability and practical viability for real-time SBR deployments, highlighting the value of constrained, predicate-driven intent generation and cross-session collaboration. Overall, SPRINT offers a scalable, interpretable path to harness LLM knowledge for personalized, efficient session-based recommendations.

Abstract

Large language models (LLMs) have enhanced conventional recommendation models via user profiling, which generates representative textual profiles from users' historical interactions. However, their direct application to session-based recommendation (SBR) remains challenging due to severe session context scarcity and poor scalability. In this paper, we propose SPRINT, a scalable SBR framework that incorporates reliable and informative intents while ensuring high efficiency in both training and inference. SPRINT constrains LLM-based profiling with a global intent pool and validates inferred intents based on recommendation performance to mitigate noise and hallucinations under limited context. To ensure scalability, LLMs are selectively invoked only for uncertain sessions during training, while a lightweight intent predictor generalizes intent prediction to all sessions without LLM dependency at inference time. Experiments on real-world datasets show that SPRINT consistently outperforms state-of-the-art methods while providing more explainable recommendations.

SPRINT: Scalable and Predictive Intent Refinement for LLM-Enhanced Session-based Recommendation

TL;DR

This work tackles the dual challenges of context scarcity and scalability in applying LLM-based profiling to session-based recommendation. It introduces SPRINT, a two-stage framework that first derives reliable session intents using uncertainty-aware LLM invocation and a global intent pool, then leverages a lightweight intent predictor and collaborative enrichment to integrate multi-intent signals into SBR without LLMs at inference. The approach significantly improves recommendation accuracy while reducing training and inference costs, outperforming state-of-the-art baselines across three real-world datasets. The results demonstrate improved explainability and practical viability for real-time SBR deployments, highlighting the value of constrained, predicate-driven intent generation and cross-session collaboration. Overall, SPRINT offers a scalable, interpretable path to harness LLM knowledge for personalized, efficient session-based recommendations.

Abstract

Large language models (LLMs) have enhanced conventional recommendation models via user profiling, which generates representative textual profiles from users' historical interactions. However, their direct application to session-based recommendation (SBR) remains challenging due to severe session context scarcity and poor scalability. In this paper, we propose SPRINT, a scalable SBR framework that incorporates reliable and informative intents while ensuring high efficiency in both training and inference. SPRINT constrains LLM-based profiling with a global intent pool and validates inferred intents based on recommendation performance to mitigate noise and hallucinations under limited context. To ensure scalability, LLMs are selectively invoked only for uncertain sessions during training, while a lightweight intent predictor generalizes intent prediction to all sessions without LLM dependency at inference time. Experiments on real-world datasets show that SPRINT consistently outperforms state-of-the-art methods while providing more explainable recommendations.

Paper Structure

This paper contains 28 sections, 17 equations, 7 figures, 5 tables, 1 algorithm.

Figures (7)

  • Figure 1: A conceptual comparison of (a) existing LLM-based user profiling and (b) our approach. Best viewed in color.
  • Figure 2: Overview of the SPRINT framework. Best viewed in color.
  • Figure 3: Comparison of performance and efficiency on Beauty. "Train. Time" denotes the total time, including LLM-based user profiling for session contexts (e.g., profiles or intents) and subsequent SBR model training. "Inf. Time" denotes the inference latency for the entire test set.
  • Figure 4: Random vs. uncertainty-aware session selection.
  • Figure 5: Results with varying amounts of LLM intents.
  • ...and 2 more figures