Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels

Blake Castleman; Uzay Macar; Ansaf Salleb-Aouissi

Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels

Blake Castleman, Uzay Macar, Ansaf Salleb-Aouissi

TL;DR

This work addresses the need for open-source, adaptive tutoring systems capable of concurrently steering learners through conceptual ideas and associated problems with varying difficulty. It proposes a deployable hierarchical MAB architecture consisting of a high-level concept MAB and a low-level problem MAB, guided by ZPDES and memory-decay modeling (MCM) and augmented with MAPLE-inspired transient difficulty ranking. Bayesian Knowledge Tracing simulations show that a difficulty-agnostic hierarchical MAB improves mastery, and adding problem difficulty adaptation yields additional gains, indicating practical benefits for remote education. By delivering an open-source platform and detailed parameterizations, the paper enables researchers and educators to implement and extend MAB-based tutoring pipelines in real-world settings, with future work focusing on real-world trials, dynamic difficulty updates, and material redirects for underperforming students.

Abstract

Remote education has proliferated in the twenty-first century, yielding rise to intelligent tutoring systems. In particular, research has found multi-armed bandit (MAB) intelligent tutors to have notable abilities in traversing the exploration-exploitation trade-off landscape for student problem recommendations. Prior literature, however, contains a significant lack of open-sourced MAB intelligent tutors, which impedes potential applications of these educational MAB recommendation systems. In this paper, we combine recent literature on MAB intelligent tutoring techniques into an open-sourced and simply deployable hierarchical MAB algorithm, capable of progressing students concurrently through concepts and problems, determining ideal recommended problem difficulties, and assessing latent memory decay. We evaluate our algorithm using simulated groups of 500 students, utilizing Bayesian Knowledge Tracing to estimate students' content mastery. Results suggest that our algorithm, when turned difficulty-agnostic, significantly boosts student success, and that the further addition of problem-difficulty adaptation notably improves this metric.

Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels

TL;DR

Abstract

Paper Structure (24 sections, 8 equations, 3 figures, 1 algorithm)

This paper contains 24 sections, 8 equations, 3 figures, 1 algorithm.

Introduction
Related Work
Multi-Armed Bandit Intelligent Tutoring Frameworks - Without Difficulty Levels
Multi-Armed Bandit Intelligent Tutoring Frameworks - With Difficulty Levels
Methodology: Platform Architecture
Platform Section, Concept, and Problem Definition
Methodology: ZPDES Foundation for Algorithmic Progression
ZPDES Multi-Armed Bandit Design Foundation
MCM Algorithm Adaptation
Methodology: High-Level Concept Multi-Armed Bandit
Concept Progression Trees
Problem Difficulty
Methodology: Low-Level Problem Multi-Armed Bandit
Problem Progression Trees
Initial Problem Difficulty Integration
...and 9 more sections

Figures (3)

Figure 1: Progression tree examples for both the conceptual MAB and the problem MAB. Note that there are separate problem progression trees for each concept in a given section.
Figure 2: The average progression for groups of 500 simulated students' mastery of 5 sections (15 concepts) of material, using BKT. This includes a randomized sequence of questions (black), our hierarchical multi-armed bandit framework without problem difficulty considerations (red), and our hierarchical multi-armed bandit framework with problem difficulty included (blue).
Figure 3: A schematic of how the concept MAB and problem MABs interact.

Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels

TL;DR

Abstract

Hierarchical Multi-Armed Bandits for the Concurrent Intelligent Tutoring of Concepts and Problems of Varying Difficulty Levels

Authors

TL;DR

Abstract

Table of Contents

Figures (3)