EvoStruggle: A Dataset Capturing the Evolution of Struggle across Activities and Skill Levels

Shijia Feng; Michael Wray; Walterio Mayol-Cuevas

EvoStruggle: A Dataset Capturing the Evolution of Struggle across Activities and Skill Levels

Shijia Feng, Michael Wray, Walterio Mayol-Cuevas

TL;DR

This work addresses the challenge of detecting how struggle evolves during skill acquisition by introducing EvoStruggle, a large-scale, multi-activity dataset with precise temporal struggle annotations. The dataset comprises 61.68 hours of untrimmed video, 2,793 videos, and 5,385 labeled struggle segments from 76 participants performing 18 tasks across four activities, each repeated five times to capture skill progression. Temporal Action Localization models are evaluated as struggle detectors, achieving a mean average precision of 34.56% across tasks and 19.24% across activities, demonstrating transferability of struggle cues while highlighting generalization challenges. EvoStruggle provides a rich resource for developing robust, generalizable assistance and tutoring systems that adapt as learners evolve.

Abstract

The ability to determine when a person struggles during skill acquisition is crucial for both optimizing human learning and enabling the development of effective assistive systems. As skills develop, the type and frequency of struggles tend to change, and understanding this evolution is key to determining the user's current stage of learning. However, existing manipulation datasets have not focused on how struggle evolves over time. In this work, we collect a dataset for struggle determination, featuring 61.68 hours of video recordings, 2,793 videos, and 5,385 annotated temporal struggle segments collected from 76 participants. The dataset includes 18 tasks grouped into four diverse activities -- tying knots, origami, tangram puzzles, and shuffling cards, representing different task variations. In addition, participants repeated the same task five times to capture their evolution of skill. We define the struggle determination problem as a temporal action localization task, focusing on identifying and precisely localizing struggle segments with start and end times. Experimental results show that Temporal Action Localization models can successfully learn to detect struggle cues, even when evaluated on unseen tasks or activities. The models attain an overall average mAP of 34.56% when generalizing across tasks and 19.24% across activities, indicating that struggle is a transferable concept across various skill-based tasks while still posing challenges for further improvement in struggle detection. Our dataset is available at https://github.com/FELIXFENG2019/EvoStruggle.

EvoStruggle: A Dataset Capturing the Evolution of Struggle across Activities and Skill Levels

TL;DR

Abstract

EvoStruggle: A Dataset Capturing the Evolution of Struggle across Activities and Skill Levels

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)