Hints Help Finding and Fixing Bugs Differently in Python and Text-based Program Representations

Ruchit Rawal; Victor-Alexandru Pădurean; Sven Apel; Adish Singla; Mariya Toneva

Hints Help Finding and Fixing Bugs Differently in Python and Text-based Program Representations

Ruchit Rawal, Victor-Alexandru Pădurean, Sven Apel, Adish Singla, Mariya Toneva

TL;DR

This study investigates how hints influence bug finding and fixing when algorithms are represented in Python code versus natural-language text, across users who differ in their initial understanding of the task. Through a large crowd-sourced experiment (N=753) spanning eight condition combinations (two representations × four hint types) and two tasks per participant, the authors measure accuracy on bug-related questions and response time. Key findings show that text-based representations boost accuracy for users with clear understanding, while hints significantly improve Python-based debugging and can bridge gaps between representations and understanding levels; detailed fixes consistently outperform other hint types. The work provides practical guidance for designing adaptive programming tools that tailor representation and hints to user skill, with data and scripts publicly available to enable replication and further study.

Abstract

With the recent advances in AI programming assistants such as GitHub Copilot, programming is not limited to classical programming languages anymore--programming tasks can also be expressed and solved by end-users in natural text. Despite the availability of this new programming modality, users still face difficulties with algorithmic understanding and program debugging. One promising approach to support end-users is to provide hints to help them find and fix bugs while forming and improving their programming capabilities. While it is plausible that hints can help, it is unclear which type of hint is helpful and how this depends on program representations (classic source code or a textual representation) and the user's capability of understanding the algorithmic task. To understand the role of hints in this space, we conduct a large-scale crowd-sourced study involving 753 participants investigating the effect of three types of hints (test cases, conceptual, and detailed), across two program representations (Python and text-based), and two groups of users (with clear understanding or confusion about the algorithmic task). We find that the program representation (Python vs. text) has a significant influence on the users' accuracy at finding and fixing bugs. Surprisingly, users are more accurate at finding and fixing bugs when they see the program in natural text. Hints are generally helpful in improving accuracy, but different hints help differently depending on the program representation and the user's understanding of the algorithmic task. These findings have implications for designing next-generation programming tools that provide personalized support to users, for example, by adapting the programming modality and providing hints with respect to the user's skill level and understanding.

Hints Help Finding and Fixing Bugs Differently in Python and Text-based Program Representations

TL;DR

Abstract

Hints Help Finding and Fixing Bugs Differently in Python and Text-based Program Representations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)