WDD: Weighted Delta Debugging
Xintong Zhou, Zhenyang Xu, Mengxiao Zhang, Yongqiang Tian, Chengnian Sun
TL;DR
The paper tackles inefficiencies in delta debugging caused by ignoring varied fragment sizes within test inputs. It introduces Weighted Delta Debugging (WDD), which assigns weights to elements by size and uses weight-based partitioning to guide minimization, yielding two algorithms: $W_\text{ddmin}$ and $W_\text{ProbDD}$. Through extensive evaluation on HDD and Perses across 62 benchmarks (C and XML), the authors demonstrate that WDD improves both efficiency and effectiveness, producing smaller minimizations with fewer tests and faster runtimes. The work provides formalization, algorithmic development, empirical validation, and replication artifacts, offering a practical and generalizable enhancement to tree-based program reduction and debugging workflows.
Abstract
Delta Debugging is a widely used family of algorithms (e.g., ddmin and ProbDD) to automatically minimize bug-triggering test inputs, thus to facilitate debugging. It takes a list of elements with each element representing a fragment of the test input, systematically partitions the list at different granularities, identifies and deletes bug-irrelevant partitions. Prior delta debugging algorithms assume there are no differences among the elements in the list, and thus treat them uniformly during partitioning. However, in practice, this assumption usually does not hold, because the size (referred to as weight) of the fragment represented by each element can vary significantly. For example, a single element representing 50% of the test input is much more likely to be bug-relevant than elements representing only 1%. This assumption inevitably impairs the efficiency or even effectiveness of these delta debugging algorithms. This paper proposes Weighted Delta Debugging (WDD), a novel concept to help prior delta debugging algorithms overcome the limitation mentioned above. The key insight of WDD is to assign each element in the list a weight according to its size, and distinguish different elements based on their weights during partitioning. We designed two new minimization algorithms, Wddmin and WProbDD, by applying WDD to ddmin and ProbDD respectively. We extensively evaluated Wddmin and WProbDD in two representative applications, HDD and Perses, on 62 benchmarks across two languages. The results strongly demonstrate the value of WDD. We firmly believe that WDD opens up a new dimension to improve test input minimization techniques.
