Algorithm Configuration for Structured Pfaffian Settings
Maria-Florina Balcan, Anh Tuan Nguyen, Dravyansh Sharma
TL;DR
This work develops and applies a Pfaffian-enhanced learning framework for data-driven algorithm configuration. By extending the Goldberg–Jerrum framework to Pfaffian functions, the Pfaffian GJ framework enables learning guarantees for parameterized algorithms whose dual utilities exhibit Pfaffian piecewise structure, broadening beyond rational-function routines. A refined Pfaffian piecewise structure ties Pfaffian boundaries and pieces together to yield tighter pseudo-dimension bounds, and the authors demonstrate generalization guarantees for data-driven agglomerative clustering, graph-based semi-supervised learning, and regularized logistic regression, along with online dispersion tools for Pfaffian discontinuities. The framework is complemented by online learning results with dispersion-based no-regret guarantees, and the paper closes with concrete applications, comparisons to prior work, and directions for future research. Overall, the Pfaffian approach broadens the theoretical foundations of data-driven algorithm design and provides principled, scalable guarantees for complex, Pfaffian-structured problems.
Abstract
Data-driven algorithm design automatically adapts algorithms to specific application domains, achieving better performance. In the context of parameterized algorithms, this approach involves tuning the algorithm's hyperparameters using problem instances drawn from the problem distribution of the target application domain. This can be achieved by maximizing empirical utilities that measure the algorithms' performance as a function of their hyperparameters, using problem instances. While empirical evidence supports the effectiveness of data-driven algorithm design, providing theoretical guarantees for several parameterized families remains challenging. This is due to the intricate behaviors of their corresponding utility functions, which typically admit piecewise discontinuous structures. In this work, we present refined frameworks for providing learning guarantees for parameterized data-driven algorithm design problems in both distributional and online learning settings. For the distributional learning setting, we introduce the \textit{Pfaffian GJ framework}, an extension of the classical \textit{GJ framework}, that is capable of providing learning guarantees for function classes for which the computation involves Pfaffian functions. Unlike the GJ framework, which is limited to function classes with computation characterized by rational functions, our proposed framework can deal with function classes involving Pfaffian functions, which are much more general and widely applicable. We then show that for many parameterized algorithms of interest, their utility function possesses a \textit{refined piecewise structure}, which automatically translates to learning guarantees using our proposed framework.
