A classification model based on a population of hypergraphs
Samuel Barton, Adelle Coster, Diane Donovan, James Lefevre
TL;DR
This work tackles classification by explicitly modeling high-order feature interactions through a population of hypergraphs. It constructs discrete incidence representations via normalization and discretization, builds a main-effects hypergraph, and extends this to two-way and higher-order (eta-way) interactions, aggregating predictions from many models for robustness. Empirical results on Fisher's Iris and starch grain datasets show accuracy competitive with Random Forest, with larger gains on the more complex starch dataset, especially when using eta > 1 and threshold-based decision rules. The approach highlights the value of higher-order interactions and ensemble hypergraph models for robust, generalizable classification, with the practical ability to rule out unlikely classes.
Abstract
This paper introduces a novel hypergraph classification algorithm. The use of hypergraphs for classification has been widely studied. In previous work, hypergraph models are typically constructed using distance-based or attribute-based methods; that is, hyperedges are generated by connecting a set of samples that lie within a certain distance of one another or share a common attribute. These methods, however, rarely focus on multi-way interactions directly. The algorithm presented in this paper addresses this problem by constructing hypergraphs that explore multi-way interactions of any order. We also improve the performance and robustness of the algorithm by using a population of hypergraphs. The algorithm is evaluated on two datasets (Fisher's Iris and a starch grain dataset), demonstrating promising performance compared to a generic random forest classification algorithm.
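To make the idea of attribute-based hyperedges and multi-way interactions concrete, the following is a minimal illustrative sketch and not the construction used in the paper. It assumes min-max normalization, equal-width binning, and hyperedges defined as the sets of samples that share the same bins on a combination of eta features; the function names `discretize` and `interaction_hyperedges` and the parameter `n_bins` are hypothetical.

```python
from itertools import combinations

import numpy as np


def discretize(X, n_bins=3):
    """Min-max normalize each column to [0, 1], then map it to integer bins 0..n_bins-1.

    Assumption: equal-width bins; the paper's discretization may differ.
    """
    X = np.asarray(X, dtype=float)
    lo, hi = X.min(axis=0), X.max(axis=0)
    span = np.where(hi > lo, hi - lo, 1.0)  # avoid division by zero for constant columns
    scaled = (X - lo) / span
    # A scaled value of exactly 1.0 would land in bin n_bins, so clip it to the last bin.
    return np.minimum((scaled * n_bins).astype(int), n_bins - 1)


def interaction_hyperedges(bins, eta=1):
    """Group samples by their bin values on every combination of eta features.

    Each key is a tuple of (feature, bin) pairs; each value is the set of
    sample indices that share those bins (one hypothetical hyperedge).
    eta=1 gives main effects, eta=2 gives two-way interactions, and so on.
    """
    n_samples, n_features = bins.shape
    edges = {}
    for feats in combinations(range(n_features), eta):
        for i in range(n_samples):
            key = tuple((f, int(bins[i, f])) for f in feats)
            edges.setdefault(key, set()).add(i)
    return edges


# Toy data: four samples, two features.
X = [[0.0, 10.0], [0.1, 20.0], [0.9, 30.0], [1.0, 30.0]]
bins = discretize(X, n_bins=2)
main = interaction_hyperedges(bins, eta=1)
pairs = interaction_hyperedges(bins, eta=2)
```

In this toy example, the main-effect hyperedge for feature 0, bin 0 contains samples 0 and 1, while the two-way hyperedge for (feature 0, bin 1) together with (feature 1, bin 1) contains samples 2 and 3. A population of such hypergraphs could be obtained, for instance, by varying the binning or the feature subsets, though how the paper builds and aggregates its population is not specified here.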
