Generalized Naive Bayes
Edith Alice Kovács, Anna Ország, Dániel Pfeifer, András Benczúr
TL;DR
The paper addresses NB's limitation from the conditional independence assumption by introducing Generalized Naive Bayes (GNB), a cherry-tree based extension that uses triplet clusters to better capture dependencies. It presents two learning algorithms, GNB-A (greedy) and GNB-O (optimal via maximum weighted arborescence), and proves that GNB offers a fit at least as good as NB in terms of KL divergence, with GNB-O achieving an optimal structure under a mild condition. The authors develop a classification process and integrated feature-selection methods, and validate their approach on several real medical datasets, frequently outperforming NB and TAN while maintaining interpretability through feature-importance scores. Overall, the work advances probabilistic graphical modeling for classification by providing transparent, information-theoretically grounded methods that improve accuracy and offer actionable feature insights in health-related tasks.
Abstract
In this paper we introduce the so-called Generalized Naive Bayes structure as an extension of the Naive Bayes structure. We give a new greedy algorithm that finds a good fitting Generalized Naive Bayes (GNB) probability distribution. We prove that this fits the data at least as well as the probability distribution determined by the classical Naive Bayes (NB). Then, under a not very restrictive condition, we give a second algorithm for which we can prove that it finds the optimal GNB probability distribution, i.e. best fitting structure in the sense of KL divergence. Both algorithms are constructed to maximize the information content and aim to minimize redundancy. Based on these algorithms, new methods for feature selection are introduced. We discuss the similarities and differences to other related algorithms in terms of structure, methodology, and complexity. Experimental results show, that the algorithms introduced outperform the related algorithms in many cases.
