Alternating Bi-Objective Optimization for Explainable Neuro-Fuzzy Systems
Qusai Khaled, Uzay Kaymak, Laura Genga
TL;DR
This work tackles the non-convex nature of the accuracy–explainability trade-off in neuro-fuzzy systems by introducing X-ANFIS, an alternating bi-objective gradient-based optimization that decouples performance and explainability. It leverages Cauchy membership functions for stable training and interleaves a differentiable explainability loss (X-pass) to enforce target distinguishability among adjacent fuzzy sets, updating antecedents while keeping consequents via regularized least squares. Across nine real-world UCI regression datasets, X-ANFIS achieves high semantic distinguishability (around $D \approx 0.50$) with competitive $R^2$, often recovering non-convex Pareto regions inaccessible to scalarized approaches and producing spatially coherent MF partitions. The findings show gradient-based optimization, when structured as alternating objectives, can yield explainable neuro-fuzzy models, and motivate extending these principles beyond Takagi–Sugeno to other fuzzy paradigms such as Mamdani systems.
Abstract
Fuzzy systems show strong potential in explainable AI due to their rule-based architecture and linguistic variables. Existing approaches navigate the accuracy-explainability trade-off either through evolutionary multi-objective optimization (MOO), which is computationally expensive, or gradient-based scalarization, which cannot recover non-convex Pareto regions. We propose X-ANFIS, an alternating bi-objective gradient-based optimization scheme for explainable adaptive neuro-fuzzy inference systems. Cauchy membership functions are used for stable training under semantically controlled initializations, and a differentiable explainability objective is introduced and decoupled from the performance objective through alternating gradient passes. Validated in approximately 5,000 experiments on nine UCI regression datasets, X-ANFIS consistently achieves target distinguishability while maintaining competitive predictive accuracy, recovering solutions beyond the convex hull of the MOO Pareto front.
