Input Invex Neural Network

Suman Sapkota; Binod Bhattarai

TL;DR

Input Invex Neural Networks (II-NN) address the challenge of generating connected decision boundaries by enforcing invexity in neural models, ensuring simply connected lower contour sets. The work presents two concrete constructions: (i) Gradient Clipped Gradient Penalty (GC-GP), which constrains local input gradients via gradient clipping and a smooth projected-gradient penalty, and (ii) a modular composition $f_{invex}(X) = f_{cone}(f_{invertible}(X))$ of an invertible backbone with a convex function. These methods enable binary and multi-region (multi-invex) classifiers with interpretable, locality-aware decisions. They are validated on toy data and on image benchmarks (MNIST, Fashion-MNIST, CIFAR-10/100), where they achieve accuracy competitive with ordinary and convex baselines. The approach supports network-morphism-based neural architecture search (NAS) for local region learning and offers a path toward more interpretable regions in input space. GC-GP does not guarantee invexity in all cases, and formal proofs remain open for future work. Overall, II-NN provides a principled framework for constructing interpretable, region-based classifiers from invex functions and connected sets.
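A short argument for why the composition above yields connected regions, given as a sketch under two assumptions that the summary does not spell out: $f_{invertible}$ is a continuous bijection with a continuous inverse, and $f_{cone}$ is convex. For any threshold $t$,

$$\{X : f_{invex}(X) \le t\} = f_{invertible}^{-1}\big(\{Z : f_{cone}(Z) \le t\}\big).$$

The set $\{Z : f_{cone}(Z) \le t\}$ is a sublevel set of a convex function, so it is convex and therefore connected. The continuous map $f_{invertible}^{-1}$ sends connected sets to connected sets, so the lower contour set of $f_{invex}$ is connected as well, even when it is no longer convex.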

Abstract

Connected decision boundaries are useful in several tasks, such as image segmentation, clustering, alpha-shapes, or defining a region in n-dimensional space. However, the machine learning literature lacks methods for generating connected decision boundaries using neural networks. Thresholding an invex function, a generalization of a convex function, generates such decision boundaries. This paper presents two methods for constructing invex functions using neural networks. The first approach constrains a neural network with a Gradient Clipped Gradient Penalty (GC-GP), in which we clip and penalise the gradients. The second is based on the relationship of the invex function to the composition of invertible and convex functions. We employ connectedness as a basic interpretation method and create connected region-based classifiers. We show that multiple connected-set-based classifiers can approximate any classification function. In the experiments section, we use our methods for classification tasks using an ensemble of 1-vs-all models as well as a single multiclass model on small-scale datasets. The experiments show that connected-set-based classifiers pose no disadvantage relative to ordinary neural network classifiers and instead enhance their interpretability. We also conduct an extensive study of the properties of invex functions and connected sets for interpretability and network morphism, with experiments on toy and real-world datasets. Our study suggests that invex functions are fundamental to understanding and applying locality and connectedness of input space, which is useful for various downstream tasks.
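The idea of thresholding a composition of an invertible and a convex function can be illustrated with a minimal NumPy sketch. Everything in it is an assumption made for illustration and is not the paper's architecture: the invertible map is a hand-written 2D shear rather than a learned invertible network, the convex function is a Euclidean cone centred at the origin, and the names `f_invertible`, `f_invertible_inverse`, `f_cone`, `in_region`, and `connecting_path` are hypothetical.

```python
import numpy as np

def f_invertible(x):
    """Hypothetical invertible map (a shear): (x1, x2) -> (x1, x2 + 2*x1**2)."""
    x = np.asarray(x, dtype=float)
    return np.stack([x[..., 0], x[..., 1] + 2.0 * x[..., 0] ** 2], axis=-1)

def f_invertible_inverse(z):
    """Exact inverse of the shear: (z1, z2) -> (z1, z2 - 2*z1**2)."""
    z = np.asarray(z, dtype=float)
    return np.stack([z[..., 0], z[..., 1] - 2.0 * z[..., 0] ** 2], axis=-1)

def f_cone(z, center=(0.0, 0.0)):
    """Convex cone: Euclidean distance to an assumed centre."""
    return np.linalg.norm(np.asarray(z, dtype=float) - np.asarray(center), axis=-1)

def f_invex(x):
    """Composition of the convex cone with the invertible map."""
    return f_cone(f_invertible(x))

def in_region(x, threshold=1.0):
    """Thresholding the composed function defines the decision region."""
    return f_invex(x) <= threshold

def connecting_path(a, b, steps=50):
    """Path from a to b: a straight line in the latent space, mapped back to inputs."""
    za, zb = f_invertible(a), f_invertible(b)
    t = np.linspace(0.0, 1.0, steps)[:, None]
    return f_invertible_inverse((1.0 - t) * za + t * zb)

a = np.array([-0.9, -1.62])
b = np.array([0.9, -1.62])
midpoint = 0.5 * (a + b)

print(in_region(a), in_region(b))            # both endpoints lie inside the region
print(in_region(midpoint))                   # the straight-line midpoint lies outside
print(np.all(in_region(connecting_path(a, b))))  # the curved path stays inside
```

In this toy setting the region is bent by the shear, so it is not convex, yet any two points inside it can be joined by a path that never leaves it. That is the sense of connectedness the abstract relies on.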
