Comparing Neural Network Encodings for Logic-based Explainability
Levi Cordeiro Carvalho, Saulo A. F. Oliveira, Thiago Alves Rocha
TL;DR
This work tackles the challenge of providing correct, minimal explanations for neural network predictions by translating ANN behavior into logic-based constraints solvable via MILP. It compares two encodings: the established fischetti approach using logical implications and an adapted tjeng encoding that avoids implications and reduces variable/constraint counts. Across 12 datasets and varying network depths, both encodings show similar times for explanation generation, with the tjeng-based adaptation delivering up to 18% faster constraint construction and up to 16% faster overall time. The findings advance the scalability of logic-based explainability and include publicly available implementations to support further research and application. The work highlights practical improvements in building explanations for moderate-sized networks and sets directions for evaluating larger architectures and alternative encodings.
Abstract
Providing explanations for the outputs of artificial neural networks (ANNs) is crucial in many contexts, such as critical systems, data protection laws and handling adversarial examples. Logic-based methods can offer explanations with correctness guarantees, but face scalability challenges. Due to these issues, it is necessary to compare different encodings of ANNs into logical constraints, which are used in logic-based explainability. This work compares two encodings of ANNs: one has been used in the literature to provide explanations, while the other will be adapted for our context of explainability. Additionally, the second encoding uses fewer variables and constraints, thus, potentially enhancing efficiency. Experiments showed similar running times for computing explanations, but the adapted encoding performed up to 18\% better in building logical constraints and up to 16\% better in overall time.
