Table of Contents
Fetching ...

ILLC: Iterative Layer-by-Layer Compression for Enhancing Structural Faithfulness in SpArX

Ungsik Kim

TL;DR

This work tackles the challenge of compressing deep neural networks without sacrificing structural fidelity or explanatory power. It introduces Iterative Layer-by-Layer Compression (ILLC), a layer-wise scheme that recalibrates each layer before compressing the next, achieving lower input-output and structural unfaithfulness while preserving the argumentative reasoning captured by the SparX/QBAF framework. Empirical results on the Breast Cancer Diagnosis dataset show consistent improvements across global and local explanations and reveal phenomena like Bi-Local-Maxima and Dead Neurons, which ILLC mitigates. The approach promises more compact, faithful models suitable for high-stakes domains where transparent internal reasoning is required, with potential extensions to other architectures and skip-connected networks.

Abstract

In the field of Explainable Artificial Intelligence (XAI), argumentative XAI approaches have been proposed to represent the internal reasoning process of deep neural networks in a more transparent way by interpreting hidden nodes as arguements. However, as the number of layers increases, existing compression methods simplify all layers at once, which lead to high accumulative information loss. To compensate for this, we propose an iterative layer-by-layer compression technique in which each layer is compressed separately and the reduction error in the next layer is immediately compensated for, thereby improving the overall input-output and structural fidelity of the model. Experiments on the Breast Cancer Diagnosis dataset show that, compared to traditional compression, the method reduces input-output and structural unfaithfulness, and maintains a more consistent attack-support relationship in the Argumentative Explanation scheme. This is significant because it provides a new way to make complex MLP models more compact while still conveying their internal inference logic without distortion.

ILLC: Iterative Layer-by-Layer Compression for Enhancing Structural Faithfulness in SpArX

TL;DR

This work tackles the challenge of compressing deep neural networks without sacrificing structural fidelity or explanatory power. It introduces Iterative Layer-by-Layer Compression (ILLC), a layer-wise scheme that recalibrates each layer before compressing the next, achieving lower input-output and structural unfaithfulness while preserving the argumentative reasoning captured by the SparX/QBAF framework. Empirical results on the Breast Cancer Diagnosis dataset show consistent improvements across global and local explanations and reveal phenomena like Bi-Local-Maxima and Dead Neurons, which ILLC mitigates. The approach promises more compact, faithful models suitable for high-stakes domains where transparent internal reasoning is required, with potential extensions to other architectures and skip-connected networks.

Abstract

In the field of Explainable Artificial Intelligence (XAI), argumentative XAI approaches have been proposed to represent the internal reasoning process of deep neural networks in a more transparent way by interpreting hidden nodes as arguements. However, as the number of layers increases, existing compression methods simplify all layers at once, which lead to high accumulative information loss. To compensate for this, we propose an iterative layer-by-layer compression technique in which each layer is compressed separately and the reduction error in the next layer is immediately compensated for, thereby improving the overall input-output and structural fidelity of the model. Experiments on the Breast Cancer Diagnosis dataset show that, compared to traditional compression, the method reduces input-output and structural unfaithfulness, and maintains a more consistent attack-support relationship in the Argumentative Explanation scheme. This is significant because it provides a new way to make complex MLP models more compact while still conveying their internal inference logic without distortion.

Paper Structure

This paper contains 13 sections, 15 equations, 2 figures, 2 tables, 1 algorithm.

Figures (2)

  • Figure 1: A visual illustration of how the "activation value updated in the previous layer" leads to the input of the next layer during Iterative Layer-by-Layer Compression (ILLC) to minimize error accumulation. (a) is the original MLP structure. In step 2, the original method and our method have the same behavior. (b) The original method compresses using activation values obtained by transferring from the original MLP model. (c) Our method compresses using activation values obtained by again forwarding from the model compressed in step 2. Since the output of layer 1, compressed and recalculated in step 1, becomes the input of step 2, the error in layer 2 compression can be corrected immediately. This maintains a finer activation pattern between layers than the existing method, which ultimately reduces both structural fidelity and input-output fidelity compared to the original model.
  • Figure 2: Analysis of layer 20 and 100 neurons MLP in Table \ref{['tab:global_explanation']}

Theorems & Definitions (10)

  • Definition 1: Multi-Layer Perceptron (MLP)
  • Definition 2: Quantitative Argumentation Framework (QBAF)
  • Definition 3: Aggregation and Influence Functions for MLP Modeling
  • Definition 4: Graphical Structure of Clustered MLP
  • Definition 5: Parameters of Clustered MLP
  • Definition 6: Input-Output Unfaithfulness
  • Definition 7: Structural Unfaithfulness
  • Definition 8: Cognitive Complexity
  • Definition 9: Global Aggregation Functions
  • Definition 10: Local Edge Aggregation Function