Concept-based Analysis of Neural Networks via Vision-Language Models

Ravi Mangal; Nina Narodytska; Divya Gopinath; Boyue Caroline Hu; Anirban Roy; Susmit Jha; Corina Pasareanu

TL;DR

This work tackles the challenge of formally analyzing vision-based DNNs by using Vision-Language Models (VLMs) as a semantic lens. It introduces $\texttt{Con}_{\texttt{spec}}$, a first-order specification language for expressing model properties in terms of human-understandable concepts, and defines a concept representation map $rep$, implemented via VLMs, so that specifications can be checked automatically in the shared text/image embedding space. A key methodological contribution is the affine map $r_{map}$ that aligns a vision model's embeddings with a VLM's embeddings. Together with a decomposition of the vision model into an encoder and a head, this alignment reduces verification to linear constraints on the head, enabling scalable, region-focused checks. The paper demonstrates the approach on a ResNet18 classifier trained on RIVAL-10 with CLIP as the VLM: it extracts concept directions through CLIP captions, statistically validates the $rep$ predicates, and verifies the model against $\texttt{Con}_{\texttt{spec}}$ properties, illustrating the potential of semantic, scalable DNN verification built on multimodal foundation models.
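The reduction to linear constraints can be illustrated with a small sketch. It assumes that a concept-strength comparison is decided by cosine similarity between an embedding `z` and two concept direction vectors; the names `stronger_cosine`, `stronger_linear`, `q_wings`, and `q_wheels` are hypothetical and the vectors are random stand-ins, not CLIP outputs. Because the norm of `z` is positive and appears in both cosines, the comparison is equivalent to a single linear inequality in `z`.

```python
import numpy as np

def unit(v):
    """Scale a vector to unit Euclidean norm."""
    return v / np.linalg.norm(v)

def stronger_cosine(z, q1, q2):
    """True if z has higher cosine similarity to concept q1 than to q2."""
    cos1 = z @ q1 / (np.linalg.norm(z) * np.linalg.norm(q1))
    cos2 = z @ q2 / (np.linalg.norm(z) * np.linalg.norm(q2))
    return bool(cos1 > cos2)

def stronger_linear(z, q1, q2):
    """Same predicate written as one linear constraint w . z > 0."""
    w = unit(q1) - unit(q2)  # ||z|| cancels, so only the directions matter
    return bool(w @ z > 0)

# Random stand-ins for two concept directions and a batch of embeddings
# (assumption: real directions would come from a VLM text encoder).
rng = np.random.default_rng(0)
q_wings, q_wheels = rng.normal(size=8), rng.normal(size=8)
zs = rng.normal(size=(200, 8))

agree = all(
    stronger_cosine(z, q_wings, q_wheels) == stronger_linear(z, q_wings, q_wheels)
    for z in zs
)
print("cosine and linear forms agree on all samples:", agree)
```

Writing the predicate as `w @ z > 0` is what lets a property over concepts be handed to a solver as a linear constraint on the embedding, under the assumptions stated above.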

Abstract

The analysis of vision-based deep neural networks (DNNs) is highly desirable but very challenging, owing to the difficulty of expressing formal specifications for vision tasks and the lack of efficient verification procedures. In this paper, we propose to leverage emerging multimodal vision-language foundation models (VLMs) as a lens through which we can reason about vision models. VLMs have been trained on a large body of images accompanied by their textual descriptions, and are thus implicitly aware of high-level, human-understandable concepts describing the images. We describe a logical specification language $\texttt{Con}_{\texttt{spec}}$ designed to facilitate writing specifications in terms of these concepts. To define and formally check $\texttt{Con}_{\texttt{spec}}$ specifications, we build a map between the internal representations of a given vision model and a VLM, leading to an efficient verification procedure of natural-language properties for vision models. We demonstrate our techniques on a ResNet-based classifier trained on the RIVAL-10 dataset using CLIP as the multimodal model.
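A map between the two representation spaces can be sketched as follows. The summary describes the map as affine but does not say how it is fitted, so ordinary least squares is used here purely as an assumption; `fit_affine_map`, `r_map`, and the synthetic embeddings are hypothetical stand-ins for a vision encoder and a VLM image encoder applied to the same inputs.

```python
import numpy as np

def fit_affine_map(Z_vision, Z_vlm):
    """Least-squares fit of (A, b) such that Z_vision @ A + b approximates Z_vlm."""
    n = Z_vision.shape[0]
    X = np.hstack([Z_vision, np.ones((n, 1))])  # append a bias column
    W, *_ = np.linalg.lstsq(X, Z_vlm, rcond=None)
    return W[:-1], W[-1]                        # A: (d_vision, d_vlm), b: (d_vlm,)

def r_map(z, A, b):
    """Apply the fitted affine map to one embedding or a batch of embeddings."""
    return z @ A + b

# Synthetic paired embeddings (assumption: in practice both rows would be
# computed from the same image by the vision model and by the VLM).
rng = np.random.default_rng(0)
d_vision, d_vlm, n = 6, 4, 200
A_true = rng.normal(size=(d_vision, d_vlm))
b_true = rng.normal(size=d_vlm)
Z_vision = rng.normal(size=(n, d_vision))
Z_vlm = Z_vision @ A_true + b_true + 0.01 * rng.normal(size=(n, d_vlm))

A, b = fit_affine_map(Z_vision, Z_vlm)
max_err = float(np.abs(r_map(Z_vision, A, b) - Z_vlm).max())
print("max alignment error on the fitting set:", max_err)
```

An affine map keeps any downstream linear concept constraint linear in the vision model's own embedding, which is the property the verification procedure relies on; how faithful such a map is on real embeddings has to be measured separately.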

Paper Structure (43 sections, 4 theorems, 25 equations, 14 figures)

Introduction
Preliminaries
Neural Network Classifiers.
Cosine Similarity.
Vision-Language Models.
$\texttt{Con}_{\texttt{spec}}$ Specification Language
Syntax
Example.
Semantics
Vision-Language Models as Analysis Tools
Mapping vision model embedding to VLM embedding.
Exploiting model decomposition for verification.
Discussion.
Case Study
Dataset, Concepts, and Strength Predicates
...and 28 more sections

Key Result

Theorem

Given a vision model $f:X\rightarrow Y$ that can be decomposed into $f_{enc}:X\rightarrow Z$ and $f_{head}:Z\rightarrow Y$ where $Z:=\mathbb{R}^{d'}$, a $\texttt{Con}_{\texttt{spec}}$ specification $e$, a linear concept representation map $rep$, as defined in Defn. def:con_eqv, and an input scope $B$ [...] where $\widehat{rep}$ operates on embeddings instead of inputs and is obtained from $rep$ by replac[...]

Figures (14)

Figure 1: Overview of Approach
Figure 2: $\texttt{Con}_{\texttt{spec}}$ syntax
Figure 3: $\texttt{Con}_{\texttt{spec}}$ semantics
Figure 4: Relevant concepts per class
Figure 5: Satisfaction probabilities for $rep$ implemented only using CLIP model $g$
...and 9 more figures

Theorems & Definitions (13)

Definition: Satisfaction of specification by model
Definition: Linear $rep$ via VLM
Definition: Linear $rep$ via vision model, $r_{map}$, and VLM
Definition: Faithful alignment of representation spaces
Theorem
Proof
Lemma
Proof
Lemma
Proof
...and 3 more
