UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification

Alvaro Lopez Pellicer; Kittipos Giatgong; Yi Li; Neeraj Suri; Plamen Angelov

UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification

Alvaro Lopez Pellicer, Kittipos Giatgong, Yi Li, Neeraj Suri, Plamen Angelov

TL;DR

UNICAD addresses the combined challenges of adversarial attacks and unseen classes in image classification by unifying attack detection, noise reduction, and novel class identification within a prototype-based, similarity-driven architecture. It introduces a layered design with a Feature Extraction Layer, Prototype Conditional Probability Layer, Global Decision Making Layer, Denoising Layer using a novel combined loss $\mathcal{L}_{\text{comb}}$, and an Attack Decision Making Layer to dynamically handle attacks and emerging classes. Key contributions include the prototype-based similarity framework, the embedded denoising autoencoder that preserves latent and pixel fidelity, and the ability to detect and incorporate new classes without full retraining, demonstrated on CIFAR-10 with backbones like VGG-16 and DINOv2. The results show strong robustness under FGSM, PGD, and C&W attacks, competitive unseen-class detection, and improved clean accuracy retention, signaling significant practical impact for secure and adaptable vision systems in dynamic environments where distribution shifts and adversarial threats occur. The work advances a practical path toward real-world deployment of unified defenses that balance accuracy, interpretability, and open-set adaptability.

Abstract

As the use of Deep Neural Networks (DNNs) becomes pervasive, their vulnerability to adversarial attacks and limitations in handling unseen classes poses significant challenges. The state-of-the-art offers discrete solutions aimed to tackle individual issues covering specific adversarial attack scenarios, classification or evolving learning. However, real-world systems need to be able to detect and recover from a wide range of adversarial attacks without sacrificing classification accuracy and to flexibly act in {\bf unseen} scenarios. In this paper, UNICAD, is proposed as a novel framework that integrates a variety of techniques to provide an adaptive solution. For the targeted image classification, UNICAD achieves accurate image classification, detects unseen classes, and recovers from adversarial attacks using Prototype and Similarity-based DNNs with denoising autoencoders. Our experiments performed on the CIFAR-10 dataset highlight UNICAD's effectiveness in adversarial mitigation and unseen class classification, outperforming traditional models.

UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification

TL;DR

, and an Attack Decision Making Layer to dynamically handle attacks and emerging classes. Key contributions include the prototype-based similarity framework, the embedded denoising autoencoder that preserves latent and pixel fidelity, and the ability to detect and incorporate new classes without full retraining, demonstrated on CIFAR-10 with backbones like VGG-16 and DINOv2. The results show strong robustness under FGSM, PGD, and C&W attacks, competitive unseen-class detection, and improved clean accuracy retention, signaling significant practical impact for secure and adaptable vision systems in dynamic environments where distribution shifts and adversarial threats occur. The work advances a practical path toward real-world deployment of unified defenses that balance accuracy, interpretability, and open-set adaptability.

Abstract

Paper Structure (16 sections, 13 equations, 3 figures, 3 tables)

This paper contains 16 sections, 13 equations, 3 figures, 3 tables.

Introduction
Related Work
Adversarial Attacks in Image Domain
Defences Against Adversarial Attacks
Novel Class Identification Techniques
Proposed UNICAD Architecture
Feature Extraction Layer
Prototype Conditional Probability Layer
Global Decision Making Layer
Denoising layer
Attack Decision Making Layer
Results and discussion
Dataset
Experimental Configuration
Results
...and 1 more sections

Figures (3)

Figure 1: FGSM attack visualization on a CIFAR-10 image, demonstrating the perturbation effects.
Figure 2: Schematic of the UNICAD framework, showing its layered structure and the processing pathways for (a) trained class images, (b) adversarially attacked images, and (c) unseen class images.
Figure 3: Architecture of the proposed denoising autoencoder highlighting two scenarios: (a) processing of clean images and (b) removal of adversarial noise from attacked images.

UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification

TL;DR

Abstract

UNICAD: A Unified Approach for Attack Detection, Noise Reduction and Novel Class Identification

Authors

TL;DR

Abstract

Table of Contents

Figures (3)