EAGER: Edge-Aligned LLM Defense for Robust, Efficient, and Accurate Cybersecurity Question Answering
Onat Gungor, Roshan Sood, Jiasheng Zhou, Tajana Rosing
TL;DR
EAGER addresses the need for robust, accurate cybersecurity QA on edge devices by co-designing quantization-aware fine-tuning with domain-specific preference alignment. It integrates 4-bit quantized base weights (via NF4) with trainable low-rank adapters (QLoRA) and Direct Preference Optimization (DPO) trained on a self-generated cybersecurity preference dataset, enabling efficient edge deployment while preserving reasoning and safety. The framework achieves up to 7.3× reductions in attack success rate and up to 55% improvements in QA accuracy, with the lowest latency observed on Jetson Orin among evaluated defenses, demonstrating practical viability for SOC-like workflows at the edge. These results indicate that joint optimization of efficiency, robustness, and task accuracy through co-designed adaptation and domain-specific preferences can substantially improve lightweight LLM-based cybersecurity QA in resource-constrained environments.
Abstract
Large Language Models (LLMs) are highly effective for cybersecurity question answering (QA) but are difficult to deploy on edge devices due to their size. Quantization reduces memory and compute requirements but often degrades accuracy and increases vulnerability to adversarial attacks. We present EAGER, an edge-aligned defense framework that integrates parameter-efficient quantization with domain-specific preference alignment to jointly optimize efficiency, robustness, and accuracy. Unlike prior methods that address these aspects separately, EAGER leverages Quantized Low-Rank Adaptation (QLoRA) for low-cost fine-tuning and Direct Preference Optimization (DPO) on a self-constructed cybersecurity preference dataset, eliminating the need for human labels. Experiments show that EAGER reduces adversarial attack success rates by up to 7.3x and improves QA accuracy by up to 55% over state-of-the-art defenses, while achieving the lowest response latency on a Jetson Orin, demonstrating its practical edge deployment.
