A Deep Reinforcement Learning Approach for Security-Aware Service Acquisition in IoT
Marco Arazzi, Serena Nicolazzo, Antonino Nocera
TL;DR
The paper tackles the challenge of enabling end-users to control privacy and security during IoT service acquisition by introducing a deep reinforcement learning framework where a user agent learns to select service providers that satisfy SecSLA/PLA-derived security classes within operational deadlines. The approach models IoT as a graph with WSLA-defined services and a weighted security lattice, enabling a security-loss-based reward that balances operation completion with security/privacy constraints. A Deep Q-Network drives the agent’s decisions, employing Accept-Action and Decline-Action strategies, along with a contact-list mechanism to store and reuse favorable providers. Extensive experiments using NYC-based movement data demonstrate high operation completion with controlled security loss, and show that even lightweight IoT devices can perform real-time inference, supporting practical deployment in smart-city scenarios.
Abstract
The novel Internet of Things (IoT) paradigm is composed of a growing number of heterogeneous smart objects and services that are transforming architectures and applications, increasing systems' complexity, and the need for reliability and autonomy. In this context, both smart objects and services are often provided by third parties which do not give full transparency regarding the security and privacy of the features offered. Although machine-based Service Level Agreements (SLA) have been recently leveraged to establish and share policies in Cloud-based scenarios, and also in the IoT context, the issue of making end users aware of the overall system security levels and the fulfillment of their privacy requirements through the provision of the requested service remains a challenging task. To tackle this problem, we propose a complete framework that defines suitable levels of privacy and security requirements in the acquisition of services in IoT, according to the user needs. Through the use of a Reinforcement Learning based solution, a user agent, inside the environment, is trained to choose the best smart objects granting access to the target services. Moreover, the solution is designed to guarantee deadline requirements and user security and privacy needs. Finally, to evaluate the correctness and the performance of the proposed approach we illustrate an extensive experimental analysis.
