Beyond Binary Opinions: A Deep Reinforcement Learning-Based Approach to Uncertainty-Aware Competitive Influence Maximization
Qi Zhang, Dian Chen, Lance M. Kaplan, Audun Jøsang, Dong Hyun Jeong, Feng Chen, Jin-Hee Cho
TL;DR
This work addresses CIM under realistic opinion uncertainty by moving beyond binary beliefs and modeling uncertain opinions with Subjective Logic. It introduces DRIM, a dual-agent DRL framework that optimizes seed sets while incorporating uncertainty into opinion dynamics and exploration–exploitation decisions, including uncertainty-aware seed selection and partially observable networks. Key contributions include the Uncertainty-based Opinion Model (UOM), integration of Evidential Deep Learning for uncertainty estimation, and multiple uncertainty-aware EE strategies, all validated on three real OSN datasets with strong performance gains and efficiency improvements. The approach advances practical misinformation mitigation and seed-optimization in complex, dynamic networks, offering robust, scalable strategies for real-world CIM scenarios.
Abstract
The Competitive Influence Maximization (CIM) problem involves multiple entities competing for influence in online social networks (OSNs). While Deep Reinforcement Learning (DRL) has shown promise, existing methods often assume users' opinions are binary and ignore their behavior and prior knowledge. We propose DRIM, a multi-dimensional uncertainty-aware DRL-based CIM framework that leverages Subjective Logic (SL) to model uncertainty in user opinions, preferences, and DRL decision-making. DRIM introduces an Uncertainty-based Opinion Model (UOM) for a more realistic representation of user uncertainty and optimizes seed selection for propagating true information while countering false information. In addition, it quantifies uncertainty in balancing exploration and exploitation. Results show that UOM significantly enhances true information spread and maintains influence against advanced false information strategies. DRIM-based CIM schemes outperform state-of-the-art methods by up to 57% and 88% in influence while being up to 48% and 77% faster. Sensitivity analysis indicates that higher network observability and greater information propagation boost performance, while high network activity mitigates the effect of users' initial biases.
