REM: A Scalable Reinforced Multi-Expert Framework for Multiplex Influence Maximization
Huyen Nguyen, Hieu Dam, Nguyen Do, Cong Tran, Cuong Pham
TL;DR
REM tackles Multiplex Influence Maximization by combining a Variational Autoencoder–based Seed2Vec with a Propagation Mixture of Experts to capture diverse diffusion dynamics across multiplex layers. It uses reinforcement-learning–inspired latent exploration to generate seed sets and a priority-replay mechanism to iteratively improve the latent model. Empirical results on five real multiplex networks under IC and LT show REM achieves higher influence spread and faster inference than state-of-the-art baselines, with near-linear scalability. This framework enables robust, scalable seed selection in complex multiplex environments where diffusion patterns are heterogeneous and unknown.
Abstract
In social online platforms, identifying influential seed users to maximize influence spread is a crucial as it can greatly diminish the cost and efforts required for information dissemination. While effective, traditional methods for Multiplex Influence Maximization (MIM) have reached their performance limits, prompting the emergence of learning-based approaches. These novel methods aim for better generalization and scalability for more sizable graphs but face significant challenges, such as (1) inability to handle unknown diffusion patterns and (2) reliance on high-quality training samples. To address these issues, we propose the Reinforced Expert Maximization framework (REM). REM leverages a Propagation Mixture of Experts technique to encode dynamic propagation of large multiplex networks effectively in order to generate enhanced influence propagation. Noticeably, REM treats a generative model as a policy to autonomously generate different seed sets and learn how to improve them from a Reinforcement Learning perspective. Extensive experiments on several real-world datasets demonstrate that REM surpasses state-of-the-art methods in terms of influence spread, scalability, and inference time in influence maximization tasks.
