Fusion of Mixture of Experts and Generative Artificial Intelligence in Mobile Edge Metaverse
Guangyuan Liu, Hongyang Du, Dusit Niyato, Jiawen Kang, Zehui Xiong, Abbas Jamalipour, Shiwen Mao, Dong In Kim
TL;DR
The paper addresses the computational and coherence challenges of content creation in the mobile edge Metaverse by fusing Mixture of Experts (MoE) with Generative AI (GAI). It develops a framework where MoE selective activation governs Discriminative and Generative AI tasks, enabling scalable, edge-friendly video generation through task decomposition and cross-device collaboration. The approach integrates MMVAE and MoE-Fusion concepts to enhance multimodal content generation, validated via case studies using the VBench metric suite. Key findings show improved video quality and consistency when tasks are properly decomposed, while also highlighting training complexity and bandwidth as notable challenges for deployment at scale.
Abstract
In the digital transformation era, Metaverse offers a fusion of virtual reality (VR), augmented reality (AR), and web technologies to create immersive digital experiences. However, the evolution of the Metaverse is slowed down by the challenges of content creation, scalability, and dynamic user interaction. Our study investigates an integration of Mixture of Experts (MoE) models with Generative Artificial Intelligence (GAI) for mobile edge computing to revolutionize content creation and interaction in the Metaverse. Specifically, we harness an MoE model's ability to efficiently manage complex data and complex tasks by dynamically selecting the most relevant experts running various sub-models to enhance the capabilities of GAI. We then present a novel framework that improves video content generation quality and consistency, and demonstrate its application through case studies. Our findings underscore the efficacy of MoE and GAI integration to redefine virtual experiences by offering a scalable, efficient pathway to harvest the Metaverse's full potential.
