CoDream: Exchanging dreams instead of models for federated aggregation with heterogeneous models
Abhishek Singh, Gauri Gupta, Ritvik Kapila, Yichuan Shi, Alex Dang, Sheshank Shankar, Mohammed Ehab, Ramesh Raskar
TL;DR
CoDream introduces a novel federated framework that exchanges dreams—randomly initialized data points optimized in input space—to capture the global data distribution without sharing raw data or model parameters. It enables model-agnostic collaboration by performing knowledge extraction, collaborative dreaming, and knowledge acquisition entirely in the data space, with gradients aggregated linearly to preserve secure aggregation properties. The approach combines entropy-based local dreaming, regularized dream optimization, and a knowledge-distillation-based acquisition stage, achieving competitive accuracy on MNIST, SVHN, and CIFAR-10 under both IID and non-IID settings with heterogeneous client models. Empirical results demonstrate CoDream’s scalability, robustness to heterogeneity, and significant reduction in model-size-dependent communication, suggesting practical applicability for privacy-preserving, cross-architecture federated learning.
Abstract
Federated Learning (FL) enables collaborative optimization of machine learning models across decentralized data by aggregating model parameters. Our approach extends this concept by aggregating "knowledge" derived from models, instead of model parameters. We present a novel framework called CoDream, where clients collaboratively optimize randomly initialized data using federated optimization in the input data space, similar to how randomly initialized model parameters are optimized in FL. Our key insight is that jointly optimizing this data can effectively capture the properties of the global data distribution. Sharing knowledge in data space offers numerous benefits: (1) model-agnostic collaborative learning, i.e., different clients can have different model architectures; (2) communication that is independent of the model size, eliminating scalability concerns with model parameters; (3) compatibility with secure aggregation, thus preserving the privacy benefits of federated learning; (4) allowing of adaptive optimization of knowledge shared for personalized learning. We empirically validate CoDream on standard FL tasks, demonstrating competitive performance despite not sharing model parameters. Our code: https://mitmedialab.github.io/codream.github.io/
