Social Learning: Towards Collaborative Learning with Large Language Models
Amirkeivan Mohtashami, Florian Hartmann, Sian Gooding, Lukas Zilka, Matt Sharifi, Blaise Aguera y Arcas
TL;DR
This work formalizes privacy-aware social learning for LLM-driven agents, enabling knowledge transfer between models without sharing private data. It proposes two mechanisms—Verbal Social Learning (instruction sharing) and Live models (synthetic example sharing)—and evaluates them across diverse tasks, using memorization-based privacy metrics to quantify data leakage. The results show that both approaches can achieve competitive performance compared with using original labels and prompts, with the Secret Sharer adaptation indicating only modest memorization. The paper also explores extensions (teaching to larger models, voting aggregators) and outlines future directions for improving teaching strategies, generalizing to other modalities, and strengthening privacy guarantees.
Abstract
We introduce the framework of "social learning" in the context of large language models (LLMs), whereby models share knowledge with each other in a privacy-aware manner using natural language. We present and evaluate two approaches for knowledge transfer between LLMs. In the first scenario, we allow the model to generate abstract prompts aiming to teach the task. In our second approach, models transfer knowledge by generating synthetic examples. We evaluate these methods across diverse datasets and quantify memorization as a proxy for privacy loss. These techniques inspired by social learning yield promising results with low memorization of the original data. In particular, we show that performance using these methods is comparable to results with the use of original labels and prompts. Our work demonstrates the viability of social learning for LLMs, establishes baseline approaches and highlights several unexplored areas for future work.
