Multi-Novelty: Improve the Diversity and Novelty of Contents Generated by Large Language Models via inference-time Multi-Views Brainstorming
Arash Lagzian, Srinivas Anumasa, Dianbo Liu
TL;DR
This work tackles the challenge that large language models often produce less diverse and novel content due to training-data limitations. It introduces inference-time multi-view brainstorming, an architecture-agnostic approach that augments prompts with diverse textual and visual views to boost diversity and novelty in generated outputs. A comprehensive evaluation framework assesses diversity, novelty, and correctness across multiple models on a large-scale 909k-response dataset, revealing significant gains in diversity and novelty with some trade-offs in correctness. The method enables richer, more exploratory AI applications without requiring architectural changes, with potential impact on creative AI agents and AI-driven scientific exploration.
Abstract
Large Language Models (LLMs) demonstrate remarkable proficiency in generating accurate and fluent text. However, they often struggle with diversity and novelty, leading to repetitive or overly deterministic responses. These limitations stem from constraints in training data, including gaps in specific knowledge domains, outdated information, and an over-reliance on textual sources. Such shortcomings reduce their effectiveness in tasks requiring creativity, multi-perspective reasoning, and exploratory thinking, such as LLM based AI scientist agents and creative artist agents . To address this challenge, we introduce inference-time multi-view brainstorming method, a novel approach that enriches input prompts with diverse perspectives derived from both textual and visual sources, which we refere to as "Multi-Novelty". By incorporating additional contextual information as diverse starting point for chain of thoughts, this method enhances the variety and creativity of generated outputs. Importantly, our approach is model-agnostic, requiring no architectural modifications and being compatible with both open-source and proprietary LLMs.
