MetaSynth: Multi-Agent Metadata Generation from Implicit Feedback in Black-Box Systems
Shreeranjani Srirangamsridharan, Ali Abavisani, Reza Yousefi Maragheh, Ramin Giahi, Kai Zhao, Jason Cho, Sushant Kumar
TL;DR
The paper tackles optimizing meta titles and descriptions in black-box search/ranking environments where explicit labels are unavailable. It introduces MetaSynth, a three-component framework comprising an exemplar library, a constrained generation module, and an evaluator-refinement loop that learns from implicit feedback from top-ranked results. Offline and online experiments on proprietary data and the Amazon dataset show state-of-the-art improvements in NDCG, MRR, and ranking metrics, with significant CTR and click increases in live A/B tests. The approach demonstrates a general paradigm for leveraging weak supervision to optimize content in black-box systems, with potential extensions to other domains and modalities.
Abstract
Meta titles and descriptions strongly shape engagement in search and recommendation platforms, yet optimizing them remains challenging. Search engine ranking models are black box environments, explicit labels are unavailable, and feedback such as click-through rate (CTR) arrives only post-deployment. Existing template, LLM, and retrieval-augmented approaches either lack diversity, hallucinate attributes, or ignore whether candidate phrasing has historically succeeded in ranking. This leaves a gap in directly leveraging implicit signals from observable outcomes. We introduce MetaSynth, a multi-agent retrieval-augmented generation framework that learns from implicit search feedback. MetaSynth builds an exemplar library from top-ranked results, generates candidate snippets conditioned on both product content and exemplars, and iteratively refines outputs via evaluator-generator loops that enforce relevance, promotional strength, and compliance. On both proprietary e-commerce data and the Amazon Reviews corpus, MetaSynth outperforms strong baselines across NDCG, MRR, and rank metrics. Large-scale A/B tests further demonstrate 10.26% CTR and 7.51% clicks. Beyond metadata, this work contributes a general paradigm for optimizing content in black-box systems using implicit signals.
