Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

Zhen Yang; Ming Ding; Tinglin Huang; Yukuo Cen; Junshuai Song; Bin Xu; Yuxiao Dong; Jie Tang

Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

Zhen Yang, Ming Ding, Tinglin Huang, Yukuo Cen, Junshuai Song, Bin Xu, Yuxiao Dong, Jie Tang

TL;DR

This survey synthesizes negative sampling (NS) as a unifying technique across domains, formalizing a general NS framework and tracing its five evolutionary lines. It categorizes negatives by how candidates are constructed (global, local, mini-batch, hop, memory-based) and by sampling strategy (static, hard, GAN-based, auxiliary-based, in-batch), detailing their pros, cons, and domain-specific adaptations. The paper surveys NS applications in recommender systems, graph representation learning, knowledge graphs, NLP, and computer vision, highlighting practical gains, trade-offs, and robust design principles. It also discusses open problems—such as the trade-off between negative quantity and quality, false negatives, and the potential of non-sampling or negative-free approaches—and outlines future directions for adaptive, robust, and scalable NS methods.

Abstract

Negative sampling has swiftly risen to prominence as a focal point of research, with wide-ranging applications spanning machine learning, computer vision, natural language processing, data mining, and recommender systems. This growing interest raises several critical questions: Does negative sampling really matter? Is there a general framework that can incorporate all existing negative sampling methods? In what fields is it applied? Addressing these questions, we propose a general framework that leverages negative sampling. Delving into the history of negative sampling, we trace the development of negative sampling through five evolutionary paths. We dissect and categorize the strategies used to select negative sample candidates, detailing global, local, mini-batch, hop, and memory-based approaches. Our review categorizes current negative sampling methods into five types: static, hard, GAN-based, Auxiliary-based, and In-batch methods, providing a clear structure for understanding negative sampling. Beyond detailed categorization, we highlight the application of negative sampling in various areas, offering insights into its practical benefits. Finally, we briefly discuss open problems and future directions for negative sampling.

Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

TL;DR

Abstract

Paper Structure (21 sections, 4 equations, 11 figures, 2 tables)

This paper contains 21 sections, 4 equations, 11 figures, 2 tables.

Introduction
Negative Sampling
Formalization and Framework
Negative Sampling History
The Importance of Negative Sampling
Algorithms
Negative Sample Candidates Construction
Negative Sampling Algorithms
Static Negative Sampling
Hard Negative Sampling
GAN-based Negative Sampling
Auxiliary-based Negative Sampling.
In-batch Negative Sampling
Applications
Negative Sampling in RS
...and 6 more sections

Figures (11)

Figure 1: Performance and Convergence speed comparison in the RS domain with the Non-NS and other methods using negative sampling on the Yelp2018 datasets with LightGCN as an encoder. PNS($\alpha$) denotes popularity-based negative sampling with various power parameters $\alpha$. DNS($\beta$) represents dynamic negative sampling with a different number of negative sample candidates $\beta$. The details of the negative sampling methods can be found in Section \ref{['sec:trace_back']}.
Figure 2: An illustration of a general framework that uses negative sampling. Positive and negative pairs are sampled implicitly or explicitly by positive and negative samplers respectively, both of them composing the training data. An encoder is applied for latent representation learning in various domains. In contrastive learning, positive pairs (i.e., Pos-view Pairs) are derived from data augmentations of the same instance or different perspectives of the same entity, while in other domains, such as metric learning, positive pairs (Pos-coexist Pairs) are the other instances in the dataset.
Figure 3: The five development lines of negative sampling. Each development line addresses different challenges and has been adopted and adapted to suit the specific needs of individual domains, such as RS, NLP, GRL, KGE, and CV.
Figure 4: Performance comparison with different choices of $\beta$ on GRL and RS domains, respectively.
Figure 5: (a) Performance comparison between Static NS and Hard NS; (b) Impact of the number of negative candidates in DNS.
...and 6 more figures

Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

TL;DR

Abstract

Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

Authors

TL;DR

Abstract

Table of Contents

Figures (11)