A Survey of the Usages of Deep Learning in Natural Language Processing
Daniel W. Otter, Julian R. Medina, Jugal K. Kalita
TL;DR
The paper surveys how deep learning has transformed NLP, spanning architectures, core tasks, and practical applications. It traces the evolution from CNNs and RNNs to attention-based Transformers, highlighting pretraining and encoder–decoder frameworks as central. Key findings include that Transformer-based pre-trained models (e.g., BERT, GPT) achieve state-of-the-art across many tasks, but data requirements and language coverage remain important challenges. The work emphasizes evaluation, cross-lingual transfer, and the need for broader multilingual resources, while anticipating continued progress driven by scalable pretraining and multilingual benchmarks.
Abstract
Over the last several years, the field of natural language processing has been propelled forward by an explosion in the use of deep learning models. This survey provides a brief introduction to the field and a quick overview of deep learning architectures and methods. It then sifts through the plethora of recent studies and summarizes a large assortment of relevant contributions. Analyzed research areas include several core linguistic processing issues in addition to a number of applications of computational linguistics. A discussion of the current state of the art is then provided along with recommendations for future research in the field.
