Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions

Zeyneb N. Kaya; Souvick Ghosh

Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions

Zeyneb N. Kaya, Souvick Ghosh

TL;DR

This paper presents a systematic review of large language model (LLM) research (2016–2023) to map aims, methodologies, limitations, and societal impacts. By aggregating 61 influential studies across 16 venues, it identifies three main RQ-driven axes: aims (responsible development, performance improvements, investigative understanding), methodologies (datasets/benchmarks, input/output design, training, and understanding), and limitations (performance gaps, study constraints, and societal concerns). The work highlights ethical considerations, data and benchmark development, prompting and tool-use strategies, and the balance between openness and safety in model release. It offers a structured synthesis of current progress and prescribes directions for responsible, scalable, and transparent LLM research and deployment.

Abstract

There have been rapid advancements in the capabilities of large language models (LLMs) in recent years, greatly revolutionizing the field of natural language processing (NLP) and artificial intelligence (AI) to understand and interact with human language. Therefore, in this work, we conduct a systematic investigation of the literature to identify the prominent themes and directions of LLM developments, impacts, and limitations. Our findings illustrate the aims, methodologies, limitations, and future directions of LLM research. It includes responsible development considerations, algorithmic improvements, ethical challenges, and societal implications of LLM development. Overall, this paper provides a rigorous and comprehensive overview of current research in LLM and identifies potential directions for future development. The article highlights the application areas that could have a positive impact on society along with the ethical considerations.

Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions

TL;DR

Abstract

Paper Structure (26 sections, 5 figures)

This paper contains 26 sections, 5 figures.

Introduction
Systematic Reviews
Systematic Review Methodology
Selection of Sources (Inclusion/Exclusion Criteria)
Thematic Analysis Procedure / Coding Scheme
Characteristics of Publications
Aims & Objectives (RQ1)
Responsible Development Considerations
Improving LLM Performance
Investigative Studies
Methodologies & Capabilities (RQ2)
Dataset and Benchmark Development
Model Input/Output
Model Training
LLM Understanding
...and 11 more sections

Figures (5)

Figure 1: Steps of Performing a Systematic Review.
Figure 2: Frequencies of Themes and Sub Themes.
Figure 3: Publication Statistics.
Figure 4: Avg. Number of Authors Per Paper by Year.
Figure 5: WordCloud of Author Affiliations.

Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions

TL;DR

Abstract

Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions

TL;DR

Abstract

Table of Contents

Figures (5)