Table of Contents
Fetching ...

Advances in Artificial Intelligence: A Review for the Creative Industries

Nantheera Anantrasirichai, Fan Zhang, David Bull

TL;DR

The paper surveys AI advances since 2022, emphasizing generative AI, LLMs, diffusion models, and INRs, and analyzes their integrated impact across the creative production pipeline. It highlights a shift from AI as a support tool to a core creative technology through unified frameworks, multimodal capabilities, and real-time 3D/immersive content generation. The authors discuss human-AI collaboration, limitations, and pressing challenges such as copyright, bias, compute demands, and regulatory needs. The study catalogs developments across content creation, information analysis, post-production, compression, and quality assessment, offering a cross-domain perspective on capabilities, constraints, and trajectories for researchers and practitioners.

Abstract

Artificial intelligence (AI) has undergone transformative advances since 2022, particularly through generative AI, large language models (LLMs), and diffusion models, fundamentally reshaping the creative industries. However, existing reviews have not comprehensively addressed these recent breakthroughs and their integrated impact across the creative production pipeline. This paper addresses this gap by providing a systematic review of AI technologies that have emerged or matured since our 2022 review, examining their applications across content creation, information analysis, post-production enhancement, compression, and quality assessment. We document how transformers, LLMs, diffusion models, and implicit neural representations have established new capabilities in text-to-image/video generation, real-time 3D reconstruction, and unified multi-task frameworks-shifting AI from support tool to core creative technology. Beyond technological advances, we analyze the trend toward unified AI frameworks that integrate multiple creative tasks, replacing task-specific solutions. We critically examine the evolving role of human-AI collaboration, where human oversight remains essential for creative direction and mitigating AI hallucinations. Finally, we identify emerging challenges including copyright concerns, bias mitigation, computational demands, and the need for robust regulatory frameworks. This review provides researchers and practitioners with a comprehensive understanding of current AI capabilities, limitations, and future trajectories in creative applications.

Advances in Artificial Intelligence: A Review for the Creative Industries

TL;DR

The paper surveys AI advances since 2022, emphasizing generative AI, LLMs, diffusion models, and INRs, and analyzes their integrated impact across the creative production pipeline. It highlights a shift from AI as a support tool to a core creative technology through unified frameworks, multimodal capabilities, and real-time 3D/immersive content generation. The authors discuss human-AI collaboration, limitations, and pressing challenges such as copyright, bias, compute demands, and regulatory needs. The study catalogs developments across content creation, information analysis, post-production, compression, and quality assessment, offering a cross-domain perspective on capabilities, constraints, and trajectories for researchers and practitioners.

Abstract

Artificial intelligence (AI) has undergone transformative advances since 2022, particularly through generative AI, large language models (LLMs), and diffusion models, fundamentally reshaping the creative industries. However, existing reviews have not comprehensively addressed these recent breakthroughs and their integrated impact across the creative production pipeline. This paper addresses this gap by providing a systematic review of AI technologies that have emerged or matured since our 2022 review, examining their applications across content creation, information analysis, post-production enhancement, compression, and quality assessment. We document how transformers, LLMs, diffusion models, and implicit neural representations have established new capabilities in text-to-image/video generation, real-time 3D reconstruction, and unified multi-task frameworks-shifting AI from support tool to core creative technology. Beyond technological advances, we analyze the trend toward unified AI frameworks that integrate multiple creative tasks, replacing task-specific solutions. We critically examine the evolving role of human-AI collaboration, where human oversight remains essential for creative direction and mitigating AI hallucinations. Finally, we identify emerging challenges including copyright concerns, bias mitigation, computational demands, and the need for robust regulatory frameworks. This review provides researchers and practitioners with a comprehensive understanding of current AI capabilities, limitations, and future trajectories in creative applications.
Paper Structure (47 sections, 3 equations, 7 figures, 2 tables)

This paper contains 47 sections, 3 equations, 7 figures, 2 tables.

Figures (7)

  • Figure 1: Generative AI. (a) Transformer architecture Vaswani:attention:2017. (b) The top row represents the diffusion process and the bottom row represents the generation process of the new image Yang:diffusion:2023. (c) Latent Diffusion Models (LDM) Rombach:LDM:2022.
  • Figure 2: (a) Timeline of large language models. (b) Performance comparison evaluated by FLASK Ye:FLASK:2024.
  • Figure 3: Text-to-image generation (generated on 27 November 2024). (a) text-to-image generation by Ideogram v1, DALL·E 3, Photoshop 2025, and sdxy-turbo by Nvidia. (b) The top-row images were generated by DALL·E in ChatGPT 4. The bottom-row images are generated by LLM-grounded Diffusion Lian:LLMG:2024.
  • Figure 4: (Left) Video-to-3D post animation by DeepMotion. (Right) Image and audio to video by VASA-1 xu:VASA-1:2024
  • Figure 5: (Left) Examples of SR ($\times$4) using generative model. (Right) Real-time portrait editing with FacePoke.
  • ...and 2 more figures