Table of Contents
Fetching ...

When Large Language Models Meet Vector Databases: A Survey

Zhi Jing, Yongye Su, Yikun Han, Bo Yuan, Haiyun Xu, Chunjiang Liu, Kehai Chen, Min Zhang

TL;DR

This survey analyzes how Vector Databases (VecDBs) can address core limitations of Large Language Models (LLMs), including hallucinations, outdated knowledge, high costs, and memory constraints. It surveys VecDB fundamentals, retrieval-augmented generation (RAG), semantic caching, and memory-enabled architectures, highlighting multimodal extensions and retrieval optimizations. The paper discusses practical challenges such as hybrid search needs, multi-modality, data preprocessing, multi-tenancy, and knowledge conflicts, offering a roadmap for future work and enabling more capable, scalable AI systems. Overall, it articulates a coherent framework for integrating VecDBs with LLMs to enhance data handling, knowledge extraction, and real-time adaptability in production settings.

Abstract

This survey explores the synergistic potential of Large Language Models (LLMs) and Vector Databases (VecDBs), a burgeoning but rapidly evolving research area. With the proliferation of LLMs comes a host of challenges, including hallucinations, outdated knowledge, prohibitive commercial application costs, and memory issues. VecDBs emerge as a compelling solution to these issues by offering an efficient means to store, retrieve, and manage the high-dimensional vector representations intrinsic to LLM operations. Through this nuanced review, we delineate the foundational principles of LLMs and VecDBs and critically analyze their integration's impact on enhancing LLM functionalities. This discourse extends into a discussion on the speculative future developments in this domain, aiming to catalyze further research into optimizing the confluence of LLMs and VecDBs for advanced data handling and knowledge extraction capabilities.

When Large Language Models Meet Vector Databases: A Survey

TL;DR

This survey analyzes how Vector Databases (VecDBs) can address core limitations of Large Language Models (LLMs), including hallucinations, outdated knowledge, high costs, and memory constraints. It surveys VecDB fundamentals, retrieval-augmented generation (RAG), semantic caching, and memory-enabled architectures, highlighting multimodal extensions and retrieval optimizations. The paper discusses practical challenges such as hybrid search needs, multi-modality, data preprocessing, multi-tenancy, and knowledge conflicts, offering a roadmap for future work and enabling more capable, scalable AI systems. Overall, it articulates a coherent framework for integrating VecDBs with LLMs to enhance data handling, knowledge extraction, and real-time adaptability in production settings.

Abstract

This survey explores the synergistic potential of Large Language Models (LLMs) and Vector Databases (VecDBs), a burgeoning but rapidly evolving research area. With the proliferation of LLMs comes a host of challenges, including hallucinations, outdated knowledge, prohibitive commercial application costs, and memory issues. VecDBs emerge as a compelling solution to these issues by offering an efficient means to store, retrieve, and manage the high-dimensional vector representations intrinsic to LLM operations. Through this nuanced review, we delineate the foundational principles of LLMs and VecDBs and critically analyze their integration's impact on enhancing LLM functionalities. This discourse extends into a discussion on the speculative future developments in this domain, aiming to catalyze further research into optimizing the confluence of LLMs and VecDBs for advanced data handling and knowledge extraction capabilities.
Paper Structure (30 sections, 2 figures, 1 table)