One Swallow Does Not Make a Summer: Understanding Semantic Structures in Embedding Spaces
Yandong Sun, Qiang Huang, Ziwei Xu, Yiqun Sun, Yixuan Tang, Anthony K. H. Tung
TL;DR
This work introduces Semantic Field Subspaces (SFS) as a geometry-preserving, context-aware representation of local semantic neighborhoods within embedding spaces. The unsupervised SAFARI algorithm uses a novel Semantic Shift metric to uncover hierarchical semantic structures directly from embeddings, without re-embedding or external ontologies. An efficient approximation that replaces costly SVD computations accelerates shift computation, yielding 15–30x speedups with average errors below 0.01. Across six real-world datasets spanning text and vision, SFSes improve classification, reveal nuanced semantics (e.g., political bias), and support consistent, modality-agnostic hierarchical discovery. Together, SFS and SAFARI provide a scalable, interpretable framework for analyzing and leveraging semantic structure in multimodal embeddings.
Abstract
Embedding spaces are fundamental to modern AI, translating raw data into high-dimensional vectors that encode rich semantic relationships. Yet their internal structures remain opaque: existing approaches often sacrifice semantic coherence for structural regularity or incur high computational overhead to improve interpretability. To address these challenges, we introduce the Semantic Field Subspace (SFS), a geometry-preserving, context-aware representation that captures local semantic neighborhoods within the embedding space. We also propose SAFARI (SemAntic Field subspAce deteRmInation), an unsupervised, modality-agnostic algorithm that uncovers hierarchical semantic structures using a novel metric called Semantic Shift, which quantifies how semantics change as SFSes evolve. To ensure scalability, we develop an efficient approximation of Semantic Shift that replaces costly SVD computations, achieving a 15–30x speedup with average errors below 0.01. Extensive evaluations across six real-world text and image datasets show that SFSes outperform standard classifiers not only in classification but also in nuanced tasks such as political bias detection, while SAFARI consistently reveals interpretable and generalizable semantic hierarchies. This work presents a unified framework for structuring, analyzing, and scaling semantic understanding in embedding spaces.
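The abstract does not give the formal definitions of an SFS or of Semantic Shift, so the following is only an illustrative sketch and not the authors' method. It assumes (our assumption) that a local subspace can be approximated by the top singular directions of a point's k nearest neighbours, and that a "shift" between two such subspaces can be measured with principal angles. The names `local_subspace`, `subspace_shift`, `k`, and `rank` are hypothetical.

```python
import numpy as np


def local_subspace(X, query_idx, k, rank):
    """Return an orthonormal basis (d x rank) fitted to the k nearest
    neighbours of X[query_idx]. Hypothetical stand-in for a local subspace."""
    # Euclidean distance from the query embedding to every embedding
    dists = np.linalg.norm(X - X[query_idx], axis=1)
    neighbours = X[np.argsort(dists)[:k]]
    # Centre the neighbourhood so the SVD captures directions of variation
    centred = neighbours - neighbours.mean(axis=0)
    # Rows of vt are the principal directions, ordered by singular value
    _, _, vt = np.linalg.svd(centred, full_matrices=False)
    return vt[:rank].T


def subspace_shift(U, V):
    """Distance in [0, 1] between two equal-rank subspaces with orthonormal
    bases U and V, based on principal angles (an assumed proxy metric)."""
    # Singular values of U^T V are the cosines of the principal angles
    cosines = np.linalg.svd(U.T @ V, compute_uv=False)
    cosines = np.clip(cosines, 0.0, 1.0)
    # Root-mean-square sine of the principal angles
    return float(np.sqrt(np.mean(1.0 - cosines**2)))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(200, 16))  # synthetic embeddings, not real data
    small = local_subspace(X, query_idx=0, k=20, rank=4)
    large = local_subspace(X, query_idx=0, k=40, rank=4)
    print("shift when the neighbourhood grows:", subspace_shift(small, large))
```

In this sketch, a shift near 0 means the enlarged neighbourhood spans almost the same directions as the smaller one, while a value near 1 means the directions are nearly orthogonal. Note that it recomputes a full SVD for each neighbourhood, which is the kind of cost the abstract says the proposed approximation avoids.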
