Labor Space: A Unifying Representation of the Labor Market via Large Language Models
Seongwoon Kim, Yong-Yeol Ahn, Jaehyuk Park
TL;DR
Labor Space provides a unifying, multi-type embedding of labor-market entities by fine-tuning a contextual language model on NAICS, O*NET, ESCO, and Crunchbase descriptions to produce a shared vector space. It enables cross-type proximity, axes-based positioning, and vector arithmetic to model economic shocks and technology exposure, offering a tool for policymakers and business leaders to reason about ripple effects within the labor ecosystem. The approach combines contextual representations with relation-aware training (triplet, cosine, and multiple-negatives ranking losses) to connect industries, occupations, skills, and firms, validated through axis projections and AI-exposure correlations. This framework has the potential to inform skill development, industry strategy, and policy interventions by providing a coherent, scalable view of the labor market dynamics.
Abstract
The labor market is a complex ecosystem comprising diverse, interconnected entities, such as industries, occupations, skills, and firms. Due to the lack of a systematic method to map these heterogeneous entities together, each entity has been analyzed in isolation or only through pairwise relationships, inhibiting comprehensive understanding of the whole ecosystem. Here, we introduce $\textit{Labor Space}$, a vector-space embedding of heterogeneous labor market entities, derived through applying a large language model with fine-tuning. Labor Space exposes the complex relational fabric of various labor market constituents, facilitating coherent integrative analysis of industries, occupations, skills, and firms, while retaining type-specific clustering. We demonstrate its unprecedented analytical capacities, including positioning heterogeneous entities on an economic axes, such as `Manufacturing--Healthcare'. Furthermore, by allowing vector arithmetic of these entities, Labor Space enables the exploration of complex inter-unit relations, and subsequently the estimation of the ramifications of economic shocks on individual units and their ripple effect across the labor market. We posit that Labor Space provides policymakers and business leaders with a comprehensive unifying framework for labor market analysis and simulation, fostering more nuanced and effective strategic decision-making.
