On Classification with Large Language Models in Cultural Analytics

David Bamman; Kent K. Chang; Li Lucy; Naitian Zhou

TL;DR

Prompt-based LLMs are competitive with traditional supervised models on established tasks but perform less well on de novo tasks.

Abstract

In this work, we survey the way in which classification is used as a sensemaking practice in cultural analytics, and assess where large language models can fit into this landscape. We identify ten tasks supported by publicly available datasets on which we empirically assess the performance of LLMs compared to traditional supervised methods, and explore the ways in which LLMs can be employed for sensemaking goals beyond mere accuracy. We find that prompt-based LLMs are competitive with traditional supervised models for established tasks, but perform less well on de novo tasks. In addition, LLMs can assist sensemaking by acting as an intermediary input to formal theory testing.
