Table of Contents
Fetching ...

Eliciting Topic Hierarchies from Large Language Models

Grace Li, Tao Long, Lydia B. Chilton

TL;DR

This work investigates using large language models to aid topic scoping by incrementally generating topic hierarchies up to five levels of specificity. It introduces a 5-Level Topic Classification system and compares three prompting strategies, finding that including the full path of parent topics in prompts markedly improves subtopic quality, with the Full Path + Current Topic approach achieving about 77% accuracy on a Wikipedia CS test suite. Human annotators reported substantial agreement (Cohen-Kappa 0.61), and the analysis highlights that most errors are Too General or Too Specific, especially at the deepest level. The results suggest LLMs can support education, content creation, and product management by enabling dynamic, audience-tailored topic exploration, while future work should focus on interactive interfaces and cross-domain generalization.

Abstract

Current research has explored how Generative AI can support the brainstorming process for content creators, but a gap remains in exploring support-tools for the pre-writing process. Specifically, our research is focused on supporting users in finding topics at the right level of specificity for their audience. This process is called topic scoping. Topic scoping is a cognitively demanding task, requiring users to actively recall subtopics in a given domain. This manual approach also reduces the diversity of subtopics that a user is able to explore. We propose using Large Language Models (LLMs) to support the process of topic scoping by iteratively generating subtopics at increasing levels of specificity: dynamically creating topic hierarchies. We tested three different prompting strategies and found that increasing the amount of context included in the prompt improves subtopic generation by 20 percentage points. Finally, we discuss applications of this research in education, content creation, and product management.

Eliciting Topic Hierarchies from Large Language Models

TL;DR

This work investigates using large language models to aid topic scoping by incrementally generating topic hierarchies up to five levels of specificity. It introduces a 5-Level Topic Classification system and compares three prompting strategies, finding that including the full path of parent topics in prompts markedly improves subtopic quality, with the Full Path + Current Topic approach achieving about 77% accuracy on a Wikipedia CS test suite. Human annotators reported substantial agreement (Cohen-Kappa 0.61), and the analysis highlights that most errors are Too General or Too Specific, especially at the deepest level. The results suggest LLMs can support education, content creation, and product management by enabling dynamic, audience-tailored topic exploration, while future work should focus on interactive interfaces and cross-domain generalization.

Abstract

Current research has explored how Generative AI can support the brainstorming process for content creators, but a gap remains in exploring support-tools for the pre-writing process. Specifically, our research is focused on supporting users in finding topics at the right level of specificity for their audience. This process is called topic scoping. Topic scoping is a cognitively demanding task, requiring users to actively recall subtopics in a given domain. This manual approach also reduces the diversity of subtopics that a user is able to explore. We propose using Large Language Models (LLMs) to support the process of topic scoping by iteratively generating subtopics at increasing levels of specificity: dynamically creating topic hierarchies. We tested three different prompting strategies and found that increasing the amount of context included in the prompt improves subtopic generation by 20 percentage points. Finally, we discuss applications of this research in education, content creation, and product management.
Paper Structure (16 sections, 2 figures, 4 tables)

This paper contains 16 sections, 2 figures, 4 tables.

Figures (2)

  • Figure 1: Topic hierarchy from Wikipedia's "Category: Computer science" with certain missing subtopics being filled in from college level curriculum. This Topic hierarchy is used as a test suite to evaluate the three different prompting strategies to generate subtopics in Level 2, Level 3, Level 4, and Level 5.
  • Figure 2: Bar chart demonstrating the percentage of properly scoped subtopics across all levels for each prompting strategy.