Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology

Romy Beauté; David J. Schwartzman; Guillaume Dumas; Jennifer Crook; Fiona Macpherson; Adam B. Barrett; Anil K. Seth

Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology

Romy Beauté, David J. Schwartzman, Guillaume Dumas, Jennifer Crook, Fiona Macpherson, Adam B. Barrett, Anil K. Seth

TL;DR

The paper addresses the limitation of predefined questionnaires in capturing the full range of stroboscopically induced phenomenology by analyzing open-ended Dreamachine reports with a data-driven MOSAIC pipeline. It combines BERTopic topic modelling on sentence embeddings with Llama-3-8B-Instruct automatic labeling to identify latent experiential topics from 862 sentences across two Dreamachine variants. The HS and DL analyses reveal a spectrum from simple visual halluci nations to complex imagery and altered states, including substantial unassigned responses that underscore idiosyncratic experiences. The work demonstrates a practical, open-source workflow for analyzing subjective reports and highlights the potential to map phenomenological categories to neural data in future neurophenomenology research.

Abstract

Stroboscopic light stimulation (SLS) on closed eyes typically induces simple visual hallucinations (VHs), characterised by vivid, geometric and colourful patterns. A dataset of 862 sentences, extracted from 422 open subjective reports, was recently compiled as part of the Dreamachine programme (Collective Act, 2022), an immersive multisensory experience that combines SLS and spatial sound in a collective setting. Although open reports extend the range of reportable phenomenology, their analysis presents significant challenges, particularly in systematically identifying patterns. To address this challenge, we implemented a data-driven approach leveraging Large Language Models and Topic Modelling to uncover and interpret latent experiential topics directly from the Dreamachine's text-based reports. Our analysis confirmed the presence of simple VHs typically documented in scientific studies of SLS, while also revealing experiences of altered states of consciousness and complex hallucinations. Building on these findings, our computational approach expands the systematic study of subjective experience by enabling data-driven analyses of open-ended phenomenological reports, capturing experiences not readily identified through standard questionnaires. By revealing rich and multifaceted aspects of experiences, our study broadens our understanding of stroboscopically-induced phenomena while highlighting the potential of Natural Language Processing and Large Language Models in the emerging field of computational (neuro)phenomenology. More generally, this approach provides a practically applicable methodology for uncovering subtle hidden patterns of subjective experience across diverse research domains.

Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology

TL;DR

Abstract

Mapping of Subjective Accounts into Interpreted Clusters (MOSAIC): Topic Modelling and LLM applied to Stroboscopic Phenomenology

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)