Table of Contents
Fetching ...

ConvCounsel: A Conversational Dataset for Student Counseling

Po-Chuan Chen, Mahdin Rohmatillah, You-Teng Lin, Jen-Tzung Chien

TL;DR

A specialized mental health dataset that emphasizes the active listening strategy employed in conversation for counseling, also named as ConvCounsel, which comprises both speech and text data, which can facilitate the development of a reliable pipeline for mental health dialogue systems.

Abstract

Student mental health is a sensitive issue that necessitates special attention. A primary concern is the student-to-counselor ratio, which surpasses the recommended standard of 250:1 in most universities. This imbalance results in extended waiting periods for in-person consultations, which cause suboptimal treatment. Significant efforts have been directed toward developing mental health dialogue systems utilizing the existing open-source mental health-related datasets. However, currently available datasets either discuss general topics or various strategies that may not be viable for direct application due to numerous ethical constraints inherent in this research domain. To address this issue, this paper introduces a specialized mental health dataset that emphasizes the active listening strategy employed in conversation for counseling, also named as ConvCounsel. This dataset comprises both speech and text data, which can facilitate the development of a reliable pipeline for mental health dialogue systems. To demonstrate the utility of the proposed dataset, this paper also presents the NYCUKA, a spoken mental health dialogue system that is designed by using the ConvCounsel dataset. The results show the merit of using this dataset.

ConvCounsel: A Conversational Dataset for Student Counseling

TL;DR

A specialized mental health dataset that emphasizes the active listening strategy employed in conversation for counseling, also named as ConvCounsel, which comprises both speech and text data, which can facilitate the development of a reliable pipeline for mental health dialogue systems.

Abstract

Student mental health is a sensitive issue that necessitates special attention. A primary concern is the student-to-counselor ratio, which surpasses the recommended standard of 250:1 in most universities. This imbalance results in extended waiting periods for in-person consultations, which cause suboptimal treatment. Significant efforts have been directed toward developing mental health dialogue systems utilizing the existing open-source mental health-related datasets. However, currently available datasets either discuss general topics or various strategies that may not be viable for direct application due to numerous ethical constraints inherent in this research domain. To address this issue, this paper introduces a specialized mental health dataset that emphasizes the active listening strategy employed in conversation for counseling, also named as ConvCounsel. This dataset comprises both speech and text data, which can facilitate the development of a reliable pipeline for mental health dialogue systems. To demonstrate the utility of the proposed dataset, this paper also presents the NYCUKA, a spoken mental health dialogue system that is designed by using the ConvCounsel dataset. The results show the merit of using this dataset.

Paper Structure

This paper contains 11 sections, 4 figures, 4 tables.

Figures (4)

  • Figure 1: NYCUKA interface consisting of different components which include (A) expression switch for animated character (B) dialogue box (C) user input (either text or voice) (D) animated character
  • Figure 2: Procedure to conduct data collection for ConvCounsel dataset.
  • Figure 3: Implementation for NYCUKA dialogue system.
  • Figure 4: Procedure for comparing the quality of responses generated by GPT-4o using two different prompts.