A Corpus of Simulated Counselling Sessions with Dialog Act Annotation

John Lee, Haley Fong, Lai Shuen Judy Wong, Chun Chung Mak, Chi Hin Yip, Ching Wah Larry Ng


Abstract
We present a corpus of simulated counselling sessions consisting of speech- and text-based dialogs in Cantonese. Consisting of 152K Chinese characters, the corpus labels the dialog act of both client and counsellor utterances, segments each dialog into stages, and identifies the forward and backward links in the dialog. We analyze the distribution of client and counsellor communicative intentions in the various stages, and discuss significant patterns of the dialog flow.
Anthology ID:
2022.lrec-1.615
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
5723–5730
Language:
URL:
https://aclanthology.org/2022.lrec-1.615
DOI:
Bibkey:
Cite (ACL):
John Lee, Haley Fong, Lai Shuen Judy Wong, Chun Chung Mak, Chi Hin Yip, and Ching Wah Larry Ng. 2022. A Corpus of Simulated Counselling Sessions with Dialog Act Annotation. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 5723–5730, Marseille, France. European Language Resources Association.
Cite (Informal):
A Corpus of Simulated Counselling Sessions with Dialog Act Annotation (Lee et al., LREC 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/improve-issue-templates/2022.lrec-1.615.pdf