Hanna Kędzierska


2022

pdf
DiaBiz.Kom - towards a Polish Dialogue Act Corpus Based on ISO 24617-2 Standard
Marcin Oleksy | Jan Wieczorek | Dorota Drużyłowska | Julia Klyus | Aleksandra Domogała | Krzysztof Hwaszcz | Hanna Kędzierska | Daria Mikoś | Anita Wróż
Proceedings of the 29th International Conference on Computational Linguistics

This article presents the specification and evaluation of DiaBiz.Kom – the corpus of dialogue texts in Polish. The corpus contains transcriptions of telephone conversations conducted according to a prepared scenario. The transcripts of conversations have been manually annotated with a layer of information concerning communicative functions. DiaBiz.Kom is the first corpus of this type prepared for the Polish language and will be used to develop a system of dialog analysis and modules for creating advanced chatbots.