SpeechReporting Corpus: annotated corpora of West African traditional narratives
Ekaterina Aplonova, Izabela Jordanoska, Timofey Arkhangelskiy, Tatiana Nikitina
Abstract
This paper describes the SpeechReporting database, an online collection of corpora annotated for a range of discourse phenomena. The corpora contain folktales from 7 lesser-studied West African languages. Apart from its value for theoretical linguistics, especially for the study of reported speech, the database is an important resource for the preservation of intangible cultural heritage of minority languages and the development and testing of cross-linguistically applicable computational tools.
- Anthology ID:
- 2023.rail-1.4
- Volume:
- Proceedings of the Fourth workshop on Resources for African Indigenous Languages (RAIL 2023)
- Month:
- May
- Year:
- 2023
- Address:
- Dubrovnik, Croatia
- Editors:
- Rooweither Mabuya, Don Mthobela, Mmasibidi Setaka, Menno Van Zaanen
- Venue:
- RAIL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 26–31
- URL:
- https://aclanthology.org/2023.rail-1.4
- DOI:
- 10.18653/v1/2023.rail-1.4
- Cite (ACL):
- Ekaterina Aplonova, Izabela Jordanoska, Timofey Arkhangelskiy, and Tatiana Nikitina. 2023. SpeechReporting Corpus: annotated corpora of West African traditional narratives. In Proceedings of the Fourth workshop on Resources for African Indigenous Languages (RAIL 2023), pages 26–31, Dubrovnik, Croatia. Association for Computational Linguistics.
- Cite (Informal):
- SpeechReporting Corpus: annotated corpora of West African traditional narratives (Aplonova et al., RAIL 2023)
- PDF:
- https://preview.aclanthology.org/naacl24-info/2023.rail-1.4.pdf