An automatic discourse relation alignment experiment on TED-MDB
Abstract
This paper describes an automatic discourse relation alignment experiment as an empirical justification of the planned annotation projection approach to enlarge the 3600-word multilingual corpus of TED Multilingual Discourse Bank (TED-MDB). The experiment is carried out on a single language pair (English-Turkish) included in TED-MDB. The paper first describes the creation of a large corpus of English-Turkish bi-sentences, then it presents a sense-based experiment that automatically aligns the relations in the English sentences of TED-MDB with the Turkish sentences. The results are very close to the results obtained from an earlier semi-automatic post-annotation alignment experiment validated by human annotators and are encouraging for future annotation projection tasks.- Anthology ID:
- W19-3612
- Volume:
- Proceedings of the 2019 Workshop on Widening NLP
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Venue:
- WiNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 31–34
- Language:
- URL:
- https://aclanthology.org/W19-3612
- DOI:
- Cite (ACL):
- Sibel Ozer and Deniz Zeyrek. 2019. An automatic discourse relation alignment experiment on TED-MDB. In Proceedings of the 2019 Workshop on Widening NLP, pages 31–34, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- An automatic discourse relation alignment experiment on TED-MDB (Ozer & Zeyrek, WiNLP 2019)