DiMLex-Bangla: A Lexicon of Bangla Discourse Connectives

Debopam Das, Manfred Stede, Soumya Sankar Ghosh, Lahari Chatterjee


Abstract
We present DiMLex-Bangla, a newly developed lexicon of discourse connectives in Bangla. The lexicon, upon completion of its first version, contains 123 Bangla connective entries, which are primarily compiled from the linguistic literature and translation of English discourse connectives. The lexicon compilation is later augmented by adding more connectives from a currently developed corpus, called the Bangla RST Discourse Treebank (Das and Stede, 2018). DiMLex-Bangla provides information on syntactic categories of Bangla connectives, their discourse semantics and non-connective uses (if any). It uses the format of the German connective lexicon DiMLex (Stede and Umbach, 1998), which provides a cross-linguistically applicable XML schema. The resource is the first of its kind in Bangla, and is freely available for use in studies on discourse structure and computational applications.
Anthology ID:
2020.lrec-1.138
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
1097–1102
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.138
DOI:
Bibkey:
Cite (ACL):
Debopam Das, Manfred Stede, Soumya Sankar Ghosh, and Lahari Chatterjee. 2020. DiMLex-Bangla: A Lexicon of Bangla Discourse Connectives. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 1097–1102, Marseille, France. European Language Resources Association.
Cite (Informal):
DiMLex-Bangla: A Lexicon of Bangla Discourse Connectives (Das et al., LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/2020.lrec-1.138.pdf