Detection and Annotation of Events in Kannada

Suhan Prabhu, Ujwal Narayan, Alok Debnath, Sumukh S, Manish Shrivastava


Abstract
In this paper, we provide basic guidelines for the detection and linguistic analysis of events in Kannada. Kannada is a morphologically rich, resource-poor Dravidian language spoken in southern India. Because most information retrieval and extraction tasks are resource intensive, very little work has been done on Kannada NLP, with almost no effort in discourse analysis or in dataset creation for representing events or other semantic annotations in text. We linguistically analyze what constitutes an event in this language and the challenges faced in discourse-level annotation and representation due to the language's rich derivational morphology, which allows free word order, numerous multi-word expressions, adverbial participle constructions and constraints on subject-verb relations. This paper is thus one of the first attempts at large-scale discourse-level annotation for Kannada, which can be used for semantic annotation and corpus development for other tasks in the language.
Anthology ID:
2020.isa-1.10
Volume:
Proceedings of the 16th Joint ACL-ISO Workshop on Interoperable Semantic Annotation
Month:
May
Year:
2020
Address:
Marseille
Editor:
Harry Bunt
Venue:
ISA
Publisher:
European Language Resources Association
Pages:
88–93
Language:
English
URL:
https://aclanthology.org/2020.isa-1.10
Cite (ACL):
Suhan Prabhu, Ujwal Narayan, Alok Debnath, Sumukh S, and Manish Shrivastava. 2020. Detection and Annotation of Events in Kannada. In Proceedings of the 16th Joint ACL-ISO Workshop on Interoperable Semantic Annotation, pages 88–93, Marseille. European Language Resources Association.
Cite (Informal):
Detection and Annotation of Events in Kannada (Prabhu et al., ISA 2020)
PDF:
https://preview.aclanthology.org/ingest-2024-clasp/2020.isa-1.10.pdf