Automatic Identification of Explicit Connectives in Malayalam

Kumari Sheeja S, Sobha Lalitha Devi


Abstract
This work presents an automatic identification of explicit connectives and its arguments using supervised method, Conditional Random Fields (CRFs). In this work, we focus on the identification of connectives and their arguments in the corpus. We consider explicit connectives and its arguments for the present study. The corpus we have considered has 4,000 sentences from Malayalam documents and manually annotated the corpus for POS, chunk, clause, discourse connectives and its arguments. The corpus thus annotated is used for building the base engine. The percentage of the performance of the system is evaluated based on the precision, recall and F-score and obtained encouraging results. We have analysed the errors generated by the system and used the features obtained from the anlaysis to improve the performance of the system
Anthology ID:
2022.wildre-1.13
Volume:
Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Girish Nath Jha, Sobha L., Kalika Bali, Atul Kr. Ojha
Venue:
WILDRE
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
74–79
Language:
URL:
https://aclanthology.org/2022.wildre-1.13
DOI:
Bibkey:
Cite (ACL):
Kumari Sheeja S and Sobha Lalitha Devi. 2022. Automatic Identification of Explicit Connectives in Malayalam. In Proceedings of the WILDRE-6 Workshop within the 13th Language Resources and Evaluation Conference, pages 74–79, Marseille, France. European Language Resources Association.
Cite (Informal):
Automatic Identification of Explicit Connectives in Malayalam (Sheeja S & Lalitha Devi, WILDRE 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2022.wildre-1.13.pdf