Event Extraction from Unstructured Amharic Text

Ephrem Tadesse, Rosa Tsegaye, Kuulaa Qaqqabaa


Abstract
In information extraction, event extraction is one of the types that extract the specific knowledge of certain incidents from texts. Event extraction has been done on different languages text but not on one of the Semitic language, Amharic. In this study, we present a system that extracts an event from unstructured Amharic text. The system has designed by the integration of supervised machine learning and rule-based approaches. We call this system a hybrid system. The system uses the supervised machine learning to detect events from the text and the handcrafted and the rule-based rules to extract the event from the text. For the event extraction, we have been using event arguments. Event arguments identify event triggering words or phrases that clearly express the occurrence of the event. The event argument attributes can be verbs, nouns, sometimes adjectives (such as ̃rg/wedding) and time as well. The hybrid system has compared with the standalone rule-based method that is well known for event extraction. The study has shown that the hybrid system has outperformed the standalone rule-based method.
Anthology ID:
2020.lrec-1.258
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
2103–2109
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.258
DOI:
Bibkey:
Cite (ACL):
Ephrem Tadesse, Rosa Tsegaye, and Kuulaa Qaqqabaa. 2020. Event Extraction from Unstructured Amharic Text. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 2103–2109, Marseille, France. European Language Resources Association.
Cite (Informal):
Event Extraction from Unstructured Amharic Text (Tadesse et al., LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2020.lrec-1.258.pdf