Abstract
In this paper we focus on creation of interoperable annotation resources that make up a significant proportion of an on-going project on the development of conceptually annotated multilingual corpora for the domain of terrorist attacks in three languages (English, French and Russian) that can be used for comparative linguistic research, intelligent content and trend analysis, summarization, machine translation, etc. Conceptual annotation is understood as a type of task-oriented domain-specific semantic annotation. The annotation process in our project relies on ontological analysis. The paper details on the issues of the development of both static and dynamic resources such as a universal conceptual annotation scheme, multilingual domain ontology and multipurpose annotation platform with flexible settings, which can be used for the automation of the conceptual resource acquisition and of the annotation process, as well as for the documentation of the annotated corpora specificities. The resources constructed in the course of the research are also to be used for developing concept disambiguation metrics by means of qualitative and quantitative analysis of the golden portion of the conceptually annotated multilingual corpora and of the annotation platform linguistic knowledge.- Anthology ID:
- 2020.isa-1.12
- Volume:
- 16th Joint ACL - ISO Workshop on Interoperable Semantic Annotation PROCEEDINGS
- Month:
- May
- Year:
- 2020
- Address:
- Marseille
- Venue:
- ISA
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 102–109
- Language:
- English
- URL:
- https://aclanthology.org/2020.isa-1.12
- DOI:
- Cite (ACL):
- Svetlana Sheremetyeva. 2020. Towards Creating Interoperable Resources for Conceptual Annotation of Multilingual Domain Corpora. In 16th Joint ACL - ISO Workshop on Interoperable Semantic Annotation PROCEEDINGS, pages 102–109, Marseille. European Language Resources Association.
- Cite (Informal):
- Towards Creating Interoperable Resources for Conceptual Annotation of Multilingual Domain Corpora (Sheremetyeva, ISA 2020)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/2020.isa-1.12.pdf