Abstract
This paper presents the identification of formulaic sequences in the reference corpus of spoken Slovenian and their annotation in terms of syntactic structure, pragmatic function and lexicographic relevance. The annotation campaign, specific in terms of setting, subjectivity and the multifunctionality of items under investigation, resulted in a preliminary lexicon of formulaic sequences in spoken Slovenian with immediate potential for future explorations in formulaic language research. This is especially relevant for the notable number of identified multi-word expressions with discourse-structuring and stance-marking functions, which have often been overlooked by traditional phraseology research.- Anthology ID:
- W19-4013
- Volume:
- Proceedings of the 13th Linguistic Annotation Workshop
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Editors:
- Annemarie Friedrich, Deniz Zeyrek, Jet Hoek
- Venue:
- LAW
- SIG:
- SIGANN
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 108–112
- Language:
- URL:
- https://aclanthology.org/W19-4013
- DOI:
- 10.18653/v1/W19-4013
- Cite (ACL):
- Kaja Dobrovoljc. 2019. Annotating formulaic sequences in spoken Slovenian: structure, function and relevance. In Proceedings of the 13th Linguistic Annotation Workshop, pages 108–112, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Annotating formulaic sequences in spoken Slovenian: structure, function and relevance (Dobrovoljc, LAW 2019)
- PDF:
- https://preview.aclanthology.org/proper-vol2-ingestion/W19-4013.pdf