Abstract
“Collocate list and collocation network are two widely used representation methods of colloca-tions, but they have significant weaknesses in representing contextual information. To solve thisproblem, we propose a new representation method, namely the contextualized representation ofcollocate (CRC), which highlights the importance of the position of the collocates and pins acollocate as the interaction of two dimensions: association strength and co-occurrence position. With a full image of all the collocates surrounding the node word, CRC carries the contextualinformation and makes the representation more informative and intuitive. Through three casestudies, i.e., synonym distinction, image analysis, and efficiency in lexical use, we demonstratethe advantages of CRC in practical applications. CRC is also a new quantitative tool to measurelexical usage pattern similarities for corpus-based research. It can provide a new representationframework for language researchers and learners.”- Anthology ID:
- 2023.ccl-1.71
- Volume:
- Proceedings of the 22nd Chinese National Conference on Computational Linguistics
- Month:
- August
- Year:
- 2023
- Address:
- Harbin, China
- Editors:
- Maosong Sun, Bing Qin, Xipeng Qiu, Jing Jiang, Xianpei Han
- Venue:
- CCL
- SIG:
- Publisher:
- Chinese Information Processing Society of China
- Note:
- Pages:
- 836–846
- Language:
- English
- URL:
- https://aclanthology.org/2023.ccl-1.71
- DOI:
- Cite (ACL):
- Liu Daohuan and Tang Xuri. 2023. The Contextualized Representation of Collocation. In Proceedings of the 22nd Chinese National Conference on Computational Linguistics, pages 836–846, Harbin, China. Chinese Information Processing Society of China.
- Cite (Informal):
- The Contextualized Representation of Collocation (Daohuan & Xuri, CCL 2023)
- PDF:
- https://preview.aclanthology.org/naacl24-info/2023.ccl-1.71.pdf