Abstract
Collocation and idiom extraction are well-known challenges with many potential applications in Natural Language Processing (NLP). Our experimental, open-source software system, called ICE, is a Python package for flexibly extracting collocations and idioms, currently in English. It also has a competitive POS tagger that can be used alone or as part of collocation/idiom extraction. ICE is available free of cost for research and educational uses in two user-friendly formats. This paper gives an overview of ICE and its performance, and briefly describes the research underlying the extraction algorithms.

- Anthology ID: E17-3027
- Volume: Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics
- Month: April
- Year: 2017
- Address: Valencia, Spain
- Venue: EACL
- Publisher: Association for Computational Linguistics
- Pages: 108–111
- URL: https://aclanthology.org/E17-3027
- Cite (ACL): Vasanthi Vuppuluri, Shahryar Baki, An Nguyen, and Rakesh Verma. 2017. ICE: Idiom and Collocation Extractor for Research and Education. In Proceedings of the Software Demonstrations of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pages 108–111, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal): ICE: Idiom and Collocation Extractor for Research and Education (Vuppuluri et al., EACL 2017)
- PDF: https://preview.aclanthology.org/starsem-semeval-split/E17-3027.pdf