VnCoreNLP: A Vietnamese Natural Language Processing Toolkit
Thanh Vu, Dat Quoc Nguyen, Dai Quoc Nguyen, Mark Dras, Mark Johnson
Abstract
We present an easy-to-use and fast toolkit, namely VnCoreNLP—a Java NLP annotation pipeline for Vietnamese. Our VnCoreNLP supports key natural language processing (NLP) tasks including word segmentation, part-of-speech (POS) tagging, named entity recognition (NER) and dependency parsing, and obtains state-of-the-art (SOTA) results for these tasks. We release VnCoreNLP to provide rich linguistic annotations to facilitate research work on Vietnamese NLP. Our VnCoreNLP is open-source and available at: https://github.com/vncorenlp/VnCoreNLP- Anthology ID:
 - N18-5012
 - Volume:
 - Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations
 - Month:
 - June
 - Year:
 - 2018
 - Address:
 - New Orleans, Louisiana
 - Editors:
 - Yang Liu, Tim Paek, Manasi Patwardhan
 - Venue:
 - NAACL
 - SIG:
 - Publisher:
 - Association for Computational Linguistics
 - Note:
 - Pages:
 - 56–60
 - Language:
 - URL:
 - https://aclanthology.org/N18-5012
 - DOI:
 - 10.18653/v1/N18-5012
 - Cite (ACL):
 - Thanh Vu, Dat Quoc Nguyen, Dai Quoc Nguyen, Mark Dras, and Mark Johnson. 2018. VnCoreNLP: A Vietnamese Natural Language Processing Toolkit. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pages 56–60, New Orleans, Louisiana. Association for Computational Linguistics.
 - Cite (Informal):
 - VnCoreNLP: A Vietnamese Natural Language Processing Toolkit (Vu et al., NAACL 2018)
 - PDF:
 - https://preview.aclanthology.org/ingest-acl-2023-videos/N18-5012.pdf
 - Code
 - vncorenlp/VnCoreNLP + additional community code