UER: An Open-Source Toolkit for Pre-training Models

Zhe Zhao, Hui Chen, Jinbin Zhang, Xin Zhao, Tao Liu, Wei Lu, Xi Chen, Haotang Deng, Qi Ju, Xiaoyong Du



Abstract
Existing works, including ELMo and BERT, have revealed the importance of pre-training for NLP tasks. Since no single pre-training model works best in all cases, it is necessary to develop a framework that can deploy various pre-training models efficiently. For this purpose, we propose an assemble-on-demand pre-training toolkit, Universal Encoder Representations (UER). UER is loosely coupled and encapsulates a rich set of modules. By assembling modules on demand, users can either reproduce a state-of-the-art pre-training model or develop a pre-training model that remains unexplored. With UER, we have built a model zoo that contains pre-trained models based on different corpora, encoders, and targets (objectives). With proper pre-trained models, we achieve new state-of-the-art results on a range of downstream datasets.
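
To make the abstract's "assemble-on-demand" idea concrete, below is a minimal Python sketch of combining interchangeable encoder and target (objective) modules into a pre-training model. All class, function, and registry names in the sketch are illustrative assumptions, not UER-py's actual API; see the dbiir/UER-py repository for the real interfaces.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class TransformerEncoder(nn.Module):
        # Transformer-based encoder option (illustrative).
        def __init__(self, vocab_size, hidden_size):
            super().__init__()
            self.embedding = nn.Embedding(vocab_size, hidden_size)
            layer = nn.TransformerEncoderLayer(d_model=hidden_size, nhead=4, batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        def forward(self, ids):
            return self.encoder(self.embedding(ids))

    class LstmEncoder(nn.Module):
        # Recurrent encoder option (illustrative).
        def __init__(self, vocab_size, hidden_size):
            super().__init__()
            self.embedding = nn.Embedding(vocab_size, hidden_size)
            self.rnn = nn.LSTM(hidden_size, hidden_size, batch_first=True)
        def forward(self, ids):
            output, _ = self.rnn(self.embedding(ids))
            return output

    class MlmTarget(nn.Module):
        # Masked-language-model objective head (illustrative).
        def __init__(self, vocab_size, hidden_size):
            super().__init__()
            self.proj = nn.Linear(hidden_size, vocab_size)
        def forward(self, hidden, labels):
            logits = self.proj(hidden)
            return F.cross_entropy(logits.view(-1, logits.size(-1)), labels.view(-1))

    # Registries of interchangeable modules; assembling different entries
    # yields different pre-training models.
    ENCODERS = {"transformer": TransformerEncoder, "lstm": LstmEncoder}
    TARGETS = {"mlm": MlmTarget}

    def assemble(encoder_name, target_name, vocab_size=1000, hidden_size=128):
        # Pick an encoder and a training target by name and combine them.
        encoder = ENCODERS[encoder_name](vocab_size, hidden_size)
        target = TARGETS[target_name](vocab_size, hidden_size)
        return encoder, target

    # A BERT-like combination: transformer encoder + masked LM target.
    encoder, target = assemble("transformer", "mlm")
    ids = torch.randint(0, 1000, (2, 16))    # toy token ids
    loss = target(encoder(ids), ids)         # loss computed against the ids, just to show the wiring

Swapping "transformer" for "lstm", or registering a different target, produces a different pre-training model without touching the rest of the pipeline, which is the modularity the abstract describes.
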
Anthology ID:
D19-3041
Volume:
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Sebastian Padó, Ruihong Huang
Venues:
EMNLP | IJCNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Pages:
241–246
URL:
https://aclanthology.org/D19-3041
DOI:
10.18653/v1/D19-3041
Cite (ACL):
Zhe Zhao, Hui Chen, Jinbin Zhang, Xin Zhao, Tao Liu, Wei Lu, Xi Chen, Haotang Deng, Qi Ju, and Xiaoyong Du. 2019. UER: An Open-Source Toolkit for Pre-training Models. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): System Demonstrations, pages 241–246, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
UER: An Open-Source Toolkit for Pre-training Models (Zhao et al., EMNLP-IJCNLP 2019)
PDF:
https://preview.aclanthology.org/teach-a-man-to-fish/D19-3041.pdf
Code:
dbiir/UER-py
Data:
GLUE