Kaiyue Wen
2025
Demons in the Detail: On Implementing Load Balancing Loss for Training Specialized Mixture-of-Expert Models
Zihan Qiu | Zeyu Huang | Bo Zheng | Kaiyue Wen | Zekun Wang | Rui Men | Ivan Titov | Dayiheng Liu | Jingren Zhou | Junyang Lin
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images
Shengguang Wu | Fan-Yun Sun | Kaiyue Wen | Nick Haber
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2022
Finding Skill Neurons in Pre-trained Transformer-based Language Models
Xiaozhi Wang | Kaiyue Wen | Zhengyan Zhang | Lei Hou | Zhiyuan Liu | Juanzi Li
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
On Transferability of Prompt Tuning for Natural Language Processing
Yusheng Su | Xiaozhi Wang | Yujia Qin | Chi-Min Chan | Yankai Lin | Huadong Wang | Kaiyue Wen | Zhiyuan Liu | Peng Li | Juanzi Li | Lei Hou | Maosong Sun | Jie Zhou
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies