Xiaozhe Ren
2025
DAPE V2: Process Attention Score as Feature Map for Length Extrapolation
Chuanyang Zheng | Yihang Gao | Han Shi | Jing Xiong | Jiankai Sun | Jingyao Li | Minbin Huang | Xiaozhe Ren | Michael Ng | Xin Jiang | Zhenguo Li | Yu Li
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Self-Adjust Softmax
Chuanyang Zheng | Yihang Gao | Guoxuan Chen | Han Shi | Jing Xiong | Xiaozhe Ren | Chao Huang | Zhenguo Li | Yu Li
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
2023
CAME: Confidence-guided Adaptive Memory Efficient Optimization
Yang Luo | Xiaozhe Ren | Zangwei Zheng | Zhuo Jiang | Xin Jiang | Yang You
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2021
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation
Chenhe Dong | Guangrun Wang | Hang Xu | Jiefeng Peng | Xiaozhe Ren | Xiaodan Liang
Findings of the Association for Computational Linguistics: EMNLP 2021