Ruihao Gong
2025
Pre3: Enabling Deterministic Pushdown Automata for Faster Structured LLM Generation
Junyi Chen | Shihao Bai | Zaijun Wang | Siyu Wu | Chuheng Du | Hailong Yang | Ruihao Gong | Shengzhong Liu | Fan Wu | Guihai Chen
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2024
LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit
Ruihao Gong | Yang Yong | Shiqiao Gu | Yushi Huang | Chengtao Lv | Yunchen Zhang | Dacheng Tao | Xianglong Liu
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track
2023
Outlier Suppression+: Accurate quantization of large language models by equivalent and effective shifting and scaling
Xiuying Wei | Yunchen Zhang | Yuhang Li | Xiangguo Zhang | Ruihao Gong | Jinyang Guo | Xianglong Liu
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Adaptive Contrastive Knowledge Distillation for BERT Compression
Jinyang Guo | Jiaheng Liu | Zining Wang | Yuqing Ma | Ruihao Gong | Ke Xu | Xianglong Liu
Findings of the Association for Computational Linguistics: ACL 2023
Co-authors
- Xianglong Liu 3
- Jinyang Guo 2
- Yunchen Zhang 2
- Shihao Bai 1
- Junyi Chen 1