Efficient Sparse Attention needs Adaptive Token Release
Chaoran Zhang, Lixin Zou, Dan Luo, Xiangyang Luo, Zihao Li, Min Tang, Chenliang Li
- Anthology ID: 2024.findings-acl.837
- Volume: Findings of the Association for Computational Linguistics: ACL 2024
- Month: August
- Year: 2024
- Address: Bangkok, Thailand
- Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
- Venue: Findings
- Publisher: Association for Computational Linguistics
- Pages: 14081–14094
- URL: https://preview.aclanthology.org/Author-page-Marten-During-lu/2024.findings-acl.837/
- DOI: 10.18653/v1/2024.findings-acl.837
- Cite (ACL): Chaoran Zhang, Lixin Zou, Dan Luo, Xiangyang Luo, Zihao Li, Min Tang, and Chenliang Li. 2024. Efficient Sparse Attention needs Adaptive Token Release. In Findings of the Association for Computational Linguistics: ACL 2024, pages 14081–14094, Bangkok, Thailand. Association for Computational Linguistics.
- Cite (Informal): Efficient Sparse Attention needs Adaptive Token Release (Zhang et al., Findings 2024)
- PDF: https://preview.aclanthology.org/Author-page-Marten-During-lu/2024.findings-acl.837.pdf