ACL Anthology
Brett H. Meyer
2024
Intermediate Layer Distillation with the Reused Teacher Classifier: A Study on the Importance of the Classifier of Attention-based Models
Hang Zhang | Seyyed Hasan Mozafari | James J. Clark | Brett H. Meyer | Warren J. Gross
Findings of the Association for Computational Linguistics: EMNLP 2024
Co-authors
James J. Clark (1)
Warren J. Gross (1)
Seyyed Hasan Mozafari (1)
Hang Zhang (1)
Venues
findings (1)