- Anthology ID:
- 2023.conll-babylm.24
- Volume:
- Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning
- Month:
- December
- Year:
- 2023
- Address:
- Singapore
- Editors:
- Alex Warstadt, Aaron Mueller, Leshem Choshen, Ethan Wilcox, Chengxu Zhuang, Juan Ciro, Rafael Mosquera, Bhargavi Paranjape, Adina Williams, Tal Linzen, Ryan Cotterell
- Venue:
- CoNLL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 279–289
- URL:
- https://aclanthology.org/2023.conll-babylm.24
- DOI:
- 10.18653/v1/2023.conll-babylm.24
- Cite (ACL):
- Inar Timiryasov and Jean-Loup Tastet. 2023. Baby Llama: knowledge distillation from an ensemble of teachers trained on a small dataset with no performance penalty. In Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning, pages 279–289, Singapore. Association for Computational Linguistics.
- Cite (Informal):
- Baby Llama: knowledge distillation from an ensemble of teachers trained on a small dataset with no performance penalty (Timiryasov & Tastet, CoNLL 2023)
- PDF:
- https://preview.aclanthology.org/fix-volume-bibkeys/2023.conll-babylm.24.pdf