Samuel Albanie
2025
GAMEBoT: Transparent Assessment of LLM Reasoning in Games
Wenye Lin
|
Jonathan Roberts
|
Yunhan Yang
|
Samuel Albanie
|
Zongqing Lu
|
Kai Han
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities
Adhiraj Ghosh
|
Sebastian Dziadzio
|
Ameya Prabhu
|
Vishaal Udandarao
|
Samuel Albanie
|
Matthias Bethge
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2024
HelloFresh: LLM Evalutions on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
Tim Franzmeyer
|
Aleksandar Shtedritski
|
Samuel Albanie
|
Philip Torr
|
Joao F. Henriques
|
Jakob Foerster
Findings of the Association for Computational Linguistics: ACL 2024
2023
Crosslingual Generalization through Multitask Finetuning
Niklas Muennighoff
|
Thomas Wang
|
Lintang Sutawika
|
Adam Roberts
|
Stella Biderman
|
Teven Le Scao
|
M Saiful Bari
|
Sheng Shen
|
Zheng Xin Yong
|
Hailey Schoelkopf
|
Xiangru Tang
|
Dragomir Radev
|
Alham Fikri Aji
|
Khalid Almubarak
|
Samuel Albanie
|
Zaid Alyafeai
|
Albert Webson
|
Edward Raff
|
Colin Raffel
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Co-authors
- Alham Fikri Aji 1
- Khalid Almubarak 1
- Zaid Alyafeai 1
- M Saiful Bari 1
- Matthias Bethge 1
- show all...
- Stella Biderman 1
- Sebastian Dziadzio 1
- Jakob Foerster 1
- Tim Franzmeyer 1
- Adhiraj Ghosh 1
- Kai Han 1
- Joao F. Henriques 1
- Teven Le Scao 1
- Wenye Lin 1
- Zongqing Lu 1
- Niklas Muennighoff 1
- Ameya Prabhu 1
- Dragomir Radev 1
- Edward Raff 1
- Colin Raffel 1
- Adam Roberts 1
- Jonathan Roberts 1
- Hailey Schoelkopf 1
- Sheng Shen 1
- Aleksandar Shtedritski 1
- Lintang Sutawika* 1
- Xiangru Tang 1
- Philip Torr 1
- Vishaal Udandarao 1
- Thomas Wang 1
- Albert Webson 1
- Yunhan Yang 1
- Zheng-Xin Yong 1