Swiss AI Initiative - Collecting Large Amounts of High-Quality Data for Training Large Language Models
Jan Deriu, Maud Ehrmann, Emanuela Boros, Maximilian Böther, Christiane Sibille, Ihor Protsenko, Marta Brucka, Imanol Schlag, Elliott Ash
- Anthology ID:
- 2024.swisstext-1.40
- Volume:
- Proceedings of the 9th edition of the Swiss Text Analytics Conference
- Month:
- June
- Year:
- 2024
- Address:
- Chur, Switzerland
- Editors:
- Corsin Capol, Mark Cieliebak, Albert Weichselbraun, Claudiu Musat, Elisabeth Maier, Lucas Zimmermann
- Venue:
- SwissText
- SIG:
- SIGSEM
- Publisher:
- Association for Computational Linguistics
- Pages:
- 188
- URL:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.swisstext-1.40/
- Cite (ACL):
- Jan Deriu, Maud Ehrmann, Emanuela Boros, Maximilian Böther, Christiane Sibille, Ihor Protsenko, Marta Brucka, Imanol Schlag, and Elliott Ash. 2024. Swiss AI Initiative - Collecting Large Amounts of High-Quality Data for Training Large Language Models. In Proceedings of the 9th edition of the Swiss Text Analytics Conference, page 188, Chur, Switzerland. Association for Computational Linguistics.
- Cite (Informal):
- Swiss AI Initiative - Collecting Large Amounts of High-Quality Data for Training Large Language Models (Deriu et al., SwissText 2024)
- PDF:
- https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.swisstext-1.40.pdf