Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties
Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, Joyce Chai
- Anthology ID: 2024.emnlp-main.1137
- Volume: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
- Month: November
- Year: 2024
- Address: Miami, Florida, USA
- Editors: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
- Venue: EMNLP
- Publisher: Association for Computational Linguistics
- Pages: 20416–20431
- URL: https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.emnlp-main.1137/
- DOI: 10.18653/v1/2024.emnlp-main.1137
- Cite (ACL): Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, and Joyce Chai. 2024. Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 20416–20431, Miami, Florida, USA. Association for Computational Linguistics.
- Cite (Informal): Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties (Yu et al., EMNLP 2024)
- PDF: https://preview.aclanthology.org/jlcl-multiple-ingestion/2024.emnlp-main.1137.pdf