Wenlong Huang
2025
Foundation Models Meet Embodied Agents
Manling Li | Yunzhu Li | Jiayuan Mao | Wenlong Huang
Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 5: Tutorial Abstracts)
This tutorial will present a systematic overview of recent advances in foundation models for embodied agents, covering three types of foundation models distinguished by their input and output: Large Language Models (LLMs), Vision-Language Models (VLMs), and Vision-Language-Action Models (VLAs).