Yunzhu Li


2025

Foundation Models Meet Embodied Agents
Manling Li | Yunzhu Li | Jiayuan Mao | Wenlong Huang
Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 5: Tutorial Abstracts)

This tutorial will present a systematic overview of recent advances in foundation models for embodied agents, covering three types of foundation models based on input and output: Large Language Models (LLMs), Vision-Language Models (VLMs), and Vision-Language-Action Models (VLAs).