- This event has passed.
Fall 2025 GRASP SFI: Suraj Nair, Physical Intelligence, “Scaling Robot Learning with Vision-Language-Action Models”
October 22 @ 3:00 pm - 4:00 pm
This speaker was present virtually. This is a hybrid event with in-person attendance in Levine 307 and virtual attendance…
ABSTRACT
The last several years have witnessed tremendous progress in the capabilities of AI systems, driven largely by foundation models that scale expressive architectures with diverse data sources. While the impact of this technology on vision and language understanding is abundantly clear, its use in robotics remains in its infancy. Scaling robot learning still presents numerous open challenges—from selecting the right data to scale, to developing algorithms that can effectively fit this data for closed-loop operation in the physical world. At Physical Intelligence, we aim to tackle these questions. This talk will present our recent work on building vision-language-action models, covering topics such as architecture design, data scaling, and open research directions.