Fall 2025 GRASP on Robotics: Jie Tan, Google DeepMind, “Gemini Robotics: Bringing AI into the Physical World”
November 21 @ 10:30 am - 11:45 am
This event was in-person ONLY in Wu and Chen Auditorium.
ABSTRACT
Recent advancements in large multimodal models have led to the emergence of remarkable generalist capabilities in digital domains, yet their translation to physical agents such as robots remains a significant challenge. In this talk, I will present Gemini Robotics, an advanced Vision-Language-Action (VLA) generalist model capable of directly controlling robots. Gemini Robotics executes smooth movements to tackle a wide range of complex manipulation tasks while remaining robust to variations in object types and positions, handling unseen environments, and following diverse, open-vocabulary instructions. With additional fine-tuning, Gemini Robotics can be specialized to new capabilities, including solving long-horizon, highly dexterous tasks, learning new short-horizon tasks from as few as 100 demonstrations, and adapting to completely novel robot embodiments. Finally, I will discuss the challenges, lessons learned, and future research directions for robot foundation models.