Loading Events

« All Events

  • This event has passed.

Fall 2025 GRASP on Robotics: Jie Tan, Google DeepMind, “Gemini Robotics: Bringing AI into the Physical World”

November 21 @ 10:30 am - 11:45 am

This event was in-person ONLY in Wu and Chen Auditorium.

ABSTRACT

Recent advancements in large multimodal models have led to the emergence of remarkable generalist capabilities in digital domains, yet their translation to physical agents such as robots remains a significant challenge. In this talk, I will present Gemini Robotics, an advanced Vision-Language-Action (VLA) generalist model capable of directly controlling robots. Gemini Robotics executes smooth movements to tackle a wide range of complex manipulation tasks while also being robust to variations in object types and positions, handling unseen environments as well as following diverse, open vocabulary instructions. With additional fine-tuning, Gemini Robotics can be specialized to new capabilities including solving long-horizon, highly dexterous tasks, learning new short-horizon tasks from as few as 100 demonstrations and adapting to completely novel robot embodiments. Furthermore, I will discuss the challenges, learnings and future research directions on robot foundation models.

Presenter

Jie Tan

Jie Tan - Learn More

Jie Tan is a Senior Staff Research Scientist and Tech Lead Manager in the robotics team of Google DeepMind. His research focuses on building foundation models and deep reinforcement learning methods to robots, with interests spanning locomotion, navigation, manipulation, simulation, and sim-to-real transfer. Jie Tan is also an adjunct associate professor at Georgia Institute of Technology. He got his PhD at the Computer Graphics Laboratory in Georgia Tech, advised by Greg Turk and Karen Liu.

Details

Venue

Wu and Chen Auditorium
3330 Walnut Street
Philadelphia, PA 19104
+ Google Map