Loading Events

« All Events

Fall 2025 GRASP on Robotics: Jiatao Gu, University of Pennsylvania, “Towards Robust World Models”

October 3 @ 10:30 am - 11:45 am

This event will be in-person ONLY in Wu and Chen Auditorium.

ABSTRACT

Autonomous agents need a world model that explains observations, predicts what comes next, and chooses actions over long horizons. Think of catching a ball: the robot must infer where it is now and where it will be next—even when it slips out of view—and move to intercept. Recently, large diffusion-based video models trained on internet-scale data have shown promising results for world modeling; however, they remain brittle—forecasting errors accumulate over time, especially during long open-loop rollouts without geometric grounding or collective feedback. In this talk, we present our recent research toward a more robust video generation foundation. Instead of diffusion, we build on scalable normalizing flows–a different family of generative models based on invertible transformations. We will detail the mathematical formulation, explain how these models can be trained end to end, and describe how we construct a practical video model from this framework. We will conclude by outlining research directions derived from this approach and steps toward a truly robust world model.

Presenter

Jiatao Gu

Jiatao Gu

Jiatao Gu is an Assistant Professor at the University of Pennsylvania in the CIS department where he leads the Generative Machine Learning Research (GMLR) lab. He also serves as a part-time staff research scientist at Apple. Before joining Apple, Jiatao was a senior research scientist at Facebook AI Research (FAIR). He received his Ph.D. from the University of Hong Kong after earning his Bachelor’s degree from Tsinghua University. He is the recipient of the Hong Kong PhD Fellowship. His current research goal is to advance the capabilities of AI agents to interact with the physical world, with a special emphasis on leveraging generative machine learning approaches for world modeling, iterative reasoning, and effective decision-making.

Details

Date:
October 3
Time:
10:30 am - 11:45 am
Event Categories:
,

Venue

Wu and Chen Auditorium
3330 Walnut Street
Philadelphia, PA 19104
+ Google Map