*This seminar will be held in person in Raisler Lounge as well as virtually via Zoom.*
Computer vision has transformed from simple edge detection in the 1980s to modern generative models that produce uncannily realistic images: objects appear in sensible places, lighting seems plausible, and textures look accurate. But how do these models achieve such an understanding of our visual world?
Probing their internal representations reveals that these models encode fundamental aspects of physical reality. Within them, we discovered classical computer vision concepts such as intrinsic images — decompositions of scenes into color, shape, and lighting — learned without explicit supervision. These discoveries allow us to manipulate real photographs in physically plausible ways. However, we also find surprising gaps in their understanding, such as their failure to replicate principles of projective geometry, which provides reliable signatures for detecting generated images.
This talk explores what knowledge emerges within generative image models, revealing both their strengths and their weaknesses. I will discuss how these insights drive new applications and open challenges, pushing us closer to generative models grounded in the physical world.