Spring 2013 GRASP Seminar: Pedro Ortega, Max Planck Institute for Intelligent Systems and Biological Cybernetics, "Adaptive Coding of Actions and Observations"
Friday, February 8, 2013 - 11:00am to 12:00pm
Abstract: The application of expected utility theory to construct adaptive agents is both computationally intractable and statistically questionable. To overcome these difficulties, agents need the ability to delay the choice of the optimal policy to a later stage when they have learned more about the environment. How should agents do this optimally? An information-theoretic answer to this question is given by the Bayesian control rule - the solution to the adaptive coding problem when there are not only observations but also actions. We review the central ideas behind the Bayesian control rule.
Pedro A. Ortega is currently a postdoctoral fellow at the Max Planck Institute for Intelligent Systems and Biological Cybernetics, working on the foundations of autonomous agency. He grew up in Zürich, Switzerland, and then moved to Santiago, Chile, where he graduated in 2005 as a Computer Engineer from the University of Chile. In 2011, he obtained his PhD in Engineering from the University of Cambridge under the supervision of Zoubin Ghahramani.