Skip to main content
Home

Main navigation

  • Home
  • Series
  • People
  • Depts & Colleges
  • Open Education

Main navigation

  • Home
  • Series
  • People
  • Depts & Colleges
  • Open Education

From probabilistic bisimulation to representation learning via metrics

Series
Strachey Lectures
Video Audio Embed
Strachey Lecture: From probabilistic bisimulation to representation learning via metrics - Professor Prakash Panangaden
Bisimulation is a fundamental equivalence relation in process theory invented by Robin Milner and with an elegant fixed-point definition due to David Park. In this talk I will review the concept of bisimulation and then discuss its probabilistic analogue. This was extended to systems with continuous state spaces. Despite its origin in theoretical work, it has proved to be useful in fields like machine learning, especially reinforcement learning. Surprisingly, it turned out that one could prove a striking theorem: a theorem that pins down exactly what differences one can "see" in process behaviours when two systems are not bisimilar.
However, it is questionable whether a concept like equivalence is the right one for quantitative systems. If two systems are almost, but not quite, the same, bisimulation would just say that they are not equivalent. One would like to say in some way that they are "almost" the same. Metric analogues of bisimulation were developed to capture a notion of behavioral similarity rather than outright equivalence. These ideas have been adopted by the machine learning community and a bisimulation-style metric was developed for Markov decision processes. Recent work has shown that variants of these bisimulation metrics can be useful in representation learning. I will tell the tale of this arc of ideas in as accessible a way as possible.

More in this series

View Series
Strachey Lectures
Captioned

Strachey Lecture: The Computer in the Sky

The talk will emphasize the diversity of mathematical tools necessary for understanding blockchain protocols and their applications
Previous
Strachey Lectures
Captioned

Privacy, Verification, Robustness: A Cryptographer's perspective on ML

Strachey Lecture: Privacy, Verification, Robustness: A Cryptographer's perspective on ML
Next
Transcript Available

Episode Information

Series
Strachey Lectures
People
Prakash Panangaden
Keywords
bisimulation
machine learning
reinforcement learning
Department: Department of Computer Science
Date Added: 02/12/2024
Duration: 00:55:03

Subscribe

Apple Podcast Video Apple Podcast Audio Audio RSS Feed Video RSS Feed

Download

Download Video Download Audio Download Transcript

Footer

  • About
  • Accessibility
  • Contribute
  • Copyright
  • Contact
  • Privacy
'Oxford Podcasts' Twitter Account @oxfordpodcasts | MediaPub Publishing Portal for Oxford Podcast Contributors | Upcoming Talks in Oxford | © 2011-2022 The University of Oxford