{"id":186520,"date":"2011-07-06T00:00:00","date_gmt":"2011-07-19T10:57:54","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/msr-research-item\/some-recent-developments-in-approximate-inference-learning-and-control\/"},"modified":"2016-08-22T11:32:33","modified_gmt":"2016-08-22T18:32:33","slug":"some-recent-developments-in-approximate-inference-learning-and-control","status":"publish","type":"msr-video","link":"https:\/\/www.microsoft.com\/en-us\/research\/video\/some-recent-developments-in-approximate-inference-learning-and-control\/","title":{"rendered":"Some recent developments in approximate inference: learning and control"},"content":{"rendered":"<div class=\"asset-content\">\n<p>I&#8217;ll discuss two pieces of work on inference in probabilistic models:<\/p>\n<p>The first concerns a very general class of Bayesian Linear Models that are widely used in statistics and machine learning. A great deal of research has been carried out on developing approximate inference techniques for this important class. In particular I&#8217;ll discuss methods that bound the model likelihood, which is of interest in parameter learning.  The well-known `local&#8217; variational methods  lower bound the marginal likelihood. Despite their popularity over the last decade, I&#8217;ll discuss our recent result that shows that local methods result in weaker bounds than alternative `mean-field&#8217; variational methods. In addition, I&#8217;ll discuss the perhaps surprising result that the mean-field bound is concave and discuss how one may make computationally efficient approximations in large-scale models with many thousands of variables.<\/p>\n<p>Lagrange Duality is being increasingly exploited across machine learning but to date has received comparatively little attention in planning and control. For the second part of the talk I&#8217;ll discuss an application of Lagrange Duality in learning Markov Decision Process policies. In particular, I&#8217;ll discuss the computationally difficult finite-horizon time-independent policy case, and demonstrate how our method exhibits substantially improved performance compared to policy gradients and more recent `EM&#8217; style procedures.<\/p>\n<\/div>\n<p><!-- .asset-content --><\/p>\n","protected":false},"excerpt":{"rendered":"<p>I&#8217;ll discuss two pieces of work on inference in probabilistic models: The first concerns a very general class of Bayesian Linear Models that are widely used in statistics and machine learning. A great deal of research has been carried out on developing approximate inference techniques for this important class. In particular I&#8217;ll discuss methods that [&hellip;]<\/p>\n","protected":false},"featured_media":196250,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr_hide_image_in_river":0,"footnotes":""},"research-area":[],"msr-video-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-session-type":[],"msr-impact-theme":[],"msr-pillar":[],"msr-episode":[],"msr-research-theme":[],"class_list":["post-186520","msr-video","type-msr-video","status-publish","has-post-thumbnail","hentry","msr-locale-en_us"],"msr_download_urls":"","msr_external_url":"https:\/\/youtu.be\/-VN1z9iiOcM","msr_secondary_video_url":"","msr_video_file":"","_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video\/186520","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-video"}],"version-history":[{"count":0,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video\/186520\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/196250"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=186520"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=186520"},{"taxonomy":"msr-video-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video-type?post=186520"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=186520"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=186520"},{"taxonomy":"msr-session-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-session-type?post=186520"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=186520"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=186520"},{"taxonomy":"msr-episode","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-episode?post=186520"},{"taxonomy":"msr-research-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-research-theme?post=186520"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}