{"id":749359,"date":"2019-12-05T13:22:30","date_gmt":"2019-12-05T21:22:30","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-research-item&#038;p=749359"},"modified":"2021-05-27T13:29:12","modified_gmt":"2021-05-27T20:29:12","slug":"foundations-of-real-world-reinforcement-learning","status":"publish","type":"msr-video","link":"https:\/\/www.microsoft.com\/en-us\/research\/video\/foundations-of-real-world-reinforcement-learning\/","title":{"rendered":"Foundations of Real-World Reinforcement Learning"},"content":{"rendered":"<p>Reinforcement learning (RL) is an approach to sequential decision making under uncertainty which formalizes the principles for designing an autonomous learning agent. The broad goal of a reinforcement learning agent is to find an optimal policy which maximizes its long-term rewards over time. Its list of applications is growing as the technology advances and continues to be further integrated into many areas, such as education, health, advertising, autonomous systems, and gaming.<\/p>\n<p>By starting from the perspective of an agent which interacts with and affects its environment, RL provides an improvement upon supervised learning in situations requiring decisions, and not just predictions. In particular, it motivates exploratory actions to discover novel rewarding behavior in the environment, a hallmark of intelligent agents.<\/p>\n<p>In this webinar\u2014led by Microsoft Researchers John Langford, Partner Research Manager with over a decade of experience in reinforcement learning-related research, and Alekh Agarwal, Principal Research Manager and leader of the Reinforcement Learning group in Redmond\u2014learn how RL works to impact real-world problems across a variety of domains.<\/p>\n<p>Together, you&#8217;ll explore:<\/p>\n<ul>\n<li>The definition and uses of RL, from a general paradigm to its broad range of applications<\/li>\n<li>The various benefits of using RL as well as its current challenges<\/li>\n<li>The specific types of RL\u2014contextual bandits, imitation learning, and strategic exploration<\/li>\n<li>Where these cutting-edge methods might take the future of RL.<\/li>\n<\/ul>\n<p><strong>Resource list:<\/strong><\/p>\n<ul>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/podcast\/reinforcement-learning-for-the-real-world-with-dr-john-langford-and-rafah-hosn\/\">Reinforcement learning for the real world with Dr. John Langford and Rafah Hosn<\/a> (Podcast)<\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/real-world-reinforcement-learning\/\">Real World Reinforcement Learning<\/a> (Project page)<\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/kinematic-state-abstraction-and-provably-efficient-rich-observation-reinforcement-learning\/\">Kinematic State Abstraction and Provably Efficient Rich-Observation Reinforcement Learning<\/a> (Publication)<\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/blog\/provably-efficient-reinforcement-learning-with-rich-observations\/\">Provably efficient reinforcement learning with rich observations<\/a> (Blog)<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/hunch.net\/~rwil\/\" target=\"_blank\" rel=\"noopener noreferrer\">ICML 2017 Tutorial on Real World Interactive Learning<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (Tutorial)<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/hunch.net\/\" target=\"_blank\" rel=\"noopener noreferrer\">Machine Learning (Theory)<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (John Langford\u2019s blog)<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/vowpalwabbit.org\/\" target=\"_blank\" rel=\"noopener noreferrer\">Vowpal Wabbit<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> (open source project)<\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/theme\/reinforcement-learning-group\/#!opportunities\">Reinforcement Learning<\/a> (Career opportunities)<\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/alekha\/\">Alekh Agarwal<\/a> (Researcher profile)<\/li>\n<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/jcl\/\">John Langford<\/a> (Researcher profile)<\/li>\n<\/ul>\n<p>*This on-demand webinar features a previously recorded Q&A session and open captioning.<\/p>\n<p>This webinar originally aired on December 5, 2019<\/p>\n<p>Explore more Microsoft Research webinars: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/aka.ms\/msrwebinars\">https:\/\/aka.ms\/msrwebinars<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Reinforcement learning (RL) is an approach to sequential decision making under uncertainty which formalizes the principles for designing an autonomous learning agent. The broad goal of a reinforcement learning agent is to find an optimal policy which maximizes its long-term rewards over time. Its list of applications is growing as the technology advances and continues [&hellip;]<\/p>\n","protected":false},"featured_media":749365,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr_hide_image_in_river":0,"footnotes":""},"research-area":[13556],"msr-video-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-session-type":[],"msr-impact-theme":[],"msr-pillar":[],"msr-episode":[],"msr-research-theme":[],"class_list":["post-749359","msr-video","type-msr-video","status-publish","has-post-thumbnail","hentry","msr-research-area-artificial-intelligence","msr-locale-en_us"],"msr_download_urls":"","msr_external_url":"https:\/\/youtu.be\/A6wNJ4-MpIg","msr_secondary_video_url":"","msr_video_file":"","_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video\/749359","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-video"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video\/749359\/revisions"}],"predecessor-version":[{"id":749377,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video\/749359\/revisions\/749377"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/749365"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=749359"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=749359"},{"taxonomy":"msr-video-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video-type?post=749359"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=749359"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=749359"},{"taxonomy":"msr-session-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-session-type?post=749359"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=749359"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=749359"},{"taxonomy":"msr-episode","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-episode?post=749359"},{"taxonomy":"msr-research-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-research-theme?post=749359"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}