{"id":171481,"date":"2015-06-23T21:06:39","date_gmt":"2015-06-23T21:06:39","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/project\/multi-world-testing-mwt\/"},"modified":"2020-04-15T15:29:11","modified_gmt":"2020-04-15T22:29:11","slug":"multi-world-testing-mwt","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/multi-world-testing-mwt\/","title":{"rendered":"Multiworld Testing"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"size-medium wp-image-213024 alignleft\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/loop-300x227.png\" alt=\"\" width=\"300\" height=\"227\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/loop-300x227.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/loop-768x580.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/loop-80x60.png 80w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/loop.png 823w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><strong>Exponentially better than A\/B testing.<\/strong> Multiworld Testing\u00a0(MWT) is the capability to test and optimize over K policies (context-based decision rules) using an amount of data and computation that scales logarithmically in K, without necessarily knowing these policies before or during data collection. MWT can answer exponentially more detailed questions compared to traditional A\/B testing. The underlying machine learning methodology draws on research on &#8220;contextual bandits&#8221; and &#8220;counterfactual evaluation&#8221;.<\/p>\n<p><strong>A system for interactive learning.<\/strong> We implement MWT as a machine learning system for making context-based decisions. The system supports the full cycle from exploration to logging to training policies to deploying them in production. Built as a cloud service, the system is widely applicable, modular, and easy to use. We currently offer two versions of the system: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/mwtds.azurewebsites.net\/\">MWT Decision Service<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0(self-hosted) and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/ds.microsoft.com\/\">Custom Decision Service<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0(multitenant). This is an ongoing project, released internally in Jun&#8217;15 and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/hunch.net\/?p=4464948\">announced externally<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> in Jul&#8217;16. Custom Decision Service in public preview since May&#8217;17, as a part of <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/azure.microsoft.com\/en-us\/services\/cognitive-services\/\">Microsoft Azure Cognitive Services<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. A version of the system is already deployed very successfully with <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"http:\/\/www.msn.com\/\">MSN<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.\u00a0<i>\u00a0<\/i><br \/>\n<img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-249161 size-large\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/DS-informal-1-1024x337.png\" alt=\"Multiworld Testing Decision Service\" width=\"1024\" height=\"337\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/DS-informal-1-1024x337.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/DS-informal-1-300x99.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/DS-informal-1-768x253.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/DS-informal-1.png 1104w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p><strong>A typical example.<\/strong> Suppose one wants to optimize clicks on suggested news stories. To discover what works, one needs to explore over the possible news stories. Further, if the suggested news story can be chosen depending on the visitor&#8217;s profile, then one needs to explore over the possible &#8220;policies&#8221; that map profiles to news stories (and there are exponentially more &#8220;policies&#8221; than news stories!).\u00a0Traditional machine learning fails at this because it does not explore. Whereas the Decision Service can explore continuously, and optimize decisions using this exploration data.<\/p>\n<p><strong>Team.<\/strong>\u00a0We are a diverse group of researchers working on all aspects of MWT, spanning algorithms, machine learning, systems, and economics, and covering the entire range from theory to experiments to practical deployments. Most of us are located at <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/lab\/microsoft-research-new-york\/\">Microsoft Research NYC<\/a>. We can be contacted at <a href=\"mailto:mwtdev@microsoft.com\">mwtdev@microsoft.com<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Exponentially better than A\/B testing. Multiworld Testing\u00a0(MWT) is the capability to test and optimize over K policies (context-based decision rules) using an amount of data and computation that scales logarithmically in K, without necessarily knowing these policies before or during data collection. MWT can answer exponentially more detailed questions compared to traditional A\/B testing. The [&hellip;]<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"research-area":[13561,13556,13547],"msr-locale":[268875],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-171481","msr-project","type-msr-project","status-publish","hentry","msr-research-area-algorithms","msr-research-area-artificial-intelligence","msr-research-area-systems-and-networking","msr-locale-en_us","msr-archive-status-active"],"msr_project_start":"2013-11-01","related-publications":[],"related-downloads":[],"related-videos":[],"related-groups":[],"related-events":[],"related-opportunities":[],"related-posts":[419964],"related-articles":[],"tab-content":[{"id":0,"name":"Releases","content":"<a href=\"http:\/\/ds.microsoft.com\/\">Custom Decision Service<\/a>\r\nMay 2017: public preview release, as a part of <a href=\"https:\/\/azure.microsoft.com\/en-us\/services\/cognitive-services\/\">Cognitive Services<\/a>.\r\n\r\n<a href=\"http:\/\/mwtds.azurewebsites.net\/\">MWT Decision Service<\/a>\r\nJul 2016: external announcement.\r\n\r\n<a href=\"https:\/\/github.com\/multiworldtesting\/explore\/tree\/v1.0\">MWT Exploration library<\/a>\r\nA library for MWT, structurally compatible with learning algorithms in <a href=\"https:\/\/github.com\/JohnLangford\/vowpal_wabbit\/wiki\">Vowpal Wabbit<\/a>.\r\nNov 2014: external release.\r\n\r\n<a href=\"http:\/\/research.microsoft.com\/en-US\/projects\/mwt\/mwt-intro.pdf\" target=\"_self\">MWT white paper<\/a>\u00a0(rev. March 2016)\r\nJul 2016: rev2 released\r\nSep 2015: released externally\r\n\r\n<strong>Deployment<\/strong>: personalized news\u00a0on <a href=\"http:\/\/www.msn.com\/\" target=\"_self\">msn.com<\/a>\r\nDeployed on 100%\u00a0of the traffic; <em>25% lift in clicks<\/em>.\r\n<em><strong>Innovation Award<\/strong><\/em> from Microsoft's Universal Storefronts."},{"id":1,"name":"Timeline","content":"<ul>\r\n \t<li>May 2017: Custom Decision Service announced as public preview.<\/li>\r\n \t<li>Jul 2016: MWT Decision service <a href=\"http:\/\/hunch.net\/?p=4464948\">announced externally<\/a>.<\/li>\r\n \t<li>Jul 2016: MWT white paper (rev2)\u00a0released<\/li>\r\n \t<li>Mar 2016: a demo at MSR TechFest 2016 (Redmond, MSFT-only).<\/li>\r\n \t<li>Mar 2016: <em><strong>Innovation Award<\/strong><\/em> from Microsoft's Universal Storefronts for the MSN deployment.<\/li>\r\n \t<li>Jan 2016:\u00a0Decision Service for personalized news\u00a0on MSN:\u00a0 deployed on 100% of the traffic.<\/li>\r\n \t<li>Nov 2015: Mini-course (Redmond, WA, MSFT-only) [<a href=\"https:\/\/microsoft.sharepoint.com\/teams\/machine-learning\/SitePages\/E4-2015-Redmond.aspx\">internal link<\/a>]<\/li>\r\n \t<li>Nov 2015: Mini-course (Cambridge, UK, MSFT-only) [<a title=\"\" href=\"https:\/\/microsoft.sharepoint.com\/teams\/machine-learning\/SitePages\/E4-2015-London.aspx\" target=\"_self\">internal link<\/a>].<\/li>\r\n \t<li>Sep 2015: Decision Service for personalized news\u00a0on MSN:\u00a0first test flights.<\/li>\r\n \t<li>Sep 2015: MWT white paper released externally.<\/li>\r\n \t<li>Mar 2015: Workshop on Interactive Machine Learning\u00a0(Redmond, MSFT-only) [<a class=\"invalidLink\" title=\"\" href=\"http:\/\/msrweb\/projects\/mwt\/TF15-IML-workshop.mht\" target=\"_self\">internal link<\/a>].<\/li>\r\n \t<li>Mar 2015: a demo\u00a0at MSR TechFest 2015 (Redmond, MSFT-only).<\/li>\r\n \t<li>Jun 2015: MWT Decision Service (v1) released internally.<\/li>\r\n \t<li>Nov 2014: MWT exploration library released.<\/li>\r\n \t<li>Oct 2014:\u00a0Mini-course (Redmond, MSFT-only) [<a href=\"https:\/\/microsoft.sharepoint.com\/teams\/machine-learning\/SitePages\/E4-course.aspx\">internal link<\/a>]<\/li>\r\n \t<li>Oct 2014: Tutorial at\u00a0\"Practice of\u00a0ML Conf.\"\u00a0(Redmond, MSFT-only).<\/li>\r\n \t<li>Mar 2014: Workshop (Redmond, MSFT-only) [<a class=\"invalidLink\" href=\"http:\/\/msrweb\/projects\/MWT\/TF14-E4workshop\/program.mht\">internal link<\/a>].<\/li>\r\n \t<li>Mar 2014: a demo and a lecture\u00a0at MSR TechFest 2014 (Redmond, MSFT-only).<\/li>\r\n<\/ul>"},{"id":2,"name":"Talks","content":"<ul>\r\n \t<li>Alekh Agarwal, Berkeley, Apr 2017<\/li>\r\n \t<li>John Langford, Facebook, Feb 2017<\/li>\r\n \t<li>Sid Sen, <a href=\"https:\/\/sites.google.com\/site\/mlsysnips2016\/\">ML Systems Workshop<\/a> at NIPS 2016<\/li>\r\n \t<li>Alekh Agarwal, <a href=\"https:\/\/conferences.oreilly.com\/artificial-intelligence\/ai-ny-2016\">O\u2019Reilly AI conference<\/a>, Sept 2016 (<a href=\"https:\/\/twimlai.com\/twiml-talk-017-interactive-machine-learning-systems-alekh-agarwal-interview\/\">a podcast interview<\/a>).<\/li>\r\n \t<li>Dan Melamed, Machine Learning and Data Science Conf. (Microsoft-only), May 2016.<\/li>\r\n \t<li>Alekh Agarwal, <a href=\"http:\/\/bostonml.com\/\">Boston ML Forum<\/a>, March 2016.<\/li>\r\n \t<li>John Langford, <a href=\"https:\/\/sites.google.com\/site\/mlaihci\/presentations-schedule\">Workshop on ML from\/for Adaptive User Technologies<\/a>\u00a0at NIPS 2015.<\/li>\r\n \t<li>Sarah Bird, <a href=\"http:\/\/learningsys.org\/\">Workshop on ML Systems<\/a>\u00a0at NIPS 2015.<\/li>\r\n \t<li>Alekh Agarwal, <a href=\"http:\/\/www.meetup.com\/London-Machine-Learning-Meetup\/events\/226037610\/\">London ML meetup<\/a>, Nov 2015.<\/li>\r\n \t<li>Sid Sen, OneAnalyst Conference (Microsoft internal), June 2015<\/li>\r\n \t<li>Lihong Li, <a href=\"http:\/\/www.wsdm-conference.org\/2015\/\">WSDM 2015<\/a> (tutorial).<\/li>\r\n \t<li>Alekh Agarwal, <a href=\"https:\/\/www.fields.utoronto.ca\/programs\/scientific\/14-15\/gradresearch\/\">Statistics Graduate Student Research Day<\/a> at Univ. of Toronto, Apr 2015.<\/li>\r\n \t<li>John Langford, <a href=\"https:\/\/nips.cc\/Conferences\/2013\">NIPS 2013<\/a> (tutorial).<\/li>\r\n<\/ul>"},{"id":3,"name":"Details","content":"<a class=\"invalidLink\" title=\"\" href=\"https:\/\/github.com\/Microsoft\/mwt-ds\/raw\/master\/images\/MWT-WhitePaper.pdf\">MWT white paper<\/a> (pdf) -\u00a0Background for potential users of the Decision Service: machine learning methodology and system design, and how to make them fit YOUR application. Also covers past deployments and experimental evaluation. For a broad technical audience, both in product groups and in research.\r\n\r\n<a class=\"invalidLink\" title=\"\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/02\/mwt-mwt-slides.pptx\" target=\"_self\">Slide deck<\/a> on MWT and the Decision Service.\r\n\r\n<a href=\"https:\/\/github.com\/Microsoft\/mwt-ds\/wiki\">MWT Decision Service Wiki<\/a>:\u00a0tutorials, guides and references.\r\n\r\n<a href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/cognitive-services\/custom-decision-service\/custom-decision-service-overview\">Custom Decision Service documentation<\/a>\r\n\r\n<strong>Tutorials\u00a0and lectures<\/strong>:\r\n<ul>\r\n \t<li><a href=\"http:\/\/hunch.net\/~jl\/interact.pdf\">Learning to interact<\/a> (John Langford, NIPS 2013).<\/li>\r\n \t<li><a class=\"invalidLink\" title=\"\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2016\/02\/mwt-bottou-lecture-web.pdf\" target=\"_self\">Learning to Interact<\/a> (Leon Bottou, MSR TechFest 2014).<\/li>\r\n \t<li><a href=\"http:\/\/cilvr.cs.nyu.edu\/doku.php?id=courses:bigdata:slides:start\" target=\"_self\">Large-Scale Machine Learning<\/a> (NYU, 2013, John Langford and Yann LeCun, lectures 9-11).<\/li>\r\n<\/ul>\r\n<strong>Most relevant papers:<\/strong>\r\n<ul>\r\n \t<li><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/unbiased-offline-evaluation-of-contextual-bandit-based-news-article-recommendation-algorithms\/\">Basic techniques<\/a> of MWT with applications to news recommendation (WSDM 2011).<\/li>\r\n \t<li><a class=\"invalidLink\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/learning-from-logged-implicit-exploration-data\/\">MWT with estimated action selection probabilities<\/a>\u00a0 (NIPS 2010).<\/li>\r\n \t<li><a class=\"invalidLink\" title=\"\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/counterfactual-estimation-and-optimization-of-click-metrics-in-search-engines-a-case-study\/\" target=\"_self\">Application of MWT\u00a0to Bing Speller<\/a> (WWW'14).<\/li>\r\n<\/ul>\r\n<strong>Background reading:<\/strong>\r\n<ul>\r\n \t<li><a href=\"http:\/\/www.princeton.edu\/~sbubeck\/SurveyBCB12.pdf\">survey on multi-armed bandits<\/a><\/li>\r\n \t<li><a href=\"http:\/\/homes.di.unimi.it\/cesa-bianchi\/predbook\/\">book on prediction &amp; learning<\/a><\/li>\r\n<\/ul>"},{"id":4,"name":"Alumni","content":"<a href=\"http:\/\/www.justinmrao.com\/\">Justin Rao<\/a>, HomeAway.\r\n<a href=\"http:\/\/leon.bottou.org\/\">Leon Bottou<\/a>, Facebook AI research.\r\nLuong Hoang, Microsoft.\r\n<a href=\"https:\/\/www.linkedin.com\/in\/slbird\">Sarah Bird<\/a>"}],"slides":[],"related-researchers":[{"type":"user_nicename","display_name":"Markus Cozowicz","user_id":32803,"people_section":"Group 1","alias":"marcozo"},{"type":"user_nicename","display_name":"Miro Dud\u00edk","user_id":32867,"people_section":"Group 1","alias":"mdudik"},{"type":"user_nicename","display_name":"John Langford","user_id":32204,"people_section":"Group 1","alias":"jcl"},{"type":"user_nicename","display_name":"Siddhartha Sen","user_id":33656,"people_section":"Group 1","alias":"sidsen"},{"type":"user_nicename","display_name":"Alex Slivkins","user_id":33685,"people_section":"Group 1","alias":"slivkins"}],"msr_research_lab":[],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/171481","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-project"}],"version-history":[{"count":12,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/171481\/revisions"}],"predecessor-version":[{"id":636702,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/171481\/revisions\/636702"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=171481"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=171481"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=171481"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=171481"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=171481"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}