{"id":633669,"date":"2020-03-03T17:01:02","date_gmt":"2020-02-06T15:33:56","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-academic-program&#038;p=633669"},"modified":"2024-03-20T09:16:05","modified_gmt":"2024-03-20T16:16:05","slug":"rl-open-source-fest","status":"publish","type":"msr-academic-program","link":"https:\/\/www.microsoft.com\/en-us\/research\/academic-program\/rl-open-source-fest\/","title":{"rendered":"Reinforcement Learning Open Source Fest"},"content":{"rendered":"\n\n<p>Introducing students to open-source reinforcement learning programs and software development.<\/p>\n\n\n\n<p><strong>Program dates:<\/strong> May \u2013 August 2023<\/p>\n\n\n\n\n\n\n<p>The submission portal for Reinforcement Learning (RL) Open-Source Fest 2023 is now closed. <\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"about-the-program\">About the program<\/h2>\n\n\n\n<p>RL Open-Source Fest is a global online program focused on introducing students to open-source reinforcement learning programs and software development while working alongside researchers, data scientists, and engineers on the <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/real-world-reinforcement-learning\/\">Real World Reinforcement Learning<\/a> team at Microsoft Research NYC. Our goal is to bring together a diverse group of students from around the world to collectively solve open-source reinforcement learning problems and advance the state-of-the-art research\u202fand development\u202falongside the RL community while providing open-source code written and released to benefit all.<\/p>\n\n\n\n<p>Students will work on a four-month research programming project from May-August 2023. At the end of the program, students will present each of their projects to the Microsoft Research Real World Reinforcement Learning team online.<\/p>\n\n\n\n<p>Accepted students will receive a $10,000 USD stipend, or equivalent in local currency. Selected students can receive funding through one of the following payment options:&nbsp;&nbsp;<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Microsoft provides payment directly to a student\u2019s academic institution, which then disperses funds according to the institution\u2019s guidelines.<\/li>\n\n\n\n<li>Microsoft provides payment directly to the individual student. You will be solely responsible for any taxes related to the payments you receive from Microsoft.<\/li>\n<\/ol>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"open-source-projects\">Open-source projects<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/vowpalwabbit.org\/\">Vowpal Wabbit (VW)<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is an open-source machine learning library created by John Langford and developed by Microsoft Research with the help of many contributors. It is a fast, flexible, online, and active learning solution that empowers people to solve complex interactive machine learning problems, with a large focus on contextual bandits and reinforcement learning. It is a vehicle for both research prototyping and driving bleeding edge algorithms to production. RL OS Fest is all about open-source projects in the Vowpal Wabbit ecosystem.<\/p>\n\n\n\n<p>View this year&#8217;s <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/vowpalwabbit.org\/rlos\/2023\/projects\">2023 open-source projects<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. <\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"eligibility\">Eligibility<\/h3>\n\n\n\n<p>To be eligible for the program, students must be enrolled in or accepted into an accredited institution including colleges, universities, Master programs, PhD programs, and undergraduate programs.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"student-responsibilities-during-the-program\">Student responsibilities during the program<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Submit quality work: code compiles, has unit tests and documentation, and&nbsp;passes code review<\/li>\n\n\n\n<li>Regularly communicate work completed, what you intend to do next, and blockers<\/li>\n\n\n\n<li>Re-evaluate project tasks if you\u2019re significantly ahead or behind schedule<\/li>\n\n\n\n<li>Regular check-ins&nbsp;with your mentor\/collaborator<\/li>\n\n\n\n<li>Listen and respond to feedback<\/li>\n\n\n\n<li>Pro-active learning<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"what-makes-a-successful-project\">What makes a successful project?<\/h3>\n\n\n\n<p>Success looks different for every project. Challenging yourself and developing skills and knowledge are the most important part. Producing some sort of deliverable item is great, but not strictly required. We all know how development and experimentation goes, unforeseen problems can come up and present new challenges and that\u2019s all part of the process. You\u2019ll have a mentor and support along the way.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>A successful engineering-oriented project might include pull requests merging your work, a design document, tests, and general documentation<\/li>\n\n\n\n<li>A successful data science-oriented project might involve pull requests, reproducible experiments, datasets, a report, and visualized results<\/li>\n\n\n\n<li>A successful prototyping-oriented project might include an MVP, tests, and documentation<\/li>\n<\/ul>\n\n\n\n<p><strong>Contact us:<\/strong> If you have questions about this program, please send us an email at <a href=\"mailto:RLOSFEST@microsoft.com\">RLOSFEST@microsoft.com<\/a>.<\/p>\n\n\n\n\n\n<h2 class=\"wp-block-heading\" id=\"program-timeline\">Program Timeline<\/h2>\n\n\n\n<p>February 20, 2023 | Proposal period opens (at 9:00 AM Eastern Standard Time)<br>April 3, 2023 | Proposal period closes (at 11:59 PM Eastern Standard Time)<\/p>\n\n\n\n<p>April 24, 2023 | Selected applicants notified<\/p>\n\n\n\n<p>May 1, 2023 | Student and mentor meetings<br>May 8, 2023 | Project begin<br>August 15, 2023 | Final project presentations<\/p>\n\n\n\n\n\n<p><strong>The submission portal for RL Open-Source Fest 2023 is now closed.<\/strong><\/p>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"how-to-submit-a-proposal\">How to submit a proposal<\/h3>\n\n\n\n<p>The below outlines the information necessary to submit your proposal in our submission portal. Acceptance notifications will be sent on April 24, 2023.<\/p>\n\n\n\n<p><strong>Proposal details<\/strong><br>You will be asked to fill out the following questions:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Are you currently enrolled in an accredited university or college? (Please note, proof of enrollment will be required)<\/li>\n\n\n\n<li>Select your country of residence<\/li>\n\n\n\n<li>Upload your resume or a document containing a list of classes completed to date<\/li>\n\n\n\n<li>Upload existing or past personal projects you\u2019ve worked on or open-source projects to which you\u2019ve contributed<\/li>\n\n\n\n<li>Choose your preferred project from the list of <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/vowpalwabbit.org\/rlos\/2023\/projects\">Open-Source Projects<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n\n\n\n<li>Upload your proposal for the selected Open-Source Project. Include why you want to work on this problem specifically. Provide a rough outline of how you plan to execute on the project selected. This should include a week-by-week plan of what you\u2019d need to learn and the challenges you foresee.<\/li>\n\n\n\n<li>Complete a pre-screening exercise according to the requirements of your selected project. <\/li>\n<\/ul>\n\n\n\n<p><strong>Important information pertaining to your proposal:<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Proposals submitted to Microsoft will not be returned. Microsoft cannot assume responsibility for the confidentiality of information submitted in the proposal. Therefore, proposals should not contain information that is confidential, restricted, or sensitive.<\/li>\n\n\n\n<li>Incomplete proposals will not be considered.<\/li>\n\n\n\n<li>Due to the volume of submissions, Microsoft Research cannot provide individual feedback on proposals.<\/li>\n<\/ul>\n\n\n\n<div class=\"wp-block-buttons is-layout-flex wp-block-buttons-is-layout-flex\"><\/div>\n\n\n\n\n\n<h2 class=\"wp-block-heading\" id=\"2023-alumni\">2023 Alumni<\/h2>\n\n\n\n<div style=\"height:20px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n<figure class=\"wp-block-image alignleft size-full is-resized is-style-rounded\"><img loading=\"lazy\" decoding=\"async\" width=\"360\" height=\"360\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Micky-Yun-Chan_360x360.jpg\" alt=\"RL Open Source Fest alumni - Micky Yun Chan\" class=\"wp-image-970374\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Micky-Yun-Chan_360x360.jpg 360w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Micky-Yun-Chan_360x360-300x300.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Micky-Yun-Chan_360x360-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Micky-Yun-Chan_360x360-180x180.jpg 180w\" sizes=\"auto, (max-width: 360px) 100vw, 360px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"sharvani-somayaji\">Micky Yun Chan<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/micky-chan-bb9437137\/\" target=\"_blank\" rel=\"noopener noreferrer\">Micky<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> recently completed his computer science master degree in Erasmus Mundus Software Engineer For Green Deal (SE4GD) programme. He believes that open source development can and will play a big part for many years to come and he likes to explore random new open source projects in his free time. One of his goals is to create a successful open source project.<\/p>\n\n\n\n<p>Outside of programming, he likes to play board games and real-time strategy LAN games.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.youtube.com\/watch?v=MieJZ32bxdU&t=147s\" target=\"_blank\" rel=\"noopener noreferrer\">Testing Infrastructure for VowpalWabbit<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Several improvements can be proposed for the current approach to end-to-end testing in Vowpal Wabbit. The existing methodology generates expected output by executing the exact same command that is under evaluation, potentially introducing challenges to the robustness of the tests. Additionally, the current approach operates without assumptions about the nature of the data it is trained on. Furthermore, it lacks the capability to facilitate the implementation of tests using hyperparameter grids, which can result in increased implementation costs for Vowpal Wabbit.<\/p>\n\n\n\n<p>To address these concerns, a new domain-specific language has been developed to facilitate the creation of end-to-end test configurations. This language includes support for defining hyperparameter grids,&nbsp;pluggable data stimulators and assertion functions, thereby enhancing the testing process.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-full is-resized is-style-rounded\"><img loading=\"lazy\" decoding=\"async\" width=\"360\" height=\"360\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Stelios-Stavroulakis_360x360.jpg\" alt=\"RL Open Source Fest alumni - Stelios Stavroulakis\" class=\"wp-image-970359\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Stelios-Stavroulakis_360x360.jpg 360w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Stelios-Stavroulakis_360x360-300x300.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Stelios-Stavroulakis_360x360-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Stelios-Stavroulakis_360x360-180x180.jpg 180w\" sizes=\"auto, (max-width: 360px) 100vw, 360px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"sharvani-somayaji\">Stelios Stavroulakis<\/h3>\n\n\n\n<p>Stelios Stavroulakis is a PhD student at the University of California, Irvine. For the past 5 years, he&#8217;s worked on reinforcement learning, focusing on applications in privacy, warehouse scheduling, and large language model hallucination control. His theoretical work is focused on exploring the intersection between reinforcement learning and game theory. Stelios has the ambition to incorporate his academic expertise to address a plethora of open problems in industry.<\/p>\n\n\n\n<p><strong>Demo:<\/strong> <strong><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.youtube.com\/watch?v=MieJZ32bxdU&t=725s\" target=\"_blank\" rel=\"noopener noreferrer\">Optimizing In-Context Learning in LLMs<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Our project focused on improving the way large language models (LLMs) respond by choosing the right examples for in-context learning.<\/p>\n\n\n\n<p>We addressed this by viewing it as a contextual bandit problem. By using a method called reduction to regression [1], we upper bound the regret of the contextual bandit algorithm by the performance of the regressor. Additionally, we utilized CappedIGW [2], ensuring our algorithm can handle a large action-space effectively. A standout feature is the regressor-agnostic nature of this method which allows the incorporation of powerful transformer models that excel in extracting semantic information from text. While we explored various techniques like bert and sentence transformers, progressive example fine-tuning (PEFT) emerged as the most promising approach.<\/p>\n\n\n\n<p>In light of our preliminary findings, there&#8217;s clear promise in improving in-context learning via interaction. By decoupling prompt learning from the LLM, we pave the way for tailored personalization as well as striking a balance between computational expense and performance.<\/p>\n\n\n\n<p>[1] Dylan J. Foster, & Alexander Rakhlin. (2020). Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles.<\/p>\n\n\n\n<p>[2] Rucker, M., Zhu, Y., & Mineiro, P. (2023). Infinite Action Contextual Bandits with Reusable Data Exhaust. arXiv preprint arXiv:2302.08551.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity is-style-end-mark\"\/>\n\n\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized is-style-rounded\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Sharvani-Somayaji_300x300-150x150.jpg\" alt=\"RL Open Source Fest alumni - Sharvani Somayaji\" class=\"wp-image-868839\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Sharvani-Somayaji_300x300-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Sharvani-Somayaji_300x300.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Sharvani-Somayaji_300x300-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"sharvani-somayaji\">Sharvani Somayaji<\/h3>\n\n\n\n<p>Sharvani is a senior-year undergraduate student studying Electrical and Electronics Engineering at the National Institute of Technology Karnataka, India. Her interests lie in the fields related to AI, robotics, and web development, particularly in NLP and Reinforcement Learning. She likes to contribute to open source and collaborate with the community. Outside of work, she loves playing badminton, jogging, and drawing.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/jnkxDdZF_gc?t=1595\" target=\"_blank\" rel=\"noopener noreferrer\">Improve flatbuffer parser support in VW<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Vowpal Wabbit has several file inputs, examples, cache, and models. FlatBuffers is an efficient cross-platform serialization library for languages including C++, C#, C, Go, and Java. Improving flatbuffer parser support in VowpalWabbit will provide a new high-performance alternative to existing input data formats. This project focuses on improving the serialized size of the current flatbuffer format and measuring performance.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized is-style-rounded\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Ivoline-Ngong_300x300-150x150.jpg\" alt=\"RL Open Source Fest alumni - Ivoline Ngong\" class=\"wp-image-868833\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Ivoline-Ngong_300x300-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Ivoline-Ngong_300x300.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Ivoline-Ngong_300x300-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"ivoline-ngong\">Ivoline Ngong<\/h3>\n\n\n\n<p>Ivoline is a 2nd-year Ph.D. student in Computer Science at the University of Vermont and a research scientist at OpenMined. Her research interests broadly revolve around all aspects of machine learning; theory, algorithms, and applications. Currently, she focuses on provable fairness and privacy-preserving machine learning including differential privacy, federated learning, and secure multiparty computation. She can usually be found in the kitchen whipping up a savory meal or getting lost in the plot of a good book when she\u2019s not working.&nbsp;&nbsp;<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/jnkxDdZF_gc?t=781\" target=\"_blank\" rel=\"noopener noreferrer\">Compiler Optimizations using Reinforcement Learning<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Expert-picked sequences are used in compilers to optimize performance for the conversion of human-written programs into executable binaries. These heuristics are developed by experts who spend hours tweaking compiler knobs resulting in smaller and faster binaries. By replacing these complex heuristics with reinforcement learning(RL) policies, we aim to enable compilers to automatically optimize code without using prefixed parameters or predefined ordering. Specifically, we try to tackle the phase-ordering problem, performing code size and runtime reduction in the LLVM compiler using RL environments, intermediate representations, and datasets provided by CompilerGym. Integrating the gym environment with Vowpal Wabbit RL agents like contextual bandits, we are able to obtain code size reductions. Furthermore, benchmarking the performance of different deep learning-based RL agents, as well as experiments in different observation spaces, shows promising results. Future work can focus on improving datasets and observation features to make them more representative as well as developing better reinforcement learning agents.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized is-style-rounded\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Shaokun-Zhang_300x300-150x150.jpg\" alt=\"RL Open Source Fest alumni - Shaokun Zhang\" class=\"wp-image-868836\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Shaokun-Zhang_300x300-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Shaokun-Zhang_300x300.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Shaokun-Zhang_300x300-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"shaokun-zhang\">Shaokun Zhang<\/h3>\n\n\n\n<p>Shaokun Zhang is a first-year Ph.D. student in the College of Information Sciences and Technology at Pennsylvania State University. His primary research interests are automatic machine learning. Currently, his research has a special focus on AutoML in the data stream setting. He has contributed to several open-source projects such as AutoML library FLAML at Microsoft Research. Shaokun is passionate about doing research and coding. He wishes to do some impact works on AI research in his academic career. Outside of work, he loves traveling, reading, and running.&nbsp;<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/jnkxDdZF_gc?t=75\" target=\"_blank\" rel=\"noopener noreferrer\">Automl Extensions<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>In machine learning, AutoML is the process of applying machine learning (ML) models to real-world problems using automation. It automates the selection, composition, and parameterization of machine learning models. It aims to allow non-experts to make use of machine learning models and free people from repeated work of model development. As for VW, the target of AutoML reduction is to provide users with a hands-off method to get an optimal learning configuration without prior experience using VW or an in-depth understanding of their dataset. A &#8220;configuration&#8221; is the basic element of AutoML and a general term that can be extended to any aspect of VW. We defined it as a set of namespace interactions in a contextual bandit problem. However, VW only supports quadratics interaction in AutoML reduction, which greatly limited the application scope and feature richness. We rewrite the data structures of configuration in VW. In this way, it will support storing interactions of arbitrary size and provide a more flexible interface for future development. We also design a mechanism that extends AutoML to add and drop cubic interactions. It will greatly enlarge the search space and expands the application scope of the AutoML algorithm.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized is-style-rounded\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Songlin-Jiang_300x300-150x150.jpg\" alt=\"RL Open Source Fest alumni - Songlin Jiang\" class=\"wp-image-868830\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Songlin-Jiang_300x300-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Songlin-Jiang_300x300.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Songlin-Jiang_300x300-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"songlin-jiang\">Songlin Jiang<\/h3>\n\n\n\n<p>Songlin Jiang is from China and recently (2022 Summer) completed his bachelor&#8217;s degree in specialized class for fundamentals and theories of Computer Science and Technology (Hons) at Lanzhou University. Songlin is also an incoming student (2022 Fall) for Security and Cloud Computing (SECCLO) Erasmus Mundus joint master and receives full-ride scholarship support. He will study at two universities in the European Union and start his first year at Aalto University in Finland. Songlin is enthusiastic about open source. He is a member of the openSUSE for his continuous contribution during and after Google Summer of Code 2021. Distributed machine learning and related security issues are Songlin&#8217;s research interests. As Moore&#8217;s Law is starting to fail, he believes that distributed systems will be the future infrastructure to support the growing need for computing power in machine learning.&nbsp;<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/jnkxDdZF_gc?t=2276\" target=\"_blank\" rel=\"noopener noreferrer\">Native CSV parsing<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Our project introduces a <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/nam06.safelinks.protection.outlook.com\/?url=https%3A%2F%2Fgithub.com%2FVowpalWabbit%2Fvowpal_wabbit%2Fpull%2F4073&data=05%7C01%7Cbmuller%40microsoft.com%7C947f06e254014ea8584c08da744a85a9%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637950160686088397%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=Iayq4xZLcLJhF17rsBIW1G3GNeTBu7EYhcuLwcOUcWw%3D&reserved=0)\" target=\"_blank\" rel=\"noopener noreferrer\">native CSV parser<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> making VW recognize the CSV format. The parser follows the RFC 4180 and MIME standards, with specifications of CSV header format adapting to VW training and prediction needs. Our project reaches 100% test and code coverage, and the parsing performance is comparable to the VW format parser. We also write a tutorial for using the CSV parser. Here are the design details:&nbsp;<\/p>\n\n\n\n<p>Because alternative delimiter-separated files are often given a `.csv` extension despite using a non-comma field separator, the parser allows specifying the CSV field separator. In addition, we always need a CSV header to work correctly. If the header doesn&#8217;t exist or suit VW parsing needs, we can tell the parser the correct one with command line options. In this case, users do not need to edit the dataset after downloading it from the Internet.&nbsp;<\/p>\n\n\n\n<p>For the format of the headers, we use `_label` and `_tag` to mark the label and tag column and `|` to separate the other column&#8217;s namespace and feature name. To ensure there will always be an equivalent CSV file for VW format files, we make the label data format the same as the VW labels&#8217;. The parser also supports scaling the namespace values by specifying the ratio in command line options. Since multi-line examples often mean different lines have different schemas, CSV is unsuitable for them. However, suppose all the cells in a line are empty, we will still mark it as a new line. In this case, users can still express multi-line examples in CSV files, although it is not listed as supported.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity is-style-end-mark\"\/>\n\n\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/02\/Muhamma-Ammar-Amid-v2-150x150.jpg\" alt=\"portrait of Muhamma Ammar Amid\" class=\"wp-image-822565\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/02\/Muhamma-Ammar-Amid-v2-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/02\/Muhamma-Ammar-Amid-v2.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/02\/Muhamma-Ammar-Amid-v2-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"muhammad-ammar-abid\">Muhammad Ammar Abid<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/muhammad-ammar-abid-3916191a3\/\" target=\"_blank\" rel=\"noopener noreferrer\">Muhammad Ammar Abid<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is a senior year undergraduate at NUCES (FAST) Peshawar campus in Pakistan. He has done more than 150 online courses, spending more than 800 hours online the past few years learning anything beneficial, new and exciting related to technology and his journey for learning still continues. Ammar\u2019s goal is to empower himself and other people through learning technology while believing the famous saying that knowledge is something nobody can take away from you. Ammar\u2019s Final Year Project is Real Time Pakistani Currency Detection for Visually Impaired aimed at solving the less accessible products\u2019 problem of currency detection in Pakistan. Ammar believes that his collaboration in Vowpal Wabbit will empower people around the globe and help them to achieve more. Ammar\u2019s future priority is to end up at a diverse and inclusive place where his collaboration can solve real world problems while empowering people in the process.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=1501\" target=\"_blank\" rel=\"noopener noreferrer\">Tensorboard and Tensorwatch Integration<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Visualization is necessary for brain to process large information. Tensorboard provides visualization and tooling necessary which can be effectively used for machine learning experimentation. Integrated in the Vowpal Wabbit ecosystem, this could help the users to focus more on the problem and extract meaningful results with help of visualization. Currently, we have extended the Vowpal Wabbit Python bindings to support outputting progress updates and model details to Tensorboard.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Wilson-Cheung-150x150.jpg\" alt=\"portrait of Wilson Cheung\" class=\"wp-image-822562\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Wilson-Cheung-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Wilson-Cheung.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Wilson-Cheung-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"wilson-cheung\">Wilson Cheung<\/h3>\n\n\n\n<p>Wilson Cheung recently completed his M.S. in Analytics from Georgia Institute of Technology. He previously worked as a data scientist at Booz Allen Hamilton where he developed many artificial intelligence and data engineering capabilities to address technological problems faced by government clients in healthcare and defense industries. He currently works as a data scientist at Amazon Web Services and is hoping to develop his professional career towards AI-based personalization using reinforcement learning and bandit-based methods learned in the participation of RLOS.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=30\" target=\"_blank\" rel=\"noopener noreferrer\">Extend FairLearn to include RL bias analysis using VW<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Analyzing outcomes of a learned policy in an online setting through contextual bandits poses many unforeseen consequences in the study of responsible AI. Particularly, given that the objective for many RL systems is to maximize accumulated rewards, any policy can potentially carry latent harms throughout the online evaluation process. By using logged contextual bandits data generated via Vowpal Wabbit under an assumed logged policy, we compute FairLearn-supported fairness metrics to identify these harms and use counterfactual analysis to assess the quality of evaluation policies. Lastly, we identify ways to mitigate the impacts of these harms under the reduced weighted multi-class classification setting.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Monika-Farsang-150x150.jpg\" alt=\"portrait of Monika Farsang\" class=\"wp-image-822547\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Monika-Farsang-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Monika-Farsang.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Monika-Farsang-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"monika-farsang\">M\u00f3nika Farsang<\/h3>\n\n\n\n<p>M\u00f3nika Farsang is from Budapest, Hungary. She recently graduated from the Budapest University of Technology and Economics, where she studied Mechatronics Engineering. Her specialization was in the field of Intelligent Embedded Systems. She is passionate about solving challenging problems and likes to contribute to open-source projects, because she loves to see that science and technology can become more accessible to everyone. Her research interests are machine learning, in particular reinforcement learning, bio-inspired solutions, and robotics. She hopes to join a PhD program, where she can continue her studies in this area.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=2288\" target=\"_blank\" rel=\"noopener noreferrer\">Safe Contextual Bandits<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Contextual bandit algorithms optimize the mean value of the reward distribution without paying attention to worst-case scenarios. However, there are many safety-critical domains where this kind of behavior is undesirable. Consequently, our goal is to implement safe contextual bandits which focus specifically on the worst cases. To achieve this, we use conditional value at risk (CVaR), which means the expected return in the worst q% of the cases. By using this, we optimize the average of the tail instead of the average of the whole distribution to maintain safety and avoid choosing bad actions. CVaR has a dual representation mathematically, which results in off-policy learning with passing modified rewards to the contextual bandit. By leveraging this formulation, we optimize the cut-off value of the distribution online. To optimize the cut-off point, we use a no-regret algorithm called FreeGrad, which is practical because it does not contain any hyperparameter tuning. The results of our project demonstrate that better worst-case behavior can be achieved by optimizing the CVaR of the distribution compared to typical contextual bandit policies.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Nishant-Kumar-150x150.jpg\" alt=\"portrait of Nishant Kumar\" class=\"wp-image-822553\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Nishant-Kumar-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Nishant-Kumar.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Nishant-Kumar-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"nishant-kumar\">Nishant Kumar<\/h3>\n\n\n\n<p>Nishant is currently pursuing his bachelor\u2019s in Electronics Engineering from the Indian Institute of Technology (BHU), Varanasi, India. As an undergraduate researcher, most of his work has been broadly in AI, with a particular focus on Reinforcement Learning, Multiagent systems and communication, and game AI. He also specializes in writing code to develop and maintain intelligent systems. He has worked on a diverse set of problems in AI, from implementing efficient RL agents using C++ code to enabling and enhancing parallel processing capabilities in machine learning libraries. Apart from that, he likes contributing to open-source code and reading about cybersecurity, blockchain, astronomy, and human psychology.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=660\" target=\"_blank\" rel=\"noopener noreferrer\">VW Parallel parsing improvements<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Vowpal Wabbit is known for its blazing-fast performance. However, VW\u2019s parsers can be a bottleneck for most operations, so an effective way to multithread the parsers is required to unleash their true potential. Last year, parallel parsing support for text input format was provided. This project builds upon that by providing a better and more efficient way to read and write cache, support for multiple passes, multiline examples, and JSON\/DsJSON input formats.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Milena-Mathew-150x150.jpg\" alt=\"portrait of Milena Mathew\" class=\"wp-image-822544\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Milena-Mathew-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Milena-Mathew.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Milena-Mathew-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"milena-mathew\">Milena Mathew<\/h3>\n\n\n\n<p>Milena Mathew is an undergraduate majoring in Electrical Engineering and Computer Science at UC Berkeley. She\u2019s broadly interested in the intersection of computer science and physics and has previously worked on applying machine learning techniques to problems in the natural sciences. After graduating, Milena hopes to work in industrial R&D. In her free time, you can typically find her tackling the latest crossword or baking a batch of cookies.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=2901\" target=\"_blank\" rel=\"noopener noreferrer\">Safe Contextual Bandits<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Contextual bandit algorithms are typically designed to maximize the expected reward over time. However, in systems where there\u2019s a safety constraint, avoiding bad actions may be valued over purely maximizing reward. We aim to account for both of these goals by developing a chance-constrained policy optimizing learner. Chance constrained policy optimization takes advantage of additional observed feedback to determine the probability of a decision violating a constraint while still optimizing for reward. We implemented the learner in Coba- a contextual bandit algorithm benchmarking application- and added functionality to accommodate multiple observations. We aim to show the feasibility of this approach by comparing the amount of constraint-violating behavior in our learner versus traditional contextual bandit algorithms.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Krystal-Maughan-150x150.jpg\" alt=\"portrait of Krystal Maughan\" class=\"wp-image-822538\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Krystal-Maughan-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Krystal-Maughan.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Krystal-Maughan-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"krystal-maughan\">Krystal Maughan<\/h3>\n\n\n\n<p>Krystal Maughan is from Trinidad and Tobago. She is currently pursuing a PhD in Computer Science, minoring in Pure Mathematics at the University of Vermont, focusing on isogeny-based cryptography with Christelle Vincent and Joe Near. She has previously published in Fairness and Privacy for workshops at NeuRIPS and MD4SG, contributed to open source for Haskell.org for Google Summer of Code and Mozilla\u2019s RustReach and interned at Apple, Microsoft, Mercury (a Haskell fintech), and Autodesk. She graduated with a Bachelor\u2019s in Film, Photography and Visual Arts, and a double minor in Art History and Technical Theatre from Ithaca College. Before grad school, she worked in Hollywood as a lighting and camera technician for high speed lighting and camera R&D tech startups, did a workshop at the Jet Propulsion Laboratory and learned to sail. She wants to continue isogeny \/ mathematical cryptography research after her PhD.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=2354\" target=\"_blank\" rel=\"noopener noreferrer\">Integrate Estimator Library into Azure Machine Learning (AML) Pipeline<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>In this work, we create a complete end-to-end pipeline for the user, from loading the data as a stream of decisions done by a reinforcement learning system and information of policies that we are trying to estimate counterfactually, running estimators from the estimators library as configured by the user, to running local and distributed computation on the compute clusters of Azure Machine Learning (AML) and visualizing the aggregated result for each estimator locally as an output of aggregated base on number of events counterfactuals for given policies\/estimators.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Jui-Pradhan-150x150.jpg\" alt=\"portrait of Jui Pradhan\" class=\"wp-image-822535\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Jui-Pradhan-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Jui-Pradhan.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Jui-Pradhan-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"jui-pradhan\">Jui Pradhan<\/h3>\n\n\n\n<p>Jui Pradhan is a final year student pursuing B.E. Computer Science and MSc. Economics from BITS Pilani. Her current research interests involve Artificial intelligence, Federated Learning, Algorithmic Game theory and Optimization. She has worked on projects at the intersection of Reinforcement Learning, NLP and Information retrieval. Before interning at Microsoft as a Vowpal Wabbit contributor, she was a mentor at Google Summer of Code- Sugarlabs and has contributed to several open-source projects. Outside of work, she loves to paint, write and explore different forms of creativity and art. After graduating in 2023, she hopes to work on impactful AI-driven projects and work part-time on research projects at academic research labs.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=2954\" target=\"_blank\" rel=\"noopener noreferrer\">Integrate Estimator Library into Azure Machine Learning (AML) Pipeline<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>The estimator library is a collection of estimators to perform off-policy evaluation. The current prototype of the estimator library lacked a clear structure, which made it hard to install and consume. Moreover, for researchers to contribute by adding new estimators, it is imperative for them to know which interfaces to use according to their problem type(cb, ccb, slates, ca, etc). Therefore, as a part of this project, we added interfaces for each problem type, worked on CI improvements, added new estimators, added tests, structured the estimator library and finally released it to PyPI as vw-estimators. Our work will capacitate the estimator library to be consumed by AML pipeline and end-users.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Manav-Headshot-150x150.jpg\" alt=\"portrait of Manav Singhal\" class=\"wp-image-822541\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Manav-Headshot-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Manav-Headshot.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Manav-Headshot-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"manav-singhal\">Manav Singhal<\/h3>\n\n\n\n<p>Manav Singhal&nbsp;is a senior year undergraduate student at the National Institute of Technology Karnataka, India studying Electrical and Electronics Engineering. His current research interests lie in improving the deployment of reinforcement learning in real-world scenarios and increasing the interpretability of machine learning models. Besides work, Manav loves reading, traveling, and running! After graduating, he wishes to pursue his graduate studies in Computer Science.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=1\" target=\"_blank\" rel=\"noopener noreferrer\">Empirical Analysis of Privacy Preserving Learning<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>In many real-world learning scenarios, due to privacy constraints (for example, General Data Protection Regulation), one cannot use the user feature mapping directly for personalization. In order to uphold the privacy of the user, we aim to study the effect of using aggregated data for learning. We explore the notion of \u201caggregation\u201d by saving only those features after training that have crossed a certain threshold of users. This project focuses on comparing the performance of the model without aggregation (public model) and the model with aggregation (private model), thus understanding how much this filtering helps in the learning process.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Varun-Suryan-150x150.jpg\" alt=\"portrait of Varun Suryan\" class=\"wp-image-822556\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Varun-Suryan-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Varun-Suryan.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Varun-Suryan-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"varun-suryan\">Varun Suryan<\/h3>\n\n\n\n<p>Varun Suryan grew up in northern India. He is a final-year Ph.D. student in Computer Science at the University of Maryland. His interests lie in reinforcement learning (RL), multi-armed bandits, and robotics. His Ph.D. focuses on improving the sample efficiency of RL agents with the help of simulators. He attended the Indian Institute of Technology Jodhpur and Virginia Tech for his B.Tech. and MS in Mechanical Engineering and Computer Engineering respectively. Varun is passionate about technology and loves collaborating with people from various domains. In the future, he wishes to pursue his work in RL and AI. In his spare time, he runs and plays tennis.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=737\" target=\"_blank\" rel=\"noopener noreferrer\">AutoML for Online Contextual Bandits<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>We propose the ChaChaCB algorithm for making online feature interaction choices for contextual bandits. This is crucial in online learning services which can significantly benefit from online autoML style algorithms to automatically choose hyperparameters\/configs. Currently, most of the tuning is done manually. This problem has been studied before under the full information (supervised) setting where each configuration has access to the revealed feedback from the environment. However, bandits present unique challenges, and not every configuration gets to receive feedback from the environment. By using importance weight to update the loss bounds for a subset of configurations, ChaChaCB performs competitively with several baselines. Further, we plan to integrate ChaChaCB as a learner in Coba \u2013 a standardized framework to test contextual bandit learners.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft size-thumbnail is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"150\" height=\"150\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Vishal-Vinod-150x150.jpg\" alt=\"portrait of Vishal Vinod\" class=\"wp-image-822559\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Vishal-Vinod-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Vishal-Vinod.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Vishal-Vinod-180x180.jpg 180w\" sizes=\"auto, (max-width: 150px) 100vw, 150px\" \/><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"vishal-vinod\">Vishal Vinod<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/vishal-v-link\/\" target=\"_blank\" rel=\"noopener noreferrer\">Vishal Vinod<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is a Computer Science master\u2019s student at University of California, San Diego. His current research interests are in continual learning, 3D computer vision and domain adaptation to improve the performance of autonomous systems in real-world scenarios. Vishal is strongly motivated by AI4SocialGood and aims to work on socially impactful AI research applications. Apart from research, he is involved in the open-source community and reads up on developmental economics.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=1420\" target=\"_blank\" rel=\"noopener noreferrer\">VW feature transformation without redeploying the source<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Feature transformation are necessary to prototype, mutate or compare the performance of a trained model. Currently, creating a feature mutation in VW requires the implementation of a new C++ reduction for each mutation, making it harder to work on new ones. This project simplifies feature modifications by implementing a generic reduction such that example modification functions can be registered with the interface and used without having to implement a reduction for each transformation. This allows mutation functions such as deleting a feature, dropout, normalization, logarithmic mutation, feature binarization, etc. to be registered on the stack for the model in memory without redeploying the source. The generic reduction enables feature engineering pipelines using only callable functions to modify the example, and also allows comparing the performance of models with and without mutations in a single run for benchmarking and avoids the step of transforming the data by other means.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity is-style-end-mark\"\/>\n\n\n\n\n\n<figure class=\"wp-block-image alignleft is-resized\"><a href=\"https:\/\/www.linkedin.com\/in\/magarw10\/\"><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"300\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/milind-agarwal.jpg\" alt=\"head shot of Milind Agarwal\" class=\"wp-image-690198\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/milind-agarwal.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/milind-agarwal-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/milind-agarwal-180x180.jpg 180w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"milind-agarwal\">Milind Agarwal<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/magarw10\/\" target=\"_blank\" rel=\"noopener noreferrer\">Milind Agarwal<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is a combined undergraduate and master\u2019s student in Computer Science at Johns Hopkins University. His current research interests are natural language processing and machine translation for low-resource and endangered language settings. Before interning at Microsoft, he previously worked in many different academic research labs at Johns Hopkins gaining experience in a wide variety of fields including NLP, machine translation, computational biology, data visualization, and software development. After graduation in 2021, he hopes to join a Ph.D. program where he can continue to work on challenging NLP problems.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=1230\" target=\"_blank\" rel=\"noopener noreferrer\">Challenge: Contextual Bandit Data Visualization with Jupyter Notebooks<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Exploratory data analysis and data visualization have become an essential part of any data scientist\u2019s toolkit. Visualizations not only allow you to kickstart your analysis by easily understanding the patterns in your data but also help you visually inspect your policies to understand their behaviour. We present cb_visualize, a python-based visualization library specialized for contextual bandits features and policy visualizations. This library offers robust visualizations for data exploration, training, feature importance, and action distributions and supports common contextual-bandit dataset formats used by Vowpal Wabbit like text, JSON, and DSJSON. We hope that this toolkit will be an asset for researchers and customers alike to better present and understand their data and analyses.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft is-resized\"><a href=\"https:\/\/www.linkedin.com\/in\/sharad-chitlangia\/\"><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"300\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/sharad-chitlangia.jpg\" alt=\"head shot of Sharad Chitlangia\" class=\"wp-image-690195\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/sharad-chitlangia.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/sharad-chitlangia-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/sharad-chitlangia-180x180.jpg 180w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"sharad-chitlangia\">Sharad Chitlangia<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/sharad-chitlangia\/\" target=\"_blank\" rel=\"noopener noreferrer\">Sharad Chitlangia<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is a senior year undergraduate student at BITS Pilani Goa, where I studied Electronics. I am specializing in the field of Artificial Intelligence. I\u2019ve previously worked heavily at the intersection of Machine Learning and Systems and Explainable AI. Aside from work, I spend a lot of time in the Open Source Community and working on improving accessibility, especially in AI research.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=72\" target=\"_blank\" rel=\"noopener noreferrer\">Challenge: Pushing the Limits of VW with Flatbuffers<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>VowpalWabbit is known for its abilitiy to solve complex machine learning problems extremely fast. Through this project, we aim to take this ability, even further, by the introduction of Flatbuffers. Flatbuffers is an efficient cross-platform serialization library known for its memory access efficiency and speed. We develop Flatbuffer schemas, for input examples, to be able to store them as binary buffers and show a performance increase of 30%, or more compared to traditional formats.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft is-resized\"><a href=\"https:\/\/www.linkedin.com\/in\/harish-kamath\/\"><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"300\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/harish-kamath.jpg\" alt=\"head shot of Harish Kamath\" class=\"wp-image-690186\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/harish-kamath.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/harish-kamath-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/harish-kamath-180x180.jpg 180w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"harish-kamath\">Harish Kamath<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/harish-kamath\/\" target=\"_blank\" rel=\"noopener noreferrer\">Harish Kamath<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is a Computer Science\/Math undergraduate at Georgia Tech. His passions focus on reinforcement learning, generalization in learning, and making newer technologies cheaper, faster, and more accessible. Outside of work, I love playing\/watching basketball, dance, and running! Once I graduate, I hope I end up somewhere where I can make the biggest lasting impact for the most people.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=2317\" target=\"_blank\" rel=\"noopener noreferrer\">Challenge: Conversion of VowpalWabbit models into ONNX format<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Currently, ONNX is the leading standard to represent machine learning models across platforms and frameworks. It describes a model as a computational graph consisting of a set of standard operators from an operator set that is constantly evolving to accommodate new types of models and operations. Being able to describe a model in ONNX format is important, as it allows for (1) models to be optimized and run across different architectures using a single runtime, and (2) it allows models created in different frameworks to interact with each other. Although other leading frameworks such as Tensorflow and Pytorch have mature tools to convert into ONNX format, VowpalWabbit today does not yet have the capability baked into the framework. This project focuses on introducing this functionality to VowpalWabbit, so that we can combine the fast model training and inference speed of VW with the representational capacity of other frameworks. We introduce new sparse operators that are used to instantiate VW regression models efficiently in ONNX format, show that you can directly translate regression and contextual bandits models with these operators, and give an example of such models being run in RLClientLib to show that they can now be ported into any inference framework.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft is-resized\"><a href=\"https:\/\/www.linkedin.com\/in\/cassandra-marcussen-2b13b0142\/\"><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"300\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/cassandra-marcussen.jpg\" alt=\"head shot of Cassandra Marcussen\" class=\"wp-image-690183\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/cassandra-marcussen.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/cassandra-marcussen-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/cassandra-marcussen-180x180.jpg 180w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"cassandra-marcussen\">Cassandra Marcussen<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/cassandra-marcussen-2b13b0142\/\" target=\"_blank\" rel=\"noopener noreferrer\">Cassandra Marcussen<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is a junior at Columbia University studying Mathematics and Computer Science. Her interests lie in artificial intelligence, theoretical computer science, and contributing to technology within these fields through efficient computing and low-level optimizations. Cassandra is enthusiastic about open source code, and has loved working on an impactful open source system such as Vowpal Wabbit. In the future, she wishes to pursue graduate studies in Computer Science.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=1177\" target=\"_blank\" rel=\"noopener noreferrer\">Challenge: Parallelized Parsing<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Modern machines often utilize many threads to achieve good performance. Currently, VW uses a single thread to read in and parse input, and a single thread to learn. The parse thread presents a bottleneck, slowing down VW as a whole. By extracting the input reading into a separate thread and extending the parser to support many threads, VW can better utilize resources, achieve better performance, and have an improved design by separating logical components into independent modules. This project focuses on improving performance and design for the text input format, and also ensures compatibility with the cache input format.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft is-resized\"><a href=\"https:\/\/www.linkedin.com\/in\/newtonmwai\/\"><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"300\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/newton-mwai-300x300.jpg\" alt=\"head shot of Newton Mwai\" class=\"wp-image-690192\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/newton-mwai.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/newton-mwai-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/newton-mwai-180x180.jpg 180w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"newton-mwai-kinyanjui\">Newton Mwai Kinyanjui<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/newtonmwai\/\" target=\"_blank\" rel=\"noopener noreferrer\">Newton Mwai Kinyanjui<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is from Nairobi Kenya. I\u2019m currently pursuing my Ph.D. at Chalmers University of Technology in Sweden, working in causal inference and reinforcement learning towards machine learning for improved decision making in healthcare with Fredrik Johansson. I graduated from Carnegie Mellon University Africa with a Master of Science in Electrical and Computer Engineering.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=248\" target=\"_blank\" rel=\"noopener noreferrer\">Challenge: Library of contextual bandit estimators<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Estimators are used in off-policy evaluation. One common estimator is IPS, and others are DR and PseudoInverse. These estimators work better or worse in different settings. This project explores reference implementations of each and allows for comparison between them to aid in understanding. We extend the estimators library and implement an interface to help researchers and data scientists test different estimators quickly and easily.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-text-color has-blue-color has-alpha-channel-opacity has-blue-background-color has-background is-style-dots\"\/>\n\n\n\n<figure class=\"wp-block-image alignleft is-resized\"><a href=\"https:\/\/markrucker.net\/\"><img loading=\"lazy\" decoding=\"async\" width=\"300\" height=\"300\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/mark-rucker.jpg\" alt=\"head shot of Mark Rucker\" class=\"wp-image-690189\" style=\"width:75px;height:75px\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/mark-rucker.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/mark-rucker-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/mark-rucker-180x180.jpg 180w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><\/a><\/figure>\n\n\n\n<h3 class=\"wp-block-heading\" id=\"mark-rucker\">Mark Rucker<\/h3>\n\n\n\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/markrucker.net\/\" target=\"_blank\" rel=\"noopener noreferrer\">Mark Rucker<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is currently a 2nd year PhD student at the University of Virginia with a previous 8-year career as an enterprise software engineer. Mark\u2019s PhD research explores how reinforcement learning models can be used to encourage health behavior change in individuals managing chronic health conditions. This research combines state of the art machine learning with web and mobile app development to support in-situ randomized control trials of behavior change interventions. After graduation Mark hopes to once again return to industry in order to develop high-quality products that deeply impact people\u2019s lives.<\/p>\n\n\n\n<p><strong>Demo: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=2209\" target=\"_blank\" rel=\"noopener noreferrer\">Challenge: COBA: A Modern Benchmarking Package for Reproducible Contextual Bandit Research<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/strong><\/p>\n\n\n\n<p>Performance benchmarking on well-defined problems is a pillar of modern machine learning research. With clear problems and metrics, benchmarking has allowed the research community to maintain a high-level of independent effort while still making real and meaningful progress over time. The elegance of benchmarking \u2014 define, measure, repeat \u2014 however, belies real engineering challenges such as software maintenance, data distribution, statistical aggregation, and reproducibility to name a few. These challenges are especially salient in contextual bandit research where one not only needs a data set but also a harness to emulate interaction with the data. In an effort to reduce these burdens, while not losing any of benchmarking\u2019s benefits, we present COBA, an ultra light-weight Python package for benchmarking contextual bandit algorithms. COBA uses a small set of clean and consistent interfaces to satisfy four core use cases: (1) creating reproducible benchmarks, (2) sharing reproducible benchmarks, (3) evaluating custom algorithms, and (4) exploring evaluation results.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity is-style-end-mark\"\/>\n\n\n\n\n\n<div style=\"height:15px\" aria-hidden=\"true\" class=\"wp-block-spacer\"><\/div>\n\n\n\n\n\n<h3 class=\"wp-block-heading\" id=\"frequently-asked-questions\">Frequently asked questions<\/h3>\n\n\n\n\n\n<p>No. RL Open Source Fest is an activity that the student performs as an independent developer in collaboration with the real world reinforcement learning team at Microsoft Research, for which they are paid a stipend.<\/p>\n\n\n\n\n\n<p>Yes. Think of this as a fun project on the side.<\/p>\n\n\n\n\n\n<p>As much as you like!<\/p>\n\n\n\n\n\n<p>The program occurs entirely online. There is no requirement to travel as part of the program.<\/p>\n\n\n\n\n\n<p>You must be currently enrolled in an accredited academic institute\u2019s undergraduate, Masters, or PhD program. Check out the application process <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/academic-program\/rl-open-source-fest\/apply\/\">here<\/a>.<\/p>\n\n\n\n\n\n<p>Additional questions? Feel free to send them to us at <a href=\"mailto:RLOSFEST@microsoft.com\">RLOSFEST@microsoft.com<\/a>.<\/p>\n\n\n","protected":false},"featured_media":718090,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":true,"_classifai_error":"","msr_hide_image_in_river":0,"footnotes":""},"msr-opportunity-type":[187426],"msr-region":[256048],"msr-locale":[268875],"msr-program-audience":[243724],"msr-post-option":[],"msr-impact-theme":[],"class_list":["post-633669","msr-academic-program","type-msr-academic-program","status-publish","has-post-thumbnail","hentry","msr-opportunity-type-challenges","msr-region-global","msr-locale-en_us","msr-program-audience-students"],"msr_description":"The Reinforcement Learning (RL) Open Source Fest is a global online program focused on introducing students to open source\u202freinforcement learning programs and software development\u202fwhile working alongside\u202fresearchers, data scientists, and engineers\u202fon the\u202fReal World Reinforcement Learning\u202fteam\u202fat Microsoft Research NYC. Students\u202fwill\u202fwork on a\u202ffour-month\u202fresearch programming project\u202fduring their break from university (May-August 2020). Accepted students will receive a\u202f$10,000 USD\u202fstipend.","msr_social_media":[],"related-researchers":[{"type":"user_nicename","display_name":"Jacob Alber","user_id":36747,"people_section":"Microsoft Reinforcement Learning team","alias":"jaalber"},{"type":"user_nicename","display_name":"Rajan Chari","user_id":36765,"people_section":"Microsoft Reinforcement Learning team","alias":"ranaras"},{"type":"user_nicename","display_name":"Miro Dud\u00edk","user_id":32867,"people_section":"Microsoft Reinforcement Learning team","alias":"mdudik"},{"type":"user_nicename","display_name":"Dylan Foster","user_id":40330,"people_section":"Microsoft Reinforcement Learning team","alias":"dylanfoster"},{"type":"user_nicename","display_name":"Rafah Hosn","user_id":36783,"people_section":"Microsoft Reinforcement Learning team","alias":"raaboulh"},{"type":"user_nicename","display_name":"Akshay Krishnamurthy","user_id":30913,"people_section":"Microsoft Reinforcement Learning team","alias":"akshaykr"},{"type":"user_nicename","display_name":"John Langford","user_id":32204,"people_section":"Microsoft Reinforcement Learning team","alias":"jcl"},{"type":"user_nicename","display_name":"Paul Mineiro","user_id":33272,"people_section":"Microsoft Reinforcement Learning team","alias":"pmineiro"},{"type":"user_nicename","display_name":"Edaena Salinas Jasso","user_id":36383,"people_section":"Microsoft Reinforcement Learning team","alias":"edaenasj"},{"type":"user_nicename","display_name":"Siddhartha Sen","user_id":33656,"people_section":"Microsoft Reinforcement Learning team","alias":"sidsen"},{"type":"user_nicename","display_name":"Alex Slivkins","user_id":33685,"people_section":"Microsoft Reinforcement Learning team","alias":"slivkins"},{"type":"user_nicename","display_name":"Cheng Tan","user_id":37953,"people_section":"Microsoft Reinforcement Learning team","alias":"chetan"},{"type":"user_nicename","display_name":"Alexey Taymanov","user_id":37616,"people_section":"Microsoft Reinforcement Learning team","alias":"ataymano"}],"tab-content":[{"id":0,"name":"About","content":"<strong>Program dates:<\/strong> Summer (May \u2013 August 2022) or Fall (September \u2013 December 2022)\r\n<strong>Deadline to apply:<\/strong> April 4, 2022, 11:59 PM ET\r\n<div>\r\n[msr-button text=\"Apply\" url=\"https:\/\/webportalapp.com\/sp\/login\/2022_msr_rl_open_source_fest\" new-window=\"true\" ]\r\n<\/div>\r\n<div style=\"height: 15px\"><\/div>\r\nThe Reinforcement Learning (RL) Open Source Fest is a global online program focused on introducing students to open-source reinforcement learning programs and software development while working alongside researchers, data scientists, and engineers on the <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/real-world-reinforcement-learning\/\">Real World Reinforcement Learning<\/a> team at Microsoft Research NYC. Students will work on a four-month research programming project for either a Summer (May-August 2022) or Fall session (September \u2013 December 2022). Accepted students will receive a $10,000 USD stipend. Selected students will receive their stipend payment at the beginning of their session. Microsoft sends the payment directly to a student's academic institution, which then disperses funds according to the institution's guidelines.\r\n\r\nOur goal is to bring together a diverse group of students from around the world to collectively solve open-source reinforcement learning problems and advance the state-of-the-art research\u202fand development\u202falongside the RL community while providing open-source code written and released to benefit all.\r\n\r\nAt the end of the program, students will present each of their projects to the Microsoft Research Real World Reinforcement Learning team online.\r\n<h2>Open-source projects<\/h2>\r\n<a href=\"https:\/\/vowpalwabbit.org\/\" target=\"_blank\" rel=\"noopener\">Vowpal Wabbit (VW)<\/a> is an open-source machine learning library created by John Langford and developed by Microsoft Research with the help of many contributors. It is a fast, flexible, online, and active learning solution that empowers people to solve complex interactive machine learning problems, with a large focus on contextual bandits and reinforcement learning. It is a vehicle for both research prototyping and driving bleeding edge algorithms to production. RL OS Fest is all about open-source projects in the Vowpal Wabbit ecosystem.\r\n\r\n<a href=\"https:\/\/vowpalwabbit.org\/rlos\/2022\/projects\">View open-source projects &gt;<\/a>\r\n<h2>Eligibility<\/h2>\r\nTo be eligible for the program, students must be enrolled in or accepted into an accredited institution including colleges, universities, Master programs, PhD programs, and undergraduate programs.\r\n<h2>Student responsibilities during the program<\/h2>\r\n<ul>\r\n \t<li>Submit quality work: code compiles, has unit tests and documentation, and\u00a0passes code review<\/li>\r\n \t<li>Regularly communicate work completed, what you intend to do next, and blockers<\/li>\r\n \t<li>Re-evaluate project tasks if you\u2019re significantly ahead or behind schedule<\/li>\r\n \t<li>Regular check-ins\u00a0with your mentor\/collaborator<\/li>\r\n \t<li>Listen and respond to feedback<\/li>\r\n \t<li>Pro-active learning<\/li>\r\n<\/ul>\r\n<h2>What makes a successful project?<\/h2>\r\nSuccess looks different for every project. Challenging yourself and developing skills and knowledge are the most important part. Producing some sort of deliverable item is great, but not strictly required. We all know how development and experimentation goes, unforeseen\u00a0problems can come up and present new challenges and that's all part of the process. You'll have a mentor and support along the way.\r\n<ul>\r\n \t<li>A successful engineering-oriented project might include pull requests merging your work, a design document, tests, and general documentation<\/li>\r\n \t<li>A successful data science-oriented project might involve pull requests, reproducible experiments, datasets, a report, and visualized results<\/li>\r\n \t<li>A successful prototyping-oriented project might include an MVP, tests, and documentation<\/li>\r\n<\/ul>\r\n\r\n<strong>Contact us:<\/strong> If you have questions about this program, please send us an email at <a href=\"mailto:RLOSFEST@microsoft.com\">RLOSFEST@microsoft.com<\/a>."},{"id":1,"name":"Timeline","content":"<h2>Program Timeline<\/h2>\r\n*<em>The upcoming program dates are subject to change, and will be finalized and updated here by March 1, 2022<\/em>\r\n\r\nMarch 1, 2022 | Application period opens\r\nApril 4, 2022 | Application period closes\r\n\r\nApril 25, 2022 | Selected applicants notified\r\nMay 9, 2022| Summer projects begin\r\nAugust 15, 2022 | Summer project presentations\r\n\r\nSeptember 12, 2022| Fall projects begin\r\nDecember 2, 2022 | Fall project presentations"},{"id":2,"name":"Apply","content":"<h3>How to submit an application<\/h3>\r\nThe below outlines the information necessary to submit your application in our submission portal.\r\n<ol>\r\n \t<li><strong>Application Details<\/strong>\r\nYou will be asked to fill out the following questions:<\/li>\r\n<\/ol>\r\n<ol>\r\n \t<li style=\"list-style-type: none\">\r\n<ul>\r\n \t<li>Are you currently enrolled in an accredited university or college? (Please note, proof of enrollment will be required)<\/li>\r\n \t<li>Select your country<\/li>\r\n \t<li>Upload your resume or a document containing a list of classes completed to date<\/li>\r\n \t<li>Upload existing or past personal projects you've worked on or open source projects to which you've contributed<\/li>\r\n \t<li>Choose your preferred project from the list of <a href=\"https:\/\/vowpalwabbit.org\/rlos\/2022\/projects\">Open Source Projects<\/a><\/li>\r\n \t<li>Select a program session date (Summer session takes place May - August 2022 and Fall session takes place September - December 2022)<\/li>\r\n \t<li>Upload your proposal for the selected Open Source Project. Include why you want to work on this problem specifically. Provide a rough outline of how you plan to execute on the project selected. This should include a week-by-week plan of what you'd need to learn and the challenges you foresee.<\/li>\r\n \t<li>Complete a pre-screening exercise according to the requirements of your selected project. If your exercise requires uploading files, provide a link to a GitHub repository containing these files.<\/li>\r\n<\/ul>\r\n<\/li>\r\n<\/ol>\r\nThe below outlines important information pertaining to your application:\r\n<ul>\r\n \t<li>Proposals submitted to Microsoft will not be returned. Microsoft cannot assume responsibility for the confidentiality of information submitted in the proposal. Therefore, proposals should not contain information that is confidential, restricted, or sensitive.<\/li>\r\n \t<li>Incomplete proposals will not be considered.<\/li>\r\n \t<li>Due to the volume of submissions, Microsoft Research cannot provide individual feedback on proposals.<\/li>\r\n<\/ul>\r\n\r\n<div>\r\n[msr-button text=\"Apply\" url=\"https:\/\/webportalapp.com\/sp\/login\/2022_msr_rl_open_source_fest\" new-window=\"true\" ]\r\n<\/div>"},{"id":3,"name":"Alumni","content":"<h2>2021 Alumni<\/h2>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/muhammad-ammar-abid-3916191a3\/\" target=\"_blank\" rel=\"noopener\"><img class=\"alignleft wp-image-822565\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2022\/02\/Muhamma-Ammar-Amid-v2.jpg\" alt=\"head shot of Muhammad Ammar Abid\" width=\"175\" height=\"175\" data-wp-editing=\"1\" \/><\/a>\r\n<h3>Muhammad Ammar Abid<\/h3>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/muhammad-ammar-abid-3916191a3\/\" target=\"_blank\" rel=\"noopener\">Muhammad Ammar Abid<\/a> is a senior year undergraduate at NUCES (FAST) Peshawar campus in Pakistan. He has done more than 150 online courses, spending more than 800 hours online the past few years learning anything beneficial, new and exciting related to technology and his journey for learning still continues. Ammar's goal is to empower himself and other people through learning technology while believing the famous saying that knowledge is something nobody can take away from you. Ammar's Final Year Project is Real Time Pakistani Currency Detection for Visually Impaired aimed at solving the less accessible products' problem of currency detection in Pakistan. Ammar believes that his collaboration in Vowpal Wabbit will empower people around the globe and help them to achieve more. Ammar's future priority is to end up at a diverse and inclusive place where his collaboration can solve real world problems while empowering people in the process.\r\n\r\n<a href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=1501\" target=\"_blank\" rel=\"noopener\"><strong>Tensorboard and Tensorwatch Integration<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/EC_bViQjMy8?t=1501\" new-window=\"true\" ]<\/div>\r\nVisualization is necessary for brain to process large information. Tensorboard provides visualization and tooling necessary which can be effectively used for machine learning experimentation. Integrated in the Vowpal Wabbit ecosystem, this could help the users to focus more on the problem and extract meaningful results with help of visualization. Currently, we have extended the Vowpal Wabbit Python bindings to support outputting progress updates and model details to Tensorboard\r\n\r\n<hr \/>\r\n\r\n<img class=\"alignleft wp-image-822562\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Wilson-Cheung.jpg\" alt=\"head shot of Wilson Cheung\" width=\"175\" height=\"175\" \/>\r\n<h3>Wilson Cheung<\/h3>\r\nWilson Cheung recently completed his M.S. in Analytics from Georgia Institute of Technology. He previously worked as a data scientist at Booz Allen Hamilton where he developed many artificial intelligence and data engineering capabilities to address technological problems faced by government clients in healthcare and defense industries. He currently works as a data scientist at Amazon Web Services and is hoping to develop his professional career towards AI-based personalization using reinforcement learning and bandit-based methods learned in the participation of RLOS.\r\n\r\n<a href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=30\" target=\"_blank\" rel=\"noopener\"><strong>Extend FairLearn to include RL bias analysis using VW<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/EC_bViQjMy8?t=30\" new-window=\"true\" ]<\/div>\r\nAnalyzing outcomes of a learned policy in an online setting through contextual bandits poses many unforeseen consequences in the study of responsible AI. Particularly, given that the objective for many RL systems is to maximize accumulated rewards, any policy can potentially carry latent harms throughout the online evaluation process. By using logged contextual bandits data generated via Vowpal Wabbit under an assumed logged policy, we compute FairLearn-supported fairness metrics to identify these harms and use counterfactual analysis to assess the quality of evaluation policies. Lastly, we identify ways to mitigate the impacts of these harms under the reduced weighted multi-class classification setting.\r\n\r\n<hr \/>\r\n\r\n<img class=\"alignleft wp-image-822547\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Monika-Farsang.jpg\" alt=\"head shot of M\u00f3nika Farsang\" width=\"175\" height=\"175\" \/>\r\n<h3>M\u00f3nika Farsang<\/h3>\r\nM\u00f3nika Farsang is from Budapest, Hungary. She recently graduated from the Budapest University of Technology and Economics, where she studied Mechatronics Engineering. Her specialization was in the field of Intelligent Embedded Systems. She is passionate about solving challenging problems and likes to contribute to open-source projects, because she loves to see that science and technology can become more accessible to everyone. Her research interests are machine learning, in particular reinforcement learning, bio-inspired solutions, and robotics. She hopes to join a PhD program, where she can continue her studies in this area.\r\n\r\n<a href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=2288\" target=\"_blank\" rel=\"noopener\"><strong>Safe Contextual Bandits<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=2288\" new-window=\"true\" ]<\/div>\r\nContextual bandit algorithms optimize the mean value of the reward distribution without paying attention to worst-case scenarios. However, there are many safety-critical domains where this kind of behavior is undesirable. Consequently, our goal is to implement safe contextual bandits which focus specifically on the worst cases. To achieve this, we use conditional value at risk (CVaR), which means the expected return in the worst q% of the cases. By using this, we optimize the average of the tail instead of the average of the whole distribution to maintain safety and avoid choosing bad actions. CVaR has a dual representation mathematically, which results in off-policy learning with passing modified rewards to the contextual bandit. By leveraging this formulation, we optimize the cut-off value of the distribution online. To optimize the cut-off point, we use a no-regret algorithm called FreeGrad, which is practical because it does not contain any hyperparameter tuning. The results of our project demonstrate that better worst-case behavior can be achieved by optimizing the CVaR of the distribution compared to typical contextual bandit policies.\r\n\r\n<hr \/>\r\n\r\n<img class=\"alignleft wp-image-822553\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Nishant-Kumar.jpg\" alt=\"head shot of Nishant Kumar\" width=\"175\" height=\"175\" \/>\r\n<h3>Nishant Kumar<\/h3>\r\n\"Nishant is currently pursuing his bachelor's in Electronics Engineering from the Indian Institute of Technology (BHU), Varanasi, India. As an undergraduate researcher, most of his work has been broadly in AI, with a particular focus on Reinforcement Learning, Multiagent systems and communication, and game AI. He also specializes in writing code to develop and maintain intelligent systems. He has worked on a diverse set of problems in AI, from implementing efficient RL agents using C++ code to enabling and enhancing parallel processing capabilities in machine learning libraries. Apart from that, he likes contributing to open-source code and reading about cybersecurity, blockchain, astronomy, and human psychology. \"\r\n\r\n<a href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=660\" target=\"_blank\" rel=\"noopener\"><strong>VW Parallel parsing improvements<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=660\" new-window=\"true\" ]<\/div>\r\nVowpal Wabbit is known for its blazing-fast performance. However, VW's parsers can be a bottleneck for most operations, so an effective way to multithread the parsers is required to unleash their true potential. Last year, parallel parsing support for text input format was provided. This project builds upon that by providing a better and more efficient way to read and write cache, support for multiple passes, multiline examples, and JSON\/DsJSON input formats.\r\n\r\n<hr \/>\r\n\r\n<img class=\"alignleft wp-image-822544\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Milena-Mathew.jpg\" alt=\"head shot of Milena Mathew\" width=\"175\" height=\"175\" \/>\r\n<h3>Milena Mathew<\/h3>\r\nMilena Mathew is an undergraduate majoring in Electrical Engineering and Computer Science at UC Berkeley. She's broadly interested in the intersection of computer science and physics and has previously worked on applying machine learning techniques to problems in the natural sciences. After graduating, Milena hopes to work in industrial R&amp;D. In her free time, you can typically find her tackling the latest crossword or baking a batch of cookies.\r\n\r\n<a href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=2901\" target=\"_blank\" rel=\"noopener\"><strong>Safe Contextual Bandits<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=2901\" new-window=\"true\" ]<\/div>\r\nContextual bandit algorithms are typically designed to maximize the expected reward over time. However, in systems where there\u2019s a safety constraint, avoiding bad actions may be valued over purely maximizing reward. We aim to account for both of these goals by developing a chance-constrained policy optimizing learner. Chance constrained policy optimization takes advantage of additional observed feedback to determine the probability of a decision violating a constraint while still optimizing for reward. We implemented the learner in Coba- a contextual bandit algorithm benchmarking application- and added functionality to accommodate multiple observations. We aim to show the feasibility of this approach by comparing the amount of constraint-violating behavior in our learner versus traditional contextual bandit algorithms.\r\n\r\n<hr \/>\r\n\r\n<img class=\"alignleft wp-image-822538\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Krystal-Maughan.jpg\" alt=\"head shot of Krystal Maughan\" width=\"175\" height=\"175\" \/>\r\n<h3>Krystal Maughan<\/h3>\r\nKrystal Maughan is from Trinidad and Tobago. She is currently pursuing a PhD in Computer Science, minoring in Pure Mathematics at the University of Vermont, focusing on isogeny-based cryptography with Christelle Vincent and Joe Near. She has previously published in Fairness and Privacy for workshops at NeuRIPS and MD4SG, contributed to open source for Haskell.org for Google Summer of Code and Mozilla\u2019s RustReach and interned at Apple, Microsoft, Mercury (a Haskell fintech), and Autodesk. She graduated with a Bachelor\u2019s in Film, Photography and Visual Arts, and a double minor in Art History and Technical Theatre from Ithaca College. Before grad school, she worked in Hollywood as a lighting and camera technician for high speed lighting and camera R&amp;D tech startups, did a workshop at the Jet Propulsion Laboratory and learned to sail. She wants to continue isogeny \/ mathematical cryptography research after her PhD.\r\n\r\n<a href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=2354\" target=\"_blank\" rel=\"noopener\"><strong>Integrate Estimator Library into Azure Machine Learning (AML) Pipeline<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/EC_bViQjMy8?t=2354\" new-window=\"true\" ]<\/div>\r\nIn this work, we create a complete end-to-end pipeline for the user, from loading the data as a stream of decisions done by a reinforcement learning system and information of policies that we are trying to estimate counterfactually, running estimators from the estimators library as configured by the user, to running local and distributed computation on the compute clusters of Azure Machine Learning (AML) and visualizing the aggregated result for each estimator locally as an output of aggregated base on number of events counterfactuals for given policies\/estimators.\r\n\r\n<hr \/>\r\n\r\n<img class=\"alignleft wp-image-822535\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Jui-Pradhan.jpg\" alt=\"head shot of Jui Pradhan\" width=\"175\" height=\"175\" \/>\r\n<h3>Jui Pradhan<\/h3>\r\nJui Pradhan is a final year student pursuing B.E. Computer Science and MSc. Economics from BITS Pilani. Her current research interests involve Artificial intelligence, Federated Learning, Algorithmic Game theory and Optimization. She has worked on projects at the intersection of Reinforcement Learning, NLP and Information retrieval. Before interning at Microsoft as a Vowpal Wabbit contributor, she was a mentor at Google Summer of Code- Sugarlabs and has contributed to several open-source projects. Outside of work, she loves to paint, write and explore different forms of creativity and art. After graduating in 2023, she hopes to work on impactful AI-driven projects and work part-time on research projects at academic research labs.\r\n\r\n<a href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=2954\" target=\"_blank\" rel=\"noopener\"><strong>Integrate Estimator Library into Azure Machine Learning (AML) Pipeline<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/EC_bViQjMy8?t=2954\" new-window=\"true\" ]<\/div>\r\nThe estimator library is a collection of estimators to perform off-policy evaluation. The current prototype of the estimator library lacked a clear structure, which made it hard to install and consume. Moreover, for researchers to contribute by adding new estimators, it is imperative for them to know which interfaces to use according to their problem type(cb, ccb, slates, ca, etc). Therefore, as a part of this project, we added interfaces for each problem type, worked on CI improvements, added new estimators, added tests, structured the estimator library and finally released it to PyPI as vw-estimators. Our work will capacitate the estimator library to be consumed by AML pipeline and end-users.\r\n\r\n<hr \/>\r\n\r\n<img class=\"alignleft wp-image-822541\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Manav-Headshot.jpg\" alt=\"head shot of Manav Singhal\" width=\"175\" height=\"175\" \/>\r\n<h3>Manav Singhal<\/h3>\r\nManav Singhal\u00a0is a senior year undergraduate student at the National Institute of Technology Karnataka, India studying Electrical and Electronics Engineering. His current research interests lie in improving the deployment of reinforcement learning in real-world scenarios and increasing the interpretability of machine learning models. Besides work, Manav loves reading, traveling, and running! After graduating, he wishes to pursue his graduate studies in Computer Science.\r\n\r\n<a href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=1\" target=\"_blank\" rel=\"noopener\"><strong>Empirical Analysis of Privacy Preserving Learning<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=1\" new-window=\"true\" ]<\/div>\r\nIn many real-world learning scenarios, due to privacy constraints (for example, General Data Protection Regulation), one cannot use the user feature mapping directly for personalization. In order to uphold the privacy of the user, we aim to study the effect of using aggregated data for learning. We explore the notion of \u201caggregation\u201d by saving only those features after training that have crossed a certain threshold of users. This project focuses on comparing the performance of the model without aggregation (public model) and the model with aggregation (private model), thus understanding how much this filtering helps in the learning process.\r\n\r\n<hr \/>\r\n\r\n<img class=\"alignleft wp-image-822556\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Varun-Suryan.jpg\" alt=\"head shot of Varun Suryan\" width=\"175\" height=\"175\" \/>\r\n<h3>Varun Suryan<\/h3>\r\nVarun Suryan grew up in northern India. He is a final-year Ph.D. student in Computer Science at the University of Maryland. His interests lie in reinforcement learning (RL), multi-armed bandits, and robotics. His Ph.D. focuses on improving the sample efficiency of RL agents with the help of simulators. He attended the Indian Institute of Technology Jodhpur and Virginia Tech for his B.Tech. and MS in Mechanical Engineering and Computer Engineering respectively. Varun is passionate about technology and loves collaborating with people from various domains. In the future, he wishes to pursue his work in RL and AI. In his spare time, he runs and plays tennis.\r\n\r\n<a href=\"https:\/\/youtu.be\/EC_bViQjMy8?t=737\" target=\"_blank\" rel=\"noopener\"><strong>AutoML for Online Contextual Bandits<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/EC_bViQjMy8?t=737\" new-window=\"true\" ]<\/div>\r\nWe propose the ChaChaCB algorithm for making online feature interaction choices for contextual bandits. This is crucial in online learning services which can significantly benefit from online autoML style algorithms to automatically choose hyperparameters\/configs. Currently, most of the tuning is done manually. This problem has been studied before under the full information (supervised) setting where each configuration has access to the revealed feedback from the environment. However, bandits present unique challenges, and not every configuration gets to receive feedback from the environment. By using importance weight to update the loss bounds for a subset of configurations, ChaChaCB performs competitively with several baselines. Further, we plan to integrate ChaChaCB as a learner in Coba \u2013 a standardized framework to test contextual bandit learners.\r\n\r\n<hr \/>\r\n\r\n<a href=\"https:\/\/www.linkedin.com\/in\/vishal-v-link\/\" target=\"_blank\" rel=\"noopener\"><img class=\"alignleft wp-image-822559\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Vishal-Vinod.jpg\" alt=\"head shot of Vishal Vinod\" width=\"175\" height=\"175\" \/><\/a>\r\n<h3>Vishal Vinod<\/h3>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/vishal-v-link\/\" target=\"_blank\" rel=\"noopener\">Vishal Vinod<\/a> is a Computer Science master\u2019s student at University of California, San Diego. His current research interests are in continual learning, 3D computer vision and domain adaptation to improve the performance of autonomous systems in real-world scenarios. Vishal is strongly motivated by AI4SocialGood and aims to work on socially impactful AI research applications. Apart from research, he is involved in the open source community and reads up on developmental economics.\r\n\r\n<a href=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=1420\" target=\"_blank\" rel=\"noopener\"><strong>VW feature transformation without redeploying the source<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/Ggnuwfiwn3E?t=1420\" new-window=\"true\" ]<\/div>\r\nFeature transformation are necessary to prototype, mutate or compare the performance of a trained model. Currently, creating a feature mutation in VW requires the implementation of a new C++ reduction for each mutation, making it harder to work on new ones. This project simplifies feature modifications by implementing a generic reduction such that example modification functions can be registered with the interface and used without having to implement a reduction for each transformation. This allows mutation functions such as deleting a feature, dropout, normalization, logarithmic mutation, feature binarization, etc. to be registered on the stack for the model in memory without redeploying the source. The generic reduction enables feature engineering pipelines using only callable functions to modify the example, and also allows comparing the performance of models with and without mutations in a single run for benchmarking and avoids the step of transforming the data by other means.\r\n\r\n<hr \/>\r\n\r\n<h2>2020 Alumni<\/h2>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/magarw10\/\"><img class=\"alignleft wp-image-690198\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/milind-agarwal.jpg\" alt=\"head shot of Milind Agarwal\" width=\"175\" height=\"175\" \/><\/a>\r\n<h3>Milind Agarwal<\/h3>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/magarw10\/\" target=\"_blank\" rel=\"noopener\">Milind Agarwal<\/a> is a combined undergraduate and master's student in Computer Science at Johns Hopkins University. His current research interests are natural language processing and machine translation for low-resource and endangered language settings. Before interning at Microsoft, he previously worked in many different academic research labs at Johns Hopkins gaining experience in a wide variety of fields including NLP, machine translation, computational biology, data visualization, and software development. After graduation in 2021, he hopes to join a Ph.D. program where he can continue to work on challenging NLP problems.\r\n\r\n<a href=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=1230\" target=\"_blank\" rel=\"noopener\"><strong>Challenge: Contextual Bandit Data Visualization with Jupyter Notebooks<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=1230\" new-window=\"true\" ]<\/div>\r\nExploratory data analysis and data visualization have become an essential part of any data scientist's toolkit. Visualizations not only allow you to kickstart your analysis by easily understanding the patterns in your data but also help you visually inspect your policies to understand their behaviour. We present cb_visualize, a python-based visualization library specialized for contextual bandits features and policy visualizations. This library offers robust visualizations for data exploration, training, feature importance, and action distributions and supports common contextual-bandit dataset formats used by Vowpal Wabbit like text, JSON, and DSJSON. We hope that this toolkit will be an asset for researchers and customers alike to better present and understand their data and analyses.\r\n\r\n<hr \/>\r\n\r\n<a href=\"https:\/\/www.linkedin.com\/in\/sharad-chitlangia\/\"><img class=\"alignleft wp-image-690195\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/sharad-chitlangia.jpg\" alt=\"head shot of Sharad Chitlangia\" width=\"175\" height=\"175\" \/><\/a>\r\n<h3>Sharad Chitlangia<\/h3>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/sharad-chitlangia\/\" target=\"_blank\" rel=\"noopener\">Sharad Chitlangia<\/a> is a senior year undergraduate student at BITS Pilani Goa, where I studied Electronics. I am specializing in the field of Artificial Intelligence. I\u2019ve previously worked heavily at the intersection of Machine Learning and Systems and Explainable AI. Aside from work, I spend a lot of time in the Open Source Community and working on improving accessibility, especially in AI research.\r\n\r\n<a href=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=72\" target=\"_blank\" rel=\"noopener\"><strong>Challenge: Pushing the Limits of VW with Flatbuffers<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=72\" new-window=\"true\" ]<\/div>\r\nVowpalWabbit is known for its abilitiy to solve complex machine learning problems extremely fast. Through this project, we aim to take this ability, even further, by the introduction of Flatbuffers. Flatbuffers is an efficient cross-platform serialization library known for its memory access efficiency and speed. We develop Flatbuffer schemas, for input examples, to be able to store them as binary buffers and show a performance increase of 30%, or more compared to traditional formats.\r\n\r\n<hr \/>\r\n\r\n<a href=\"https:\/\/www.linkedin.com\/in\/harish-kamath\/\"><img class=\"alignleft wp-image-690186\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/harish-kamath.jpg\" alt=\"head shot of Harish Kamath\" width=\"175\" height=\"175\" \/><\/a>\r\n<h3>Harish Kamath<\/h3>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/harish-kamath\/\" target=\"_blank\" rel=\"noopener\">Harish Kamath<\/a> is a Computer Science\/Math undergraduate at Georgia Tech. His passions focus on reinforcement learning, generalization in learning, and making newer technologies cheaper, faster, and more accessible. Outside of work, I love playing\/watching basketball, dance, and running! Once I graduate, I hope I end up somewhere where I can make the biggest lasting impact for the most people.\r\n\r\n<a href=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=2317\" target=\"_blank\" rel=\"noopener\"><strong>Challenge: Conversion of VowpalWabbit models into ONNX format<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/CeOcNK1xSSA?t=2317\" new-window=\"true\" ]<\/div>\r\nCurrently, ONNX is the leading standard to represent machine learning models across platforms and frameworks. It describes a model as a computational graph consisting of a set of standard operators from an operator set that is constantly evolving to accommodate new types of models and operations. Being able to describe a model in ONNX format is important, as it allows for (1) models to be optimized and run across different architectures using a single runtime, and (2) it allows models created in different frameworks to interact with each other. Although other leading frameworks such as Tensorflow and Pytorch have mature tools to convert into ONNX format, VowpalWabbit today does not yet have the capability baked into the framework. This project focuses on introducing this functionality to VowpalWabbit, so that we can combine the fast model training and inference speed of VW with the representational capacity of other frameworks. We introduce new sparse operators that are used to instantiate VW regression models efficiently in ONNX format, show that you can directly translate regression and contextual bandits models with these operators, and give an example of such models being run in RLClientLib to show that they can now be ported into any inference framework.\r\n\r\n<hr \/>\r\n\r\n<a href=\"https:\/\/www.linkedin.com\/in\/cassandra-marcussen-2b13b0142\/\"><img class=\"alignleft wp-image-690183\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/cassandra-marcussen.jpg\" alt=\"head shot of Cassandra Marcussen\" width=\"175\" height=\"175\" \/><\/a>\r\n<h3>Cassandra Marcussen<\/h3>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/cassandra-marcussen-2b13b0142\/\" target=\"_blank\" rel=\"noopener\">Cassandra Marcussen<\/a> is a junior at Columbia University studying Mathematics and Computer Science. Her interests lie in artificial intelligence, theoretical computer science, and contributing to technology within these fields through efficient computing and low-level optimizations. Cassandra is enthusiastic about open source code, and has loved working on an impactful open source system such as Vowpal Wabbit. In the future, she wishes to pursue graduate studies in Computer Science.\r\n\r\n<a href=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=1177\" target=\"_blank\" rel=\"noopener\"><strong>Challenge: Parallelized Parsing<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=1177\" new-window=\"true\" ]<\/div>\r\nModern machines often utilize many threads to achieve good performance. Currently, VW uses a single thread to read in and parse input, and a single thread to learn. The parse thread presents a bottleneck, slowing down VW as a whole. By extracting the input reading into a separate thread and extending the parser to support many threads, VW can better utilize resources, achieve better performance, and have an improved design by separating logical components into independent modules. This project focuses on improving performance and design for the text input format, and also ensures compatibility with the cache input format.\r\n\r\n<hr \/>\r\n\r\n<a href=\"https:\/\/www.linkedin.com\/in\/newtonmwai\/\"><img class=\"alignleft wp-image-690192\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/newton-mwai-300x300.jpg\" alt=\"head shot of Newton Mwai\" width=\"175\" height=\"175\" \/><\/a>\r\n<h3>Newton Mwai Kinyanjui<\/h3>\r\n<a href=\"https:\/\/www.linkedin.com\/in\/newtonmwai\/\" target=\"_blank\" rel=\"noopener\">Newton Mwai Kinyanjui<\/a> is from Nairobi Kenya. I'm currently pursuing my Ph.D. at Chalmers University of Technology in Sweden, working in causal inference and reinforcement learning towards machine learning for improved decision making in healthcare with Fredrik Johansson. I graduated from Carnegie Mellon University Africa with a Master of Science in Electrical and Computer Engineering.\r\n\r\n<a href=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=248\" target=\"_blank\" rel=\"noopener\"><strong>Challenge: Library of contextual bandit estimators<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=248\" new-window=\"true\" ]<\/div>\r\nEstimators are used in off-policy evaluation. One common estimator is IPS, and others are DR and PseudoInverse. These estimators work better or worse in different settings. This project explores reference implementations of each and allows for comparison between them to aid in understanding. We extend the estimators library and implement an interface to help researchers and data scientists test different estimators quickly and easily.\r\n\r\n<hr \/>\r\n\r\n<a href=\"https:\/\/markrucker.net\/\"><img class=\"alignleft wp-image-690189\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/09\/mark-rucker.jpg\" alt=\"head shot of Mark Rucker\" width=\"175\" height=\"175\" \/><\/a>\r\n<h3>Mark Rucker<\/h3>\r\n<a href=\"https:\/\/markrucker.net\/\" target=\"_blank\" rel=\"noopener\">Mark Rucker<\/a> is currently a 2nd year PhD student at the University of Virginia with a previous 8 year career as an enterprise software engineer. Mark's PhD research explores how reinforcement learning models can be used to encourage health behavior change in individuals managing chronic health conditions. This research combines state of the art machine learning with web and mobile app development to support in-situ randomized control trials of behavior change interventions. After graduation Mark hopes to once again return to industry in order to develop high-quality products that deeply impact people's lives.\r\n\r\n<a href=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=2209\" target=\"_blank\" rel=\"noopener\"><strong>Challenge: COBA: A Modern Benchmarking Package for Reproducible Contextual Bandit Research<\/strong><\/a>\r\n<div>[msr-button text=\"Watch demo\" url=\"https:\/\/youtu.be\/sDxhRkUPfwI?t=2209\" new-window=\"true\" ]<\/div>\r\nPerformance benchmarking on well-defined problems is a pillar of modern machine learning research. With clear problems and metrics, benchmarking has allowed the research community to maintain a high-level of independent effort while still making real and meaningful progress over time. The elegance of benchmarking -- define, measure, repeat -- however, belies real engineering challenges such as software maintenance, data distribution, statistical aggregation, and reproducibility to name a few. These challenges are especially salient in contextual bandit research where one not only needs a data set but also a harness to emulate interaction with the data. In an effort to reduce these burdens, while not losing any of benchmarking's benefits, we present COBA, an ultra light-weight Python package for benchmarking contextual bandit algorithms. COBA uses a small set of clean and consistent interfaces to satisfy four core use cases: (1) creating reproducible benchmarks, (2) sharing reproducible benchmarks, (3) evaluating custom algorithms, and (4) exploring evaluation results."},{"id":4,"name":"FAQ","content":"<h3>Frequently asked questions<\/h3>\r\n[accordion]\r\n[panel header=\"Is RL Open Source Fest considered an internship, a job, or any form of employment?\"]\r\nNo. RL Open Source Fest is an activity that the student performs as an independent developer in collaboration with the real world reinforcement learning team at Microsoft Research, for which they are paid a stipend.\r\n[\/panel]\r\n[panel header=\"Can I\u202fwork on this while doing an internship\/working this summer?\"]\r\nYes. Think of this as a fun project on the side.\r\n[\/panel]\r\n[panel header=\"What is the time commitment?\"]\r\nAs much as you like!\r\n[\/panel]\r\n[panel header=\"Where does RL Open Source Fest occur?\"]\r\nThe program occurs entirely online. There is no requirement to travel as part of the program.\r\n[\/panel]\r\n[panel header=\"What are the eligibility requirements for participation?\"]\r\nYou must be currently enrolled in an accredited academic institute's undergraduate, Masters, or PhD program. Check out the application process <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-academic-program&amp;p=633669&amp;secret=l5MD9h#!apply\">here<\/a>.\r\n[\/panel]\r\n[\/accordion]\r\n\r\nAdditional questions? Feel free to send them to us at <a href=\"mailto:RLOSFEST@microsoft.com\">RLOSFEST@microsoft.com<\/a>."}],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-academic-program\/633669","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-academic-program"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-academic-program"}],"version-history":[{"count":100,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-academic-program\/633669\/revisions"}],"predecessor-version":[{"id":1016904,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-academic-program\/633669\/revisions\/1016904"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/718090"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=633669"}],"wp:term":[{"taxonomy":"msr-opportunity-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-opportunity-type?post=633669"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=633669"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=633669"},{"taxonomy":"msr-program-audience","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-program-audience?post=633669"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=633669"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=633669"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}