{"id":734461,"date":"2021-02-25T13:00:28","date_gmt":"2021-02-25T21:00:28","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-research-item&#038;p=734461"},"modified":"2021-03-17T13:20:25","modified_gmt":"2021-03-17T20:20:25","slug":"microsoft-vision-model-resnet-50-pretrained-vision-model-built-with-web-scale-data","status":"publish","type":"msr-video","link":"https:\/\/www.microsoft.com\/en-us\/research\/video\/microsoft-vision-model-resnet-50-pretrained-vision-model-built-with-web-scale-data\/","title":{"rendered":"Microsoft Vision Model ResNet-50: Pretrained vision model built with web-scale data"},"content":{"rendered":"<p>Pretrained computer vision models, combined with transfer learning, can dramatically bring down the cost and time it takes to build a model that performs a vision task, such as image classification, object detection, and image retrieval. The Microsoft Vision Model ResNet-50 is a large pretrained vision model, built with Microsoft Bing web-scale image data, that sets state-of-the-art across seven popular computer vision benchmarks.<\/p>\n<p>In this webinar, join Junwon Park and Zygmunt Lenyk, the Program Manager and the Software Engineer behind the Microsoft Vision Model respectively, to learn how the team built Microsoft Vision Model using multi-task learning and web-supervised datasets. You\u2019ll examine how the model lowers cost in a production setting while achieving state-of-the-art performance by using multi-task learning and optimizing separately for multiple classification tasks, including ImageNet-22k, COCO, and two web-supervised datasets of 40 million image-label pairs collected from image search engines.<\/p>\n<p>The researchers will take you through installing and using the Microsoft Vision Model to build an image classification model. Then, they will go over how Microsoft Vision Model uses multi-task learning and web-supervision to achieve a robust performance across multiple computer vision benchmarks.<\/p>\n<p>Together, you\u2019ll explore:<\/p>\n<ul>\n<li>How Microsoft Bing leverages Microsoft Vision Model to achieve cost savings and accuracy improvement simultaneously.<\/li>\n<li>How we built the model using multi-task learning and web-supervision.<\/li>\n<li>How you can download and use Microsoft Vision Model to build an image classifier model for any scenario.<\/li>\n<\/ul>\n<p><strong>Resource list:<\/strong><\/p>\n<ul>\n<li>Project Page: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/aka.ms\/microsoftvision\">https:\/\/aka.ms\/microsoftvision<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>Career Page: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/aka.ms\/bingmultimediajobs\">https:\/\/aka.ms\/bingmultimediajobs<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li>Announcement Blog: <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-vision-model-resnet-50-combines-web-scale-data-and-multi-task-learning-to-achieve-state-of-the-art\/\">Microsoft Vision Model ResNet-50 combines web-scale data and multi-task learning to achieve state of the art<\/a><\/li>\n<\/ul>\n<p>*This on-demand webinar features a previously recorded Q&A session and open captioning.<\/p>\n<p>Explore more <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" target=\"_blank\" href=\"https:\/\/aka.ms\/msrwebinars\">Microsoft Research webinars ><span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Pretrained computer vision models, combined with transfer learning, can dramatically bring down the cost and time it takes to build a model that performs a vision task, such as image classification, object detection, and image retrieval. The Microsoft Vision Model ResNet-50 is a large pretrained vision model, built with Microsoft Bing web-scale image data, that [&hellip;]<\/p>\n","protected":false},"featured_media":734464,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr_hide_image_in_river":0,"footnotes":""},"research-area":[13556,13562,13555],"msr-video-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-session-type":[],"msr-impact-theme":[],"msr-pillar":[],"msr-episode":[],"msr-research-theme":[],"class_list":["post-734461","msr-video","type-msr-video","status-publish","has-post-thumbnail","hentry","msr-research-area-artificial-intelligence","msr-research-area-computer-vision","msr-research-area-search-information-retrieval","msr-locale-en_us"],"msr_download_urls":"","msr_external_url":"https:\/\/youtu.be\/oQqxkO3BN3Q","msr_secondary_video_url":"","msr_video_file":"","_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video\/734461","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-video"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video\/734461\/revisions"}],"predecessor-version":[{"id":734467,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video\/734461\/revisions\/734467"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/734464"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=734461"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=734461"},{"taxonomy":"msr-video-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-video-type?post=734461"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=734461"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=734461"},{"taxonomy":"msr-session-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-session-type?post=734461"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=734461"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=734461"},{"taxonomy":"msr-episode","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-episode?post=734461"},{"taxonomy":"msr-research-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-research-theme?post=734461"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}