{"id":3281,"date":"2015-06-11T10:30:00","date_gmt":"2015-06-11T10:30:00","guid":{"rendered":"https:\/\/blogs.technet.microsoft.com\/inside_microsoft_research\/2015\/06\/11\/microsoft-researchers-tie-for-best-image-captioning-technology\/"},"modified":"2016-07-20T07:29:15","modified_gmt":"2016-07-20T14:29:15","slug":"microsoft-researchers-tie-for-best-image-captioning-technology","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-researchers-tie-for-best-image-captioning-technology\/","title":{"rendered":"Microsoft researchers tie for best image captioning technology"},"content":{"rendered":"<p class=\"posted-by\">Posted by <span class=\"author\">Allison Linn<\/span><\/p>\n<p><img decoding=\"async\" src=\"https:\/\/msdnshared.blob.core.windows.net\/media\/TNBlogsFS\/prod.evol.blogs.technet.com\/CommunityServer.Blogs.Components.WeblogFiles\/00\/00\/00\/90\/35\/baseball-player-3-frames-550.png\" alt=\" \" style=\"margin-left:auto;margin-right:auto;vertical-align:middle\" \/><\/p>\n<p>Researchers representing Microsoft and Google will present their latest advances Friday in <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/blogs.microsoft.com\/next\/2015\/05\/28\/picture-this-microsoft-research-project-can-interpret-caption-photos\/\" title=\"automated image captioning\" target=\"_blank\">automated image captioning<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, a hot field that could have broad implications for <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/about\/our-research\/machine-learning.aspx\" title=\"Machine Learning and Artificial Intelligence research at Microsoft\" target=\"_blank\">artificial intelligence<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n<p>The researchers will be speaking at a workshop that is part of <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.pamitc.org\/cvpr15\/\" title=\"CVPR\" target=\"_blank\">CVPR<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, an annual conference on the most cutting-edge advances in <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/about\/our-research\/computer-vision.aspx\" title=\"Computer Vision research at Microsoft\" target=\"_blank\">computer vision<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> research. The <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/lsun.cs.princeton.edu\/#schedule\" title=\"workshop\" target=\"_blank\">workshop<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> is highlighting the winners of several image-related challenges.<\/p>\n<p>The two companies&rsquo; research groups <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/mscoco.org\/dataset\/#leaderboard-cap\" title=\"tied for first place\" target=\"_blank\">tied for first place<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> in the recent <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/mscoco.org\/dataset\/#cap2015\" title=\"MS COCO Image Captioning Challenge 2015\" target=\"_blank\">MS COCO Image Captioning Challenge 2015<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. There were 15 submissions from top universities and industrial research labs vying to automatically create the most informative and interesting captions.<\/p>\n<p>The winners were decided based on two main metrics: The share of captions that were equal to or better than a caption written by a person, and the share of captions that would pass a Turing test.<\/p>\n<p>The <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/en.wikipedia.org\/wiki\/Turing_test\" title=\"Wikipedia: Turing test\" target=\"_blank\">Turing test<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, named after a paper published by Alan Turing in 1950, is a test of whether a human would believe something generated by a computer was actually written by a human.<\/p>\n<p>The Microsoft team outperformed competitors on the Turing test element, while the Google team won for the share of captions that were as good, or better, than what people could produce.<\/p>\n<p>The field of automated image captioning has exploded since researchers hit upon the idea of using neural networks, which are computing elements that are modeled loosely after the human brain, to connect vision to language.<\/p>\n<p>Many researchers see image captioning as the basis for more sophisticated artificial intelligence systems that can see, hear, speak and even understand.<\/p>\n<p><strong>Related:<\/strong><\/p>\n<ul>\n<li>Research paper: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/research.microsoft.com\/en-us\/projects\/image_captioning\/\" title=\"From Captions to Visual Concepts and Back\" target=\"_blank\">From Captions to Visual Concepts and Back<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<li><a href=\"\/b\/inside_microsoft_research\/archive\/2015\/06\/08\/microsoft-researchers-accelerate-computer-vision-accuracy-and-improve-3d-scanning-models.aspx\" title=\"Microsoft researchers accelerate computer vision accuracy and improve 3D scanning models\">Microsoft researchers accelerate computer vision accuracy and improve 3D scanning models<\/a><\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/blogs.microsoft.com\/next\/2015\/05\/28\/picture-this-microsoft-research-project-can-interpret-caption-photos\/\" title=\"Picture this: Microsoft research project can interpret, caption photos\" target=\"_blank\">Picture this: Microsoft research project can interpret, caption photos<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<\/ul>\n<p><em>Allison Linn is a senior writer at Microsoft Research. <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"https:\/\/x.com\/allisondlinn\" title=\"Follow Allison on Twitter\" target=\"_blank\">Follow Allison on Twitter<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Posted by Allison Linn Researchers representing Microsoft and Google will present their latest advances Friday in automated image captioning, a hot field that could have broad implications for artificial intelligence. The researchers will be speaking at a workshop that is part of CVPR, an annual conference on the most cutting-edge advances in computer vision research. [&hellip;]<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[],"msr_hide_image_in_river":0,"footnotes":""},"categories":[1],"tags":[187359,200567,186897,201155,202927,204357],"research-area":[],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-3281","post","type-post","status-publish","format-standard","hentry","category-research-blog","tag-artificial-intelligence","tag-automated-image-captioning","tag-computer-vision","tag-cvpr","tag-ms-coco-image-captioning-challenge-2015","tag-turing-test","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","byline":"","formattedDate":"June 11, 2015","formattedExcerpt":"Posted by Allison Linn Researchers representing Microsoft and Google will present their latest advances Friday in automated image captioning, a hot field that could have broad implications for artificial intelligence. The researchers will be speaking at a workshop that is part of CVPR, an annual&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/3281","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=3281"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/3281\/revisions"}],"predecessor-version":[{"id":260781,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/3281\/revisions\/260781"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=3281"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=3281"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=3281"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=3281"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=3281"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=3281"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=3281"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=3281"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=3281"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=3281"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=3281"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}