{"id":4882,"date":"2015-06-08T09:00:33","date_gmt":"2015-06-08T16:00:33","guid":{"rendered":"https:\/\/blogs.msdn.microsoft.com\/msr_er\/?p=4882"},"modified":"2016-07-20T07:29:16","modified_gmt":"2016-07-20T14:29:16","slug":"microsofts-rick-szeliski-previews-cvpr-2015","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsofts-rick-szeliski-previews-cvpr-2015\/","title":{"rendered":"Microsoft&#8217;s Rick Szeliski previews CVPR 2015"},"content":{"rendered":"<p>I read through some\u00a0of the papers to be presented at CVPR 2015 this week and noticed interesting trends emerging. The opening session addresses two of the most exciting and active areas of research within computer vision, namely deep learning and modeling from depth cameras.<\/p>\n<p>The session on deep learning includes papers that show how deep convolution networks can be extended to perform per-pixel segmentation, how to improve performance with various warping and architectural enhancements, the invariance (and other properties) of deep network layers, and how to &#8220;fool&#8221; deep neural nets with blatantly impossible images (or reconstruct plausible inputs). Deep learning papers are scattered throughout the rest of the conference, and without double are the most active areas of research in computer vision at the moment.<\/p>\n<p>In 3D modeling from depth camera images, there are papers on modeling and tracking moving deformable objects (as opposed to the usual case of static scenes). Several papers advance the state of the art in recovering 3D models from single (monocular) images, most commonly using prior knowledge about interior or exterior architectural (and furniture) layout.<\/p>\n<p>Another area closely related to visual object detection and recognition that has also received wide coverage in advance of the conference, is language and vision, which includes automatic image caption generation.\u00a0This year\u2019s CVPR conference presents several\u00a0papers from different institutions on this topic.\u00a0An <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"https:\/\/www.codalab.org\/competitions\/3221#results\" target=\"_blank\">online evaluation<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> shows the current leaderboard results. 3D models and pose estimation also continue to be used as part of object detection and recognition systems.<\/p>\n<p>Traditional computer vision algorithms continue to improve, both in terms of speed and accuracy. For example, an optic flow algorithm that used principal component analysis (PCA) or overlapping parametric models followed by a layered assignment of pixels produces high quality and\/or dramatically faster speeds.\u00a0Another algorithm, based on first matching salient edges, outperforms all previously developed algorithms (on the Sintel data set) by a wide margin.\u00a0Optic flow algorithms are also being developed for depth map videos,\u00a0otherwise known as scene flow.<\/p>\n<p>In stereo matching, improvements can be obtained by classifying the expected orientation of surfaces or by using an alternative regularization term that is invariant to surface parameterization.\u00a03D reconstruction from field cameras (consisting of many small lenslets) also continues to advance with better small baseline correspondence methods, layered models, and photometric shading constraints.<\/p>\n<p>Note:\u00a0I don\u2019t work in the areas of object detection tracking, action and pose recognition, face recognition and tracking, and lots of others, so I didn\u2019t look at any of these papers.<\/p>\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/RickSzeliski.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-4902\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/RickSzeliski.jpg\" alt=\"RickSzeliski\" width=\"208\" height=\"208\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/RickSzeliski.jpg 208w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/RickSzeliski-150x150.jpg 150w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2015\/06\/RickSzeliski-180x180.jpg 180w\" sizes=\"auto, (max-width: 208px) 100vw, 208px\" \/><\/a><strong>Rick Szeliski<\/strong> leads the\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.research.microsoft.com\/IVM\/\" target=\"_blank\">Interactive Visual Media Group<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0at\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.research.microsoft.com\/\" target=\"_blank\">Microsoft Research<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and is an Affiliate Professor at the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.cs.washington.edu\/\" target=\"_blank\">University of Washington<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. His research interests include using vision to automatically build 3-D models from images, computational photography, and image-based rendering.<\/p>\n<p>For more computer science research news, visit <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" href=\"http:\/\/www.researchnews.com\/\" target=\"_blank\">ResearchNews.com<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<span style=\"font-size: 13.0pt;font-family: 'Calibri',sans-serif;color: #333333\"><br \/>\n<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>I read through some\u00a0of the papers to be presented at CVPR 2015 this week and noticed interesting trends emerging. The opening session addresses two of the most exciting and active areas of research within computer vision, namely deep learning and modeling from depth cameras. The session on deep learning includes papers that show how deep [&hellip;]<\/p>\n","protected":false},"author":32627,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[],"msr_hide_image_in_river":0,"footnotes":""},"categories":[194471,194459],"tags":[194526,195229,186925,195945,193504,197640],"research-area":[],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-4882","post","type-post","status-publish","format-standard","hentry","category-computer-vision","category-researchnews","tag-3d-modeling","tag-cvpr-2015","tag-deep-learning","tag-interactive-visual-media-group","tag-microsoft-research","tag-university-of-washington-uw","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","byline":"","formattedDate":"June 8, 2015","formattedExcerpt":"I read through some\u00a0of the papers to be presented at CVPR 2015 this week and noticed interesting trends emerging. The opening session addresses two of the most exciting and active areas of research within computer vision, namely deep learning and modeling from depth cameras. The&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/4882","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/32627"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=4882"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/4882\/revisions"}],"predecessor-version":[{"id":260787,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/4882\/revisions\/260787"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=4882"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=4882"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=4882"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=4882"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=4882"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=4882"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=4882"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=4882"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=4882"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=4882"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=4882"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}