{"id":3271,"date":"2015-06-08T09:00:00","date_gmt":"2015-06-08T09:00:00","guid":{"rendered":"https:\/\/blogs.technet.microsoft.com\/inside_microsoft_research\/2015\/06\/08\/microsoft-researchers-accelerate-computer-vision-accuracy-and-improve-3d-scanning-models\/"},"modified":"2016-07-20T07:29:16","modified_gmt":"2016-07-20T14:29:16","slug":"microsoft-researchers-accelerate-computer-vision-accuracy-and-improve-3d-scanning-models","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-researchers-accelerate-computer-vision-accuracy-and-improve-3d-scanning-models\/","title":{"rendered":"Microsoft researchers accelerate computer vision accuracy and improve 3D scanning models"},"content":{"rendered":"<p class=\"posted-by\">Posted by <span class=\"author\">George Thomas Jr.<\/span><\/p>\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"CVPR Boston\" href=\"http:\/\/www.pamitc.org\/cvpr15\/\" target=\"_blank\"><img decoding=\"async\" style=\"float: right; margin: 6px 8px;\" title=\"CVPR Boston\" src=\"https:\/\/msdnshared.blob.core.windows.net\/media\/TNBlogsFS\/prod.evol.blogs.technet.com\/CommunityServer.Blogs.Components.WeblogFiles\/00\/00\/00\/90\/35\/cvpr-boston-logo.PNG\" alt=\"CVPR Boston\" \/><span class=\"sr-only\"> (opens in new tab)<\/span><\/a>You may already have heard about the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Microsoft technology that can automatically identify objects in a picture and write an accurate caption\" href=\"http:\/\/blogs.microsoft.com\/next\/2015\/05\/28\/picture-this-microsoft-research-project-can-interpret-caption-photos\/\" target=\"_blank\">Microsoft technology that can automatically identify objects in a picture and write an accurate caption<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> for it, but those types of research advancements don\u2019t occur in a vacuum.<\/p>\n<p>Indeed, interdisciplinary research combining <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Computer Vision research at Microsoft\" href=\"http:\/\/research.microsoft.com\/en-us\/about\/our-research\/computer-vision.aspx\" target=\"_blank\">computer vision<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Machine Learning and Artificial Intelligence research at Microsoft\" href=\"http:\/\/research.microsoft.com\/en-us\/about\/our-research\/machine-learning.aspx\" target=\"_blank\">machine learning, artificial intelligence<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Computer Systems and Networking research at Microsoft\" href=\"http:\/\/research.microsoft.com\/en-us\/about\/our-research\/computer-systems-networking.aspx\" target=\"_blank\">computer systems and networking<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> are just some of <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Microsoft's research areas\" href=\"http:\/\/research.microsoft.com\/en-us\/about\/our-research\/default.aspx\" target=\"_blank\">Microsoft\u2019s research areas<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> at the core of the burgeoning field commonly referred to as \u201c<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Wikipedia: Deep learning\" href=\"https:\/\/en.wikipedia.org\/wiki\/Deep_learning\" target=\"_blank\">deep learning<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.\u201d Advancements in deep learning technology are fundamental to Microsoft\u2019s mission to empower every person and organization on the planet to achieve more.<\/p>\n<p>Deep learning also is fundamental to a bevy of research being presented this week at the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"28th IEEE Conference on Computer Vision and Pattern Recognition (CVPR) in Boston\" href=\"http:\/\/www.pamitc.org\/cvpr15\/\" target=\"_blank\">28th IEEE Conference on Computer Vision and Pattern Recognition (CVPR) in Boston<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n<p>The latest breakthroughs include dramatic speed improvements that accelerate computer vision image recognition and new algorithms that improve the clarity of 3D-scanned images using <a title=\"Kinect\" href=\"https:\/\/www.microsoft.com\/en-us\/kinectforwindows\/\" target=\"_blank\">Kinect<\/a> or Kinect-like sensors.<\/p>\n<p>In <em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Convolutional Neural Networks at Constrained Time Cost\" href=\"http:\/\/arxiv.org\/pdf\/1412.1710v1.pdf\" target=\"_blank\">Convolutional Neural Networks at Constrained Time Cost<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(324 KB .pdf), lead researcher <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Kaiming He\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/kahe\/\" target=\"_blank\">Kaiming He<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and principal researcher <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jian Sun\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jiansun\/\" target=\"_blank\">Jian Sun<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> address the issue of time-consuming computations required by continuous advancements in computer vision image classification accuracy. They propose models that are faster and more accurate than existing fast models and also practical for widespread use.<\/p>\n<p>Sun and He also collaborated with researchers from <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Xi&rsquo;an Jiaotong University\" href=\"http:\/\/en.xjtu.edu.cn\/\" target=\"_blank\">Xi\u2019an Jiaotong University<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> on <em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Efficient and Accurate Approximations of Nonlinear Convolutional Networks\" href=\"http:\/\/arxiv.org\/pdf\/1411.4229v1.pdf\" target=\"_blank\">Efficient and Accurate Approximations of Nonlinear Convolutional Networks<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(541 KB .pdf), which proposes a method that accelerates such <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Wikipedia: Convolutional neural networks\" href=\"https:\/\/en.wikipedia.org\/wiki\/Convolutional_neural_network\" target=\"_blank\">networks<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> by as much as four times with an error rate of less than 1 percent.<\/p>\n<p>At CVPR, Microsoft researchers also will present advancements in 3D digitization and 3D scanning using Kinect and Kinect-like sensors.<\/p>\n<p>In <em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Large-Scale and Drift-Free Surface Reconstruction\" href=\"http:\/\/www.cv-foundation.org\/openaccess\/content_cvpr_2015\/papers\/Fioraio_Large-Scale_and_Drift-Free_2015_CVPR_paper.pdf\" target=\"_blank\">Large-Scale and Drift-Free Surface Reconstruction<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(9.6 MB .pdf), researcher <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jonathan Taylor\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jota\/\" target=\"_blank\">Jonathan Taylor<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and principal researchers <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Andrew Fitzgibbon\" href=\"http:\/\/research.microsoft.com\/en-us\/um\/people\/awf\/\" target=\"_blank\">Andrew Fitzgibbon<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Shahram Izadi\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/shahrami\/\" target=\"_blank\">Shahram Izadi<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> collaborated with researchers from the University of Bologna to introduce a method of large-scale 3D scanning that computes in minutes, not hours, and works even in low-lighting conditions or other challenging conditions such as in complete darkness.<\/p>\n<p>&#8220;As shown by the body of work at CVPR, the Kinect has accelerated research on 3D scanning to the point now where even capturing models of moving scenes or large-scale scenes is possible,&#8221; Izadi said.<\/p>\n<p>Microsoft researchers also are presenting new research that significantly improves scans of objects that are in motion. In <em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"3D Scanning Deformable Objects with a Single RGBD Sensor\" href=\"http:\/\/www.cv-foundation.org\/openaccess\/content_cvpr_2015\/papers\/Dou_3D_Scanning_Deformable_2015_CVPR_paper.pdf\" target=\"_blank\">3D Scanning Deformable Objects with a Single RGBD Sensor<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(10.1 MB .pdf), Taylor, Fitzgibbon and Izadi collaborated with researchers from the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"University of North Carolina at Chapel Hill\" href=\"http:\/\/www.unc.edu\/\" target=\"_blank\">University of North Carolina at Chapel Hill<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> to develop a scanning method that uses only a single Kinect sensor without heavily constraining user or camera motion.<\/p>\n<p>&#8220;The logical next step is to use these models for recognition and bring the worlds of deep learning and reconstruction together,&#8221; Izadi added. &#8220;This brings us closer to computers that understand the user and their environments in much richer ways.&#8221;<\/p>\n<h2>Additional research presented at the 28th IEEE Conference on Computer Vision and Pattern Recognition<\/h2>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Learning an Efficient Model of Hand Shape Variation From Depth Images\" href=\"http:\/\/research-srv.microsoft.com\/pubs\/244775\/HandShapeVariation.pdf\" target=\"_blank\">Learning an Efficient Model of Hand Shape Variation From Depth Images<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/em>\u00a0(1.9 MB.pdf)<br \/>\nContributing Microsoft researchers: Sameh Khamis, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jonathan Taylor\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jota\/\" target=\"_blank\">Jonathan Taylor<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jamie Shotton\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jamiesho\/\" target=\"_blank\">Jamie Shotton<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Cem Keskin\" href=\"https:\/\/www.linkedin.com\/pub\/cem-keskin\/15\/92a\/236\" target=\"_blank\">Cem Keskin<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Shahram Izadi\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/shahrami\/\" target=\"_blank\">Shahram Izadi<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Andrew Fitzgibbon\" href=\"http:\/\/research.microsoft.com\/en-us\/um\/people\/awf\/\" target=\"_blank\">Andrew Fitzgibbon<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nA new method of scanning human hands to generate a low-dimensional generic hand model, using machine learning.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Exploiting Uncertainty in Regression Forests for Accurate Camera Relocalization\" href=\"http:\/\/www.robots.ox.ac.uk\/~tvg\/publications\/2015\/relocalization_camera_ready_submitted.pdf\" target=\"_blank\">Exploiting Uncertainty in Regression Forests for Accurate Camera Relocalization<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em> (875 KB .pdf)<br \/>\nContributing Microsoft researchers: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jamie Shotton\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jamiesho\/\" target=\"_blank\">Jamie Shotton<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Andrew Fitzgibbon\" href=\"http:\/\/research.microsoft.com\/en-us\/um\/people\/awf\/\" target=\"_blank\">Andrew Fitzgibbon<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Shahram Izadi\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/shahrami\/\" target=\"_blank\">Shahram Izadi<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nPresents a new method that improves camera relocalization by up to 40% more frames than the current state of the art.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"A Light Transport Model for Mitigating Multipath Interference in Time-of-Flight Sensors\" href=\"http:\/\/web.media.mit.edu\/~achoo\/temp\/global_direct_cvpr15.pdf\" target=\"_blank\">A Light Transport Model for Mitigating Multipath Interference in Time-of-Flight Sensors<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(3.0 MB .pdf)<br \/>\nContributing Microsoft researchers: Nikhil Naik, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Christoph Rhemann\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/chrheman\/\" target=\"_blank\">Christoph Rhemann<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Shahram Izadi\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/shahrami\/\" target=\"_blank\">Shahram Izadi<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Sing Bing Kang\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/sbkang\/\" target=\"_blank\">Sing Bing Kang<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nPresents a method for correcting multi-path interference in the time-of-flight-based Kinect sensor in the Xbox One camera, using a new computational camera technique.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Computationally Bounded Retrieval\" href=\"http:\/\/research.microsoft.com\/en-us\/um\/people\/pkohli\/papers\/rkki_cvpr2015.pdf\" target=\"_blank\">Computationally Bounded Retrieval<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(522 KB .pdf)<br \/>\n<em>Contributing Microsoft researchers: Cem Keskin, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Pushmeet Kohli\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/pkohli\/\" target=\"_blank\">Pushmeet Kohli<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>,\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Shahram Izadi\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/shahrami\/\" target=\"_blank\">Shahram Izadi<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/em><br \/>\nA new method for image search retrieval that shows improvements in accuracy and speed over current methods.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"A Geodesic-Preserving Method for Image Warping\" href=\"http:\/\/www.cv-foundation.org\/openaccess\/content_cvpr_2015\/papers\/Li_A_Geodesic-Preserving_Method_2015_CVPR_paper.pdf\" target=\"_blank\">A Geodesic-Preserving Method for Image Warping<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(8.4 MB .pdf)<br \/>\nContributing Microsoft researchers: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Kaiming He\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/kahe\/\" target=\"_blank\">Kaiming He<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jian Sun\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jiansun\/\" target=\"_blank\">Jian Sun<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nA new method that improves the visual quality of panoramic and wide-angle images.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Cascaded Hand Pose Regression\" href=\"http:\/\/research.microsoft.com:8082\/en-US\/people\/yichenw\/cvpr15_handposeregression.pdf\" target=\"_blank\">Cascaded Hand Pose Regression<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(1.1 MB.pdf)<br \/>\nContributing Microsoft researchers: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Yichen Wei\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/yichenw\/\" target=\"_blank\">Yichen Wei<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>,\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jian Sun\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jiansun\/\" target=\"_blank\">Jian Sun<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nA novel approach that demonstrates accurate, high-speed hand tracking using consumer depth sensors.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Learning a Convolutional Neural Network for Non-Uniform Motion Blur Removal\" href=\"http:\/\/arxiv.org\/pdf\/1503.00593v3.pdf\" target=\"_blank\">Learning a Convolutional Neural Network for Non-Uniform Motion Blur Removal<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(9.4 MB .pdf)<br \/>\nContributing Microsoft researcher: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jian Sun\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jiansun\/\" target=\"_blank\">Jian Sun<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nProposes a deep learning-based approach to correcting non-uniform motion blur in images.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Sparse Projections for High-Dimensional Binary Codes\" href=\"http:\/\/research.microsoft.com\/en-us\/um\/people\/kahe\/publications\/cvpr15spbe.pdf\" target=\"_blank\">Sparse Projections for High-Dimensional Binary Codes<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0 <\/em>(304 KB .pdf)<br \/>\nContributing Microsoft researchers: Yan Xia, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Kaiming He\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/kahe\/\" target=\"_blank\">Kaiming He<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Pushmeet Kohli\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/pkohli\/\" target=\"_blank\">Pushmeet Kohli<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jian Sun\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jiansun\/\" target=\"_blank\">Jian Sun<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nA new method that increases accuracy and speed of image retrieval and image classification by an order of magnitude.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Convolutional Feature Masking for Joint Object and Stuff Segmentation\" href=\"http:\/\/arxiv.org\/pdf\/1412.1283v4.pdf\" target=\"_blank\">Convolutional Feature Masking for Joint Object and Stuff Segmentation<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(2.8 MB .pdf)<br \/>\nContributing Microsoft researchers: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jifeng Dai\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jifdai\/\" target=\"_blank\">Jifeng Dai<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Kaiming He\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/kahe\/\" target=\"_blank\">Kaiming He<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>,\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jian Sun\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jiansun\/\" target=\"_blank\">Jian Sun<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nA new method that demonstrates state-of-the-art in object recognition and labeling at fast speeds.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Global Refinement of Random Forest\" href=\"http:\/\/home.ustc.edu.cn\/~sqren\/RefineRF\/ImprovingRF_cvpr2015.pdf\" target=\"_blank\">Global Refinement of Random Forest<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(627 KB .pdf)<br \/>\nContributing Microsoft researchers: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Xudong Cao\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/xudongca\/\" target=\"_blank\">Xudong Cao<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Yichen Wei\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/yichenw\/\" target=\"_blank\">Yichen Wei<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>,\u00a0<a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Jian Sun\" href=\"http:\/\/research.microsoft.com\/en-us\/people\/jiansun\/\" target=\"_blank\">Jian Sun<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nProposes two new methods within machine learning &#8212; global refinement and global pruning &#8212; that both greatly improves accuracy and reduces the storage needed for the random forest learning method.<\/p>\n<p><em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Light Field Layer Matting\" href=\"http:\/\/research.microsoft.com\/pubs\/244365\/Fiss-LFLM-CVPR15.pdf\" target=\"_blank\">Light Field Layer Matting<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0<\/em>(2.7 MB.pdf)<br \/>\nContributing Microsoft researcher: <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" rel=\"noopener noreferrer\" title=\"Rick Szeliski\" href=\"http:\/\/research.microsoft.com\/en-us\/um\/people\/szeliski\/\" target=\"_blank\">Rick Szeliski<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><br \/>\nApplying a &#8220;matting&#8221; technique to clean up and sharpen images that contain an obscure foreground, like a picture of a bird outside of a dirty window.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Posted by George Thomas Jr. You may already have heard about the Microsoft technology that can automatically identify objects in a picture and write an accurate caption for it, but those types of research advancements don\u2019t occur in a vacuum. Indeed, interdisciplinary research combining computer vision, machine learning, artificial intelligence, computer systems and networking are [&hellip;]<\/p>\n","protected":false},"author":30766,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[],"msr_hide_image_in_river":0,"footnotes":""},"categories":[194471],"tags":[200165,187359,200837,200907,186895,186897,201109,201159,186925,202141,202183,202267,202305,187348,186418,186595,203051,203431,203553,203773,203871,197617,197852,204705,204723,197863],"research-area":[13562],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-3271","post","type-post","status-publish","format-standard","hentry","category-computer-vision","tag-28th-ieee-conference-on-computer-vision-and-pattern-recognition","tag-artificial-intelligence","tag-cem-keskin","tag-christoph-rhemann","tag-computer-systems","tag-computer-vision","tag-convolutional-neural-networks","tag-cvpr-boston-2015","tag-deep-learning","tag-jamie-shotton","tag-jian-sun","tag-jonathan-taylor","tag-kaiming-he","tag-kinect","tag-machine-learning","tag-networking","tag-nikhil-naik","tag-pushmeet-kohli","tag-rick-szeliski","tag-shahram-izadi","tag-sing-bing-kang","tag-university-of-north-carolina-at-chapel-hill","tag-xian-jiaotong-university","tag-xudong-cao","tag-yan-xia","tag-yichen-wei","msr-research-area-computer-vision","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[],"related-researchers":[],"msr_type":"Post","byline":"","formattedDate":"June 8, 2015","formattedExcerpt":"Posted by George Thomas Jr. You may already have heard about the Microsoft technology that can automatically identify objects in a picture and write an accurate caption for it, but those types of research advancements don\u2019t occur in a vacuum. Indeed, interdisciplinary research combining computer&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/3271","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/30766"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=3271"}],"version-history":[{"count":1,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/3271\/revisions"}],"predecessor-version":[{"id":235645,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/3271\/revisions\/235645"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=3271"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=3271"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=3271"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=3271"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=3271"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=3271"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=3271"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=3271"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=3271"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=3271"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=3271"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}