Microsoft Research Blog

Artificial intelligence

  1. Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency 

    February 4, 2021 | Seokju Lee, Sunghoon Im, Stephen Lin, and In So Kweon

    We present an end-to-end joint training framework that explicitly models 6-DoF motion of multiple dynamic objects, ego-motion and depth in a monocular camera setup without supervision. Our technical contributions are three-fold. First, we highlight the fundamental difference between inverse and forward projection while modeling the…

  2. Object-Centric Image Generation from Layouts 

    February 2, 2021

    Despite recent impressive results on single-object and single-domain image generation, the generation of complex scenes with multiple objects remains challenging. In this paper, we start with the idea that a model must be able to understand individual objects and relationships between objects in order to…

  3. The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics 

    February 1, 2021

    We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent…

  4. Is the Most Accurate AI the Best Teammate? Optimizing AI for Teamwork 

    February 1, 2021

    AI practitioners typically strive to develop the most accurate systems, making an implicit assumption that the AI system will function autonomously. However, in practice, AI systems often are used to provide advice to people in domains ranging from criminal justice and finance to healthcare. In…

  5. Leveraging Expert Consistency to Improve Algorithmic Decision Support 

    January 23, 2021 | Maria De-Arteaga, Artur Dubrawski, and Alex Chouldechova

    Due to their promise of superior predictive power relative to human assessment, machine learning models are increasingly being used to support high-stakes decisions. However, the nature of the labels available for training these models often hampers the usefulness of predictive models for decision support. In…

  6. Towards Automating Code Review Activities 

    January 6, 2021

    Code reviews are popular in both industrial and open source projects. The benefits of code reviews are widely recognized and include better code quality and lower likelihood of introducing bugs. However, since code review is a manual activity it comes at the cost of spending…

  7. ChartOCR: Data Extraction from Charts Images via a Deep Hybrid Framework 

    January 5, 2021 | Junyu Luo, Zekun Li, Jinpeng Wang, and Chin-Yew Lin

    Chart images are commonly used for data visualization. Automatically reading the chart values is a key step for chart content understanding. Charts have a lot of variations in style (e.g., bar chart, line chart, pie chart and etc.), which makes pure rule-based data extraction methods…

  8. CASINet: Content-Adaptive Scale Interaction Networks for scene parsing 

    January 2, 2021

    Abstract Objects at different spatial positions in an image exhibit different scales. Adaptive receptive fields are expected to capture suitable ranges of context for accurate pixel level semantic prediction. Recently, atrous convolution with different dilation rates has been used to generate features of multi-scales through…