Microsoft Research Blog

Artificial intelligence

Self-supervised self-supervision by combining deep learning and probabilistic logic

February 4, 2021 | Hunter Lang and Hoifung Poon

Labeling training examples at scale is a perennial challenge in machine learning. Self-supervision methods compensate for the lack of direct supervision by leveraging prior knowledge to automatically generate noisy labeled examples. Deep probabilistic logic (DPL) is a unifying framework for self-supervised learning that represents unknown…
Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency

February 4, 2021 | Seokju Lee, Sunghoon Im, Stephen Lin, and In So Kweon

We present an end-to-end joint training framework that explicitly models 6-DoF motion of multiple dynamic objects, ego-motion and depth in a monocular camera setup without supervision. Our technical contributions are three-fold. First, we highlight the fundamental difference between inverse and forward projection while modeling the…
Towards Topic-Aware Slide Generation For Academic Papers With Unsupervised Mutual Learning

February 2, 2021 | Da-Wei Li, Danqing Huang, Tingting Ma, and Chin-Yew Lin

Slides are commonly used to present information and tell stories. In academic and research communities, slides are typically used to summarize findings in accepted papers for presentation in meetings and conferences. These slides for academic papers usually contain common and essential topics such as major…
Object-Centric Image Generation from Layouts

February 2, 2021

Despite recent impressive results on single-object and single-domain image generation, the generation of complex scenes with multiple objects remains challenging. In this paper, we start with the idea that a model must be able to understand individual objects and relationships between objects in order to…
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics

February 1, 2021

We introduce GEM, a living benchmark for natural language Generation (NLG), its Evaluation, and Metrics. Measuring progress in NLG relies on a constantly evolving ecosystem of automated metrics, datasets, and human evaluation standards. Due to this moving target, new models often still evaluate on divergent…
Is the Most Accurate AI the Best Teammate? Optimizing AI for Teamwork

February 1, 2021

AI practitioners typically strive to develop the most accurate systems, making an implicit assumption that the AI system will function autonomously. However, in practice, AI systems often are used to provide advice to people in domains ranging from criminal justice and finance to healthcare. In…
Leveraging Expert Consistency to Improve Algorithmic Decision Support

January 23, 2021 | Maria De-Arteaga, Artur Dubrawski, and Alex Chouldechova

Due to their promise of superior predictive power relative to human assessment, machine learning models are increasingly being used to support high-stakes decisions. However, the nature of the labels available for training these models often hampers the usefulness of predictive models for decision support. In…
Hybrid Cascade Point Search Network for High Precision Bar Chart Component Detection

January 10, 2021 | Junyu Luo, Jinpeng Wang, and Chin-Yew Lin

Bar charts are commonly used for data visualization. One common form of chart distribution is in its image form. To enable machine comprehension of chart images, precise detection of chart components in chart images is a critical step. Existing image object detection methods do not…
Towards Automating Code Review Activities

January 6, 2021

Code reviews are popular in both industrial and open source projects. The benefits of code reviews are widely recognized and include better code quality and lower likelihood of introducing bugs. However, since code review is a manual activity, it comes at the cost of spending…
ChartOCR: Data Extraction from Charts Images via a Deep Hybrid Framework

January 5, 2021 | Junyu Luo, Zekun Li, Jinpeng Wang, and Chin-Yew Lin

Chart images are commonly used for data visualization. Automatically reading the chart values is a key step for chart content understanding. Charts have a lot of variations in style (e.g., bar chart, line chart, pie chart, etc.), which makes pure rule-based data extraction methods…
CASINet: Content-Adaptive Scale Interaction Networks for scene parsing

January 2, 2021

Objects at different spatial positions in an image exhibit different scales. Adaptive receptive fields are expected to capture suitable ranges of context for accurate pixel-level semantic prediction. Recently, atrous convolution with different dilation rates has been used to generate features of multi-scales through…
Human-Guided Learning of Column Networks: Knowledge Injection for Relational Deep Learning

January 1, 2021

Recently, deep models have been successfully adopted in several applications, especially where low-level representations are needed. However, sparse, noisy samples and structured domains (with multiple objects and interactions) are some of the open challenges in most deep models. Column Networks, a deep architecture, can succinctly…
