Microsoft Research Blog

Artificial intelligence

How Incidental are the Incidents? Characterizing and Prioritizing Incidents for Large-Scale Online Service Systems

December 20, 2020

Although tremendous efforts have been devoted to the quality assurance of online service systems, in reality, these systems still come across many incidents (i.e., unplanned interruptions and outages), which can decrease user satisfaction or cause economic loss. To better understand the characteristics of incidents and…
Transfer learning-based ensemble support vector machine model for automated COVID-19 detection using lung computerized tomography scan data

December 18, 2020

The newly discovered coronavirus disease, popularly known as COVID-19, is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and has been declared a pandemic by the World Health Organization (WHO). Early-stage detection of COVID-19 is crucial for the containment of the pandemic it has…
Exploiting Sample Uncertainty for Domain Adaptive Person Re-Identification

December 16, 2020

Many unsupervised domain adaptive (UDA) person re-identification (ReID) approaches combine clustering-based pseudo-label prediction with feature fine-tuning. However, because of the domain gap, the pseudo-labels are not always reliable, and some are noisy or incorrect. This misleads feature representation learning and degrades performance. In this…
Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning

December 16, 2020 | Zeyuan Allen-Zhu and Yuanzhi Li

We formally study how an ensemble of deep learning models can improve test accuracy, and how the superior performance of the ensemble can be distilled into a single model using knowledge distillation. We consider the challenging case where the ensemble is simply an average of the outputs…
Learning Omni-frequency Region-adaptive Representations for Real Image Super-Resolution

December 11, 2020

Traditional single image super-resolution (SISR) methods that focus on a single, uniform degradation (e.g., bicubic down-sampling) typically perform poorly when applied to real-world low-resolution (LR) images because of the complicated realistic degradations. The key to solving this more challenging real image super-resolution…
Identification of Significant Permissions for Efficient Android Malware Detection

December 10, 2020 | Hemant Rathore, Sanjay K. Sahay, Ritvik Rajvanshi, and Mohit Sewak

Since Google unveiled the Android OS for smartphones, malware has been thriving along the 3Vs, i.e., volume, velocity, and variety. A recent report indicates that one out of every five business/industry mobile applications leaks sensitive personal data. Traditional signature/heuristic-based malware detection systems are unable to cope…
VAEM: a Deep Generative Model for Heterogeneous Mixed Type Data

December 10, 2020

Deep generative models often perform poorly in real-world applications due to the heterogeneity of natural data sets. Heterogeneity arises from data containing different types of features (categorical, ordinal, continuous, etc.) and features of the same type having different marginal distributions. We propose an extension of…
Detection of Malicious Android Applications: Classical Machine Learning vs. Deep Neural Network Integrated with Clustering

December 10, 2020 | Hemant Rathore, Sanjay K. Sahay, Shivin Thukral, and Mohit Sewak

Today, the anti-malware community is facing challenges due to the ever-increasing sophistication and volume of malware attacks developed by adversaries. Traditional malware detection mechanisms are not able to cope with next-generation malware attacks. Therefore, in this paper, we propose effective and efficient Android malware detection models based…
UnMask: Adversarial Detection and Defense Through Robust Feature Alignment

December 9, 2020 | Scott Freitas, Shang-Tse Chen, Zijie J. Wang, and Duen Horng (Polo) Chau

Recent research has demonstrated that deep learning architectures are vulnerable to adversarial attacks, highlighting the vital need for defensive techniques to detect and mitigate these attacks before they occur. We present UnMask, an adversarial detection and defense framework based on robust feature alignment. UnMask combats…
Machine Learning for Glacier Monitoring in the Hindu Kush Himalaya

December 8, 2020

Glacier mapping is key to ecological monitoring in the Hindu Kush Himalaya (HKH) region. Climate change poses a risk to individuals whose livelihoods depend on the health of glacier ecosystems. In this work, we present a machine learning-based approach to support ecological monitoring, with a focus on…
Fusing Context Into Knowledge Graph for Commonsense Question Answering

December 8, 2020

Commonsense question answering (QA) requires a model to grasp commonsense and factual knowledge to answer questions about world events. Many prior methods couple language modeling with knowledge graphs (KG). However, although a KG contains rich structural information, it lacks the context to provide a more…
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption

December 7, 2020

In this paper, we propose Text-Aware Pre-training (TAP) for Text-VQA and Text-Caption tasks. These two tasks aim at reading and understanding scene text in images for question answering and image caption generation, respectively. In contrast to the conventional vision-language pre-training that fails to capture scene…
