Microsoft Research Blog

Artificial intelligence

Fed^2: Feature-Aligned Federated Learning

August 13, 2021

Abstract to come...Federated learning learns from scattered data by fusing collaborative models from local nodes. However, the conventional coordinate-based model averaging by FedAvg ignored the random information encoded per parameter and may suffer from structural feature misalignment. In this work, we propose Fed2, a feature-aligned…
Generalized Zero-Shot Extreme Multi-label Learning

August 13, 2021

Extreme Multi-label Learning (XML) involves assigning the subset of most relevant labels to a data point from millions of label choices. A hitherto unaddressed challenge in XML is that of predicting unseen labels with no training points. These form a significant fraction of total labels…
Reinforced Iterative Knowledge Distillation for Cross-Lingual Named Entity Recognition

August 13, 2021

Named entity recognition (NER) is a fundamental component in many applications, such as Web Search and Voice Assistants. Although deep neural networks greatly improve the performance of NER, due to the requirement of large amounts of training data, deep neural networks can hardly scale out…
Deep Learning Model Based CO2 Emissions Prediction using Vehicle Telematics Sensors Data

August 6, 2021 | Mukul Singh and Rahul Kumar Dubey

Climate change is one of the greatest environmental hazards of today. Global warming, due to an increase in greenhouse gases has resulted in a continuous global increase in temperature. CO2 continues to be the leading contributor to the greenhouse effect, with transport being a major…
ProLinguist: Program Synthesis for Linguistics and NLP

August 1, 2021

We introduce ProLinguist, an approach that uses program synthesis to automatically synthesize explicit string transformation rules from input-output examples for NLP tasks. Our algorithm is able to learn rules not only where the output depends on the surrounding input context, but also stateful rules, where…
Automatic Rephrasing of Transcripts-based Action Items

August 1, 2021 | Amir Cohen, Amir Kantor (amkantor), Sagi Hilleli, and Eyal Kolman

The automated transcription of spoken language, and meetings, in particular, is becoming more widespread as automatic speech recognition systems are becoming more accurate. This trend has significantly accelerated since the outbreak of the COVID-19 pandemic, which led to a major increase in the number of…
Robust Android Malware Detection System Against Adversarial Attacks Using Q-Learning

July 31, 2021 | Hemant Rathore, Sanjay K. Sahay, Piyush Nikam, and Mohit Sewak

Since the inception of Andoroid OS, smartphones sales have been growing exponentially, and today it enjoys the monopoly in the smartphone marketplace. The widespread adoption of Android smartphones has drawn the attention of malware designers, which threatens the Android ecosystem. The current state-of-the-art Android malware…
Evidential Deep Learning for Guided Molecular Property Prediction and Discovery

July 26, 2021

While neural networks achieve state-of-the-art performance for many molecular modeling and structure–property prediction tasks, these models can struggle with generalization to out-of-domain examples, exhibit poor sample efficiency, and produce uncalibrated predictions. In this paper, we leverage advances in evidential deep learning to demonstrate a new…
AI4VIS: Survey on Artificial Intelligence Approaches for Data Visualization

July 26, 2021

Visualizations themselves have become a data format. Akin to other data formats such as text and images, visualizations are increasingly created, stored, shared, and (re-)used with artificial intelligence (AI) techniques. In this survey, we probe the underlying vision of formalizing visualizations as an emerging data…
MegaLMM: Mega-scale linear mixed models for genomic predictions with thousands of traits.

July 22, 2021 | Daniel E. Runcie, Jiayi Qu, Hao Cheng, and Lorin Crawford

Large-scale phenotype data can enhance the power of genomic prediction in plant and animal breeding, as well as human genetics. However, the statistical foundation of multi-trait genomic prediction is based on the multivariate linear mixed effect model, a tool notorious for its fragility when applied…
Adaptive Transfer Learning on Graph Neural Networks

July 18, 2021 | Xueting Han, Zhenhuan Huang, Bang An, and Jing Bai

Graph neural networks (GNNs) is widely used to learn a powerful representation of graph-structured data. Recent work demonstrates that transferring knowledge from self-supervised tasks to downstream tasks could further improve graph representation. However, there is an inherent gap between self-supervised tasks and downstream tasks in…
WILDS: A Benchmark of in-the-Wild Distribution Shifts

July 17, 2021

Distribution shifts -- where the training distribution differs from the test distribution -- can substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. Despite their ubiquity, these real-world distribution shifts are under-represented in the datasets widely used in the ML community…

No results