Microsoft Research Blog

Artificial intelligence

  1. Fed^2: Feature-Aligned Federated Learning 

    August 13, 2021

    Abstract to come...Federated learning learns from scattered data by fusing collaborative models from local nodes. However, the conventional coordinate-based model averaging by FedAvg ignored the random information encoded per parameter and may suffer from structural feature misalignment. In this work, we propose Fed2, a feature-aligned…

  2. Generalized Zero-Shot Extreme Multi-label Learning 

    August 13, 2021

    Extreme Multi-label Learning (XML) involves assigning the subset of most relevant labels to a data point from millions of label choices. A hitherto unaddressed challenge in XML is that of predicting unseen labels with no training points. These form a significant fraction of total labels…

  3. ProLinguist: Program Synthesis for Linguistics and NLP 

    August 1, 2021

    We introduce ProLinguist, an approach that uses program synthesis to automatically synthesize explicit string transformation rules from input-output examples for NLP tasks. Our algorithm is able to learn rules not only where the output depends on the surrounding input context, but also stateful rules, where…

  4. Automatic Rephrasing of Transcripts-based Action Items 

    August 1, 2021 | Amir Cohen, Amir Kantor (amkantor), Sagi Hilleli, and Eyal Kolman

    The automated transcription of spoken language, and meetings, in particular, is becoming more widespread as automatic speech recognition systems are becoming more accurate. This trend has significantly accelerated since the outbreak of the COVID-19 pandemic, which led to a major increase in the number of…

  5. Robust Android Malware Detection System Against Adversarial Attacks Using Q-Learning 

    July 31, 2021 | Hemant Rathore, Sanjay K. Sahay, Piyush Nikam, and Mohit Sewak

    Since the inception of Andoroid OS, smartphones sales have been growing exponentially, and today it enjoys the monopoly in the smartphone marketplace. The widespread adoption of Android smartphones has drawn the attention of malware designers, which threatens the Android ecosystem. The current state-of-the-art Android malware…

  6. Evidential Deep Learning for Guided Molecular Property Prediction and Discovery 

    July 26, 2021

    While neural networks achieve state-of-the-art performance for many molecular modeling and structure–property prediction tasks, these models can struggle with generalization to out-of-domain examples, exhibit poor sample efficiency, and produce uncalibrated predictions. In this paper, we leverage advances in evidential deep learning to demonstrate a new…

  7. AI4VIS: Survey on Artificial Intelligence Approaches for Data Visualization 

    July 26, 2021

    Visualizations themselves have become a data format. Akin to other data formats such as text and images, visualizations are increasingly created, stored, shared, and (re-)used with artificial intelligence (AI) techniques. In this survey, we probe the underlying vision of formalizing visualizations as an emerging data…

  8. Adaptive Transfer Learning on Graph Neural Networks 

    July 18, 2021 | Xueting Han, Zhenhuan Huang, Bang An, and Jing Bai

    Graph neural networks (GNNs) is widely used to learn a powerful representation of graph-structured data. Recent work demonstrates that transferring knowledge from self-supervised tasks to downstream tasks could further improve graph representation. However, there is an inherent gap between self-supervised tasks and downstream tasks in…

  9. WILDS: A Benchmark of in-the-Wild Distribution Shifts 

    July 17, 2021

    Distribution shifts -- where the training distribution differs from the test distribution -- can substantially degrade the accuracy of machine learning (ML) systems deployed in the wild. Despite their ubiquity, these real-world distribution shifts are under-represented in the datasets widely used in the ML community…