Microsoft Research Blog

Research Blog

µTransfer: A technique for hyperparameter tuning of enormous neural networks

March 8, 2022 | Edward Hu, Greg Yang, and Jianfeng Gao

Great scientific achievements cannot be made by trial and error alone. Every launch in the space program is underpinned by centuries of fundamental research in aerodynamics, propulsion, and celestial bodies. In the same way, when it comes to building large-scale AI systems, fundamental research forms…
COMPASS: COntrastive Multimodal Pretraining for AutonomouS Systems

February 23, 2022

Humans have the fundamental cognitive ability to perceive the environment through multimodal sensory signals and utilize this to accomplish a wide variety of tasks. It is crucial that an autonomous agent can similarly perceive the underlying state of an environment from different sensors and appropriately…
Using reinforcement learning to identify high-risk states and treatments in healthcare

February 2, 2022 | Mehdi Fatemi, Taylor Killian, and Marzyeh Ghassemi

As the pandemic overburdens medical facilities and clinicians become increasingly overworked, the ability to make quick decisions on providing the best possible treatment is even more critical. In urgent health situations, such decisions can mean life or death. However, certain treatment protocols can pose a…
Advancing AI trustworthiness: Updates on responsible AI research

February 1, 2022 | Mihaela Vorvoreanu and Kathy Walker

Inflated expectations around the capabilities of AI technologies may lead people to believe that computers can’t be wrong. The truth is AI failures are not a matter of if but when. AI is a human endeavor that combines information about people and the physical world…
DeepSpeed: Advancing MoE inference and training to power next-generation AI scale

January 19, 2022 | DeepSpeed Team and Andrey Proskurin

In the last three years, the largest trained dense models have increased in size by over 1,000 times, from a few hundred million parameters to over 500 billion parameters in Megatron-Turing NLG 530B (MT-NLG). Improvements in model quality with size suggest that this trend will…
EzPC: Increased data security in the AI model validation process

January 12, 2022 | Nishanth Chandran, Divya Gupta, Aseem Rastogi, and Rahul Sharma

From manufacturing and logistics to agriculture and transportation, the expansion of artificial intelligence (AI) in the last decade has revolutionized a multitude of industries—examples include enhancing predictive analytics on the manufacturing floor and making microclimate predictions so that farmers can respond and save their crops…
Azure AI milestone: Microsoft KEAR surpasses human performance on CommonsenseQA benchmark

December 20, 2021

KEAR (Knowledgeable External Attention for commonsense Reasoning)—along with recent milestones in computer vision and neural text-to-speech—is part of a larger Azure AI (opens in new tab) mission to provide relevant, meaningful AI solutions and services that work better for people because they better capture how people learn and work—with…
Azure AI milestone: New Neural Text-to-Speech models more closely mirror natural speech

December 17, 2021 | Sheng Zhao

Neural Text-to-Speech—along with recent milestones in computer vision and question answering—is part of a larger Azure AI (opens in new tab) mission to provide relevant, meaningful AI solutions and services that work better for people because they better capture how people learn and work—with improved…
Research at Microsoft 2021: Collaborating for real-world change

December 15, 2021

Over the past 30 years, Microsoft Research has undergone a shift in how it approaches innovation, broadening its mission to include not only advancing the state of computing but also using technology to tackle some of the world’s most pressing challenges. That evolution has never…
Azure AI milestone: New foundation model Florence v1.0 advances state of the art, topping popular computer vision leaderboards

December 14, 2021 | Project Florence Team

The Project Florence Team Florence v1.0—along with recent milestones in Neural Text-to-Speech and question answering—is part of a larger Azure AI (opens in new tab)mission to provide relevant, meaningful AI solutions and services that work better for people because they better capture how people learn…
$Diagram shows The role of computational modelling in the early-stage drug-discovery process. Following target identification and the screening of many molecules to identify possible candidates, the process of optimization can occur by human-led cycles of synthesis and testing in the laboratory. However, if computational modelling is used, most molecules are tested in silico, and it becomes necessary to synthesize and test only a small fraction of the candidate molecules. Just as with in vitro testing, in silico testing must be followed by clinical trials before the drug reaches the market.$

FS-Mol: Bringing Deep Learning to Early-Stage Drug Discovery

December 10, 2021 | Marc Brockschmidt and Megan Stanley

The drug development process is an iterative one that consists of discovery, design, and testing. Historically, drugs were derived from plants and discovered through trial-and-error experiments. Fortunately, this drug discovery process now occurs in a lab, with each iteration of custom-designed compounds producing a more…
Finding and fixing bugs with deep learning

December 8, 2021 | Miltos Allamanis and Marc Brockschmidt

Finding and fixing bugs in code is a time-consuming, and often frustrating, part of everyday work for software developers. Can deep learning address this problem and help developers deliver better software, faster? In a new paper, Self-Supervised Bug Detection and Repair, presented at the 2021…

No results