Microsoft Research Blog

Research Blog

VinVL: Advancing the state of the art for vision-language models

January 14, 2021 | Pengchuan Zhang, Lei Zhang, and Jianfeng Gao

Humans understand the world by perceiving and fusing information from multiple channels, such as images viewed by the eyes, voices heard by the ears, and other forms of sensory input. One of the core aspirations in AI is to develop algorithms that endow computers with…
Microsoft DeBERTa surpasses human performance on the SuperGLUE benchmark

January 6, 2021 | Pengcheng He, Xiaodong Liu, Jianfeng Gao, and Weizhu Chen

Natural language understanding (NLU) is one of the longest running goals in AI, and SuperGLUE is currently among the most challenging benchmarks for evaluating NLU models. The benchmark consists of a wide range of NLU tasks, including question answering, natural language inference, co-reference resolution, word…
Unadversarial examples: Designing objects for robust vision

December 22, 2020 | Hadi Salman

Many of the items and objects we use in our daily lives were designed with people in mind. In October, the Reserve Bank of Australia put out into the world its redesigned $100 banknote. Some design elements remained the same—such as color and size, characteristics…
Research at Microsoft 2020: Addressing the present while looking to the future

December 17, 2020

Microsoft researchers pursue the big questions about what the world will be like in the future and the role technology will play. Not only do they take on the responsibility of exploring the long-term vision of their research, but they must also be ready to…
‘Seeing’ on tiny battery-powered microcontrollers with RNNPool

December 11, 2020

Computer vision has rapidly evolved over the past decade, allowing for such applications as Seeing AI, a camera app that describes aloud a person’s surroundings, helping those who are blind or have low vision; systems that can detect whether a product, such as a computer…
MPNet combines strengths of masked and permuted language modeling for language understanding

December 9, 2020 | Xu Tan

Pretrained language models have been a hot research topic in natural language processing. These models, such as BERT, are usually pretrained on large-scale language corpora with carefully designed pretraining objectives and then fine-tuned on downstream tasks to boost the accuracy. Among these, masked language modeling…
NeurIPS 2020: Moving toward real-world reinforcement learning via batch RL, strategic exploration, and representation learning

December 7, 2020

As human beings, we encounter unfamiliar situations all the time—learning to drive, living on our own for the first time, starting a new job. And while we can anticipate what to expect based on what others have told us or what we’ve picked up from…
Utilizing consumer cameras for contact-free physiological measurement in telehealth and beyond

December 2, 2020 | Daniel McDuff and Xin Liu

According to the CDC WONDER Online Database (opens in new tab), heart disease is currently the leading cause of death for both men and women in the United States. However, most deaths due to cardiovascular diseases could be prevented with suitable interventions. Early detection of…
A Microsoft custom data type for efficient inference

December 2, 2020

AI is taking on an increasingly important role in many Microsoft products, such as Bing and Office 365. In some cases, it’s being used to power outward-facing features like semantic search in Microsoft Word or intelligent answers in Bing, and deep neural networks (DNNs) are…
Adversarial machine learning and instrumental variables for flexible causal modeling

December 1, 2020 | Vasilis Syrgkanis

We are going through a new shift in machine learning (ML), where ML models are increasingly being used to automate decision-making in a multitude of domains: what personalized treatment should be administered to a patient, what discount should be offered to an online customer, and…
The human side of AI for chess

November 30, 2020 | Reid McIlroy-Young, Ashton Anderson, Jon Kleinberg, and Siddhartha Sen

Editor’s note: The section “Modeling individual players’ styles with Maia” has been updated as of July 12, 2021. As artificial intelligence continues its rapid progress, equaling or surpassing human performance on benchmarks in an increasing range of tasks, researchers in the field are directing more…
Project InnerEye evaluation shows how AI can augment and accelerate clinicians’ ability to perform radiotherapy planning 13 times faster

November 30, 2020

Up to half of the population in the United States (opens in new tab) and United Kingdom (opens in new tab) will be diagnosed with cancer at some point in their lives. Of those, half will be treated with radiotherapy (RT), often in combination with…