Microsoft Research Blog

Research Blog

CodeXGLUE: A benchmark dataset and open challenge for code intelligence

September 29, 2020

According to Evans Data Corporation (opens in new tab), there are 23.9 million professional developers in 2019, and the population is expected to reach 28.7 million in 2024. With the growing population of developers, code intelligence, which aims to leverage AI to help software developers…
Measuring dataset similarity using optimal transport

September 24, 2020 | David Alvarez-Melis and Nicolo Fusi

Is FashionMNIST, a dataset of images of clothing items labeled by category, more similar to MNIST or to USPS, both of which are classification datasets of handwritten digits? This is a pretty hard question to answer, but the solution could have an impact on various…
Project InnerEye open-source deep learning toolkit: Democratizing medical imaging AI

September 22, 2020 | Javier Alvarez-Valle and Gregory J. Moore, MD, PhD

For over a decade, the Project InnerEye team at Microsoft Research Cambridge has been developing state-of-the-art machine learning methods for the automatic, quantitative analysis of three-dimensional medical images. An important application is to assist clinicians for image preparation and planning tasks for radiotherapy cancer treatment…
In search for future of cloud storage, researchers look to holographic storage solutions

September 22, 2020 | Benn Thomsen, Dushyanth Narayanan, and Ant Rowstron

Data storage has always been a key tenet of compute, and with the massive growth in cloud compute, the demand for cloud data storage has opened an avenue for both revisiting prior technologies and developing new ones. It is projected that around 125 zettabytes of…
Dialogue as Dataflow: A new approach to conversational AI

September 21, 2020

By the Semantic Machines research team “Easier said than done.” These four words reflect the promise of conversational AI. It takes just seconds to ask When are Megan and I both free? but much longer to find out manually from a calendar. Indeed, almost everything…
DeepSpeed: Extreme-scale model training for everyone

September 10, 2020 | DeepSpeed Team, Rangan Majumder, and Junhua Wang

In February, we announced DeepSpeed, an open-source deep learning training optimization library, and ZeRO (Zero Redundancy Optimizer), a novel memory optimization technology in the library, which vastly advances large model training by improving scale, speed, cost, and usability. DeepSpeed has enabled researchers to create Turing…
Expressive Pixels: A new visual communication platform to support creativity, accessibility, and innovation

September 3, 2020

The need to express oneself is innate for every person in the world, and its roots run through art, technology, communication, and the acts of learning and building things from the ground up. It’s no coincidence, then, that a new platform being released by Microsoft…
Platform for Situated Intelligence: An open-source framework for multimodal, integrative AI

September 2, 2020 | Dan Bohus and Sean Andrist

Over the years at Microsoft Research, we’ve studied how to build AI systems that perceive, understand, and act in a human-filled world in real time. Our motivation has been to create computing systems that can support interactive experiences akin to what we expect when we…
Domain-specific language model pretraining for biomedical natural language processing

August 31, 2020 | Hoifung Poon and Jianfeng Gao

COVID-19 highlights a perennial problem facing scientists around the globe: how do we stay up to date with the cutting edge of scientific knowledge? In just a few months since the pandemic emerged, tens of thousands of research papers have been published concerning COVID-19 and…
Microsoft HoloLens 2: Improved Research Mode to facilitate computer vision research

August 28, 2020 | Marc Pollefeys

Since its launch in November 2019, Microsoft HoloLens 2 has helped enterprises in manufacturing, construction, healthcare, and retail onboard employees more quickly, complete tasks faster, and greatly reduce errors and waste. It sets the high-water mark for intelligent edge devices by leveraging a multitude of…
MineRL sample-efficient reinforcement learning challenge—back for a second year—benefits organizers, as well as larger research community

August 20, 2020 | Noboru Sean Kuno

To unearth a diamond in the block-based open world of Minecraft requires the acquisition of materials and the construction of tools before any diamond mining can even begin. Players need to gather wood, which they’ll use to make a wood pickaxe for mining stone underground.…
Research Collection: The Unseen History of Audio and Acoustics Research at Microsoft

August 12, 2020

Getting the sound right is a crucial ingredient in natural user interfaces, immersive gaming, realistic virtual and mixed reality, and ubiquitous computing.

No results