Learning to Map Natural Language to General Purpose Source Code
Models that map natural language (NL) to source code in general purpose languages such as Java, Python, and SQL find utility among two main audiences: developers who can manipulate the generated code, and non-expert…
Layer Trajectory BLSTM: New evolution enhances speech recognition technology
Speech is a signal that can enable natural interaction between human and machine. In order to facilitate this exchange, machines have to be able to recognize what a human has spoken, both the words and…
Microsoft at Interspeech 2019
Interspeech is the world's largest and most comprehensive conference on the science and technology of spoken language processing. Microsoft joins the conference as a proud gold sponsor. Stop by our booth to chat with our…
Bring your phones to the conference table: creating ad hoc microphone arrays from personal devices
Recent advances in machine learning and signal processing, as well as the availability of massive computing power, have resulted in dramatic and steady improvement in speech recognition accuracy. Voice interfaces to digital devices have become…
Structure Visual Understanding and Interaction with Human and Environment
The visual world around us is highly structured. As 2D projections of our world, images are also structured. In images, there is usually a background and some foreground objects (e.g., kites and birds in the…