Microsoft Research Blog

Artificial intelligence

  1. A Simple Recipe towards Reducing Hallucination in Neural Surface Realisation 

    July 1, 2019

    Recent neural language generation systems often hallucinate contents (i.e., producing irrelevant or contradicted facts), especially when trained on loosely corresponding pairs of the input structure and text. To mitigate this issue, we propose to integrate a language understanding module for data refinement with self-training iterations…

  2. Automated Chess Commentator Powered by Neural Chess Engine. 

    July 1, 2019 | Hongyu Zang, Zhiwei Yu, and Xiaojun Wan

    In this paper, we explore a new approach for automated chess commentary generation, which aims to generate chess commentary texts in different categories (e.g., description, comparison, planning, etc.). We introduce a neural chess engine into text generation models to help with encoding boards, predicting moves,…

  3. Pose synthesis in unseen human poses 

    June 19, 2019 | Fanny Nina Paravecino, James Hall, and Rita Brugarolas Brufau

    Techniques related to synthesizing an image of a person in an unseen pose are discussed. Such techniques include detecting a body part occlusion for a body part in a representation of the person in a first image and, in response to the detected occlusion, projecting…

  4. Learning Joint Reconstruction of Hands and Manipulated Objects 

    June 15, 2019

    Estimating hand-object manipulations is essential for in- terpreting and imitating human actions. Previous work has made significant progress towards reconstruction of hand poses and object shapes in isolation. Yet, reconstructing hands and objects during manipulation is a more challeng- ing task due to significant occlusions…

  5. Densely Semantically Aligned Person Re-Identification 

    June 15, 2019 | Zhizheng Zhang, Cuiling Lan, Wenjun Zeng, and Zhibo Chen

    We propose a densely semantically aligned person re-identification (re-ID) framework. It fundamentally addresses the body misalignment problem caused by pose/viewpoint variations, imperfect person detection, occlusion, etc.. By leveraging the estimation of the dense semantics of a person image, we construct a set of densely semantically…

  6. Synthesis and machine learning for heterogeneous extraction 

    June 7, 2019

    We present a way to combine techniques from the program synthesis and machine learning communities to extract structured information from heterogeneous data. Such problems arise in several situations such as extracting attributes from web pages, machine-generated emails, or from data obtained from multiple sources. Our…

  7. Identifying Security Bug Reports Based Solely on Report Titles and Noisy Data 

    May 31, 2019 | Mayana Pereira, Alok Kumar, and Scott Cristiansen

    Identifying security bug reports (SBRs) is a vital step in the software development life-cycle. In supervised machine learning based approaches, it is usual to assume that entire bug reports are available for training and that their labels are noise free. To the best of our…

  8. Mask-Guided Portrait Editing With Conditional GANs 

    May 31, 2019

    Portrait editing is a popular subject in photo manipulation.The Generative Adversarial Network (GAN) advances the generating of realistic faces and allows more face editing. In this paper, we argue about three issues in existing techniques: diversity, quality, and controllability for portrait synthesis and editing. To…

  9. Face Parsing With RoI Tanh-Warping 

    May 31, 2019

    Face parsing computes pixel-wise label maps for different semantic components (e.g., hair, mouth, eyes) from face images. Existing face parsing literature have illustrated significant advantages by focusing on individual regions of interest (RoIs) for faces and facial components. However,the traditional crop-and-resize focusing mechanism ignores all…

  10. Bidirectional Learning for Domain Adaptation of Semantic Segmentation 

    May 31, 2019 | Yunsheng Li, Lu Yuan, and Nuno Vasconcelos

    Domain adaptation for semantic image segmentation is very necessary since manually labeling large datasets with pixel-level labels is expensive and time consuming. Existing domain adaptation techniques either work on limited datasets, or yield not so good performance compared with supervised learning. In this paper, we…

  11. Embedding Imputation with Grounded Language Information. 

    May 29, 2019 | Ziyi Yang, Chenguang Zhu, Vin Sachidananda, and Eric Darve

    Due to the ubiquitous use of embeddings as input representations for a wide range of natural language tasks, imputation of embeddings for rare and unseen words is a critical problem in language processing. Embedding imputation involves learning representations for rare or unseen words during the…