Microsoft Research Blog

Artificial intelligence

  1. Group Sampling for Scale Invariant Face Detection 

    July 28, 2020

    Detectors based on deep learning tend to detect multi-scale faces on a single input image for efficiency. Recent works, such as FPN and SSD, generally use feature maps from multiple layers with different spatial resolutions to detect objects at different scales, e.g., high-resolution feature maps…

  2. Domain-Adaptive Neural Automated Essay Scoring. 

    July 25, 2020 | Yue Cao, Hanqi Jin, Xiaojun Wan, and Zhiwei Yu

    Automated essay scoring (AES) is a promising, yet challenging task. Current state-of-the-art AES models ignore the domain difference and cannot effectively leverage data from different domains. In this paper, we propose a domain-adaptive framework to improve the domain adaptability of AES models. We design two…

  3. Generalizing Variational Autoencoders with Hierarchical Empirical Bayes. 

    July 19, 2020 | Wei Cheng, Gregory Darnell, Sohini Ramachandran, and Lorin Crawford

    Variational Autoencoders (VAEs) have experienced recent success as data-generating models by using simple architectures that do not require significant fine-tuning of hyperparameters. However, VAEs are known to suffer from over-regularization which can lead to failure to escape local maxima. This phenomenon, known as posterior collapse,…

  4. InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training 

    July 14, 2020

    In this work, we formulate cross-lingual language model pre-training as maximizing mutual information between multilingual-multi-granularity texts. The unified view helps us to better understand the existing methods for learning cross-lingual representations. More importantly, the information-theoretic framework inspires us to propose a pre-training task based on…

  5. Programming by Rewards 

    July 14, 2020

    We formalize and study ``programming by rewards'' (PBR), a new approach for specifying and synthesizing subroutines for optimizing some quantitative metric such as performance, resource utilization, or correctness over a benchmark. A PBR specification consists of (1) input features $x$, and (2) a reward function…

  6. The Non-IID Data Quagmire of Decentralized Machine Learning 

    July 11, 2020 | Kevin Hsieh, Amar Phanishayee, Onur Mutlu, and Phillip Gibbons

    Many large-scale machine learning (ML) applications need to perform decentralized learning over datasets generated at different devices and locations. Such datasets pose a significant challenge to decentralized learning because their different contexts result in significant data distribution skew across devices/locations. In this paper, we take…

  7. Improved Sleeping Bandits with Stochastic Action Sets and Adversarial Rewards 

    July 11, 2020 | Aadirupa Saha, Pierre Gaillard, and Michal Valko

    In this paper, we consider the problem of sleeping bandits with stochastic action sets and adversarial rewards. In this setting, in contrast to most work in bandits, the actions may not be available at all times. For instance, some products might be out of stock…

  8. Attribute2Font: Creating Fonts You Want From Attributes 

    July 8, 2020 | Yizhi Wang, Yue Gao, and Zhouhui Lian

    Font design is now still considered as an exclusive privilege of professional designers, whose creativity is not possessed by existing software systems. Nevertheless, we also notice that most commercial font products are in fact manually designed by following specific requirements on some attributes of glyphs,…

  9. MichiGAN: multi-input-conditioned hair image generation for portrait editing 

    July 7, 2020

    Despite the recent success of face image generation with GANs, conditional hair editing remains challenging due to the under-explored complexity of its geometry and appearance. In this paper, we present MichiGAN (Multi-Input-Conditioned Hair Image GAN), a novel conditional image generation method for interactive portrait hair…

  10. MichiGAN: multi-input-conditioned hair image generation for portrait editing 

    July 7, 2020

    Despite the recent success of face image generation with GANs, conditional hair editing remains challenging due to the under-explored complexity of its geometry and appearance. In this paper, we present MichiGAN (Multi-Input-Conditioned Hair Image GAN), a novel conditional image generation method for interactive portrait hair…

  11. The State and Fate of Linguistic Diversity and Inclusion in the NLP World 

    July 5, 2020

    Language technologies contribute to promoting multilingualism and linguistic diversity around the world. However, only a very small number of the over 7000 languages of the world are represented in the rapidly evolving language technologies and applications. In this paper we look at the relation between…

  12. Multi-scale Genomic Inference using Biologically Annotated Neural Networks 

    July 2, 2020

    With the emergence of large-scale genomic datasets, there is a unique opportunity to integrate machine learning approaches as standard tools within genome-wide association (GWA) studies. Unfortunately, while machine learning methods have been shown to account for nonlinear data structures and exhibit greater predictive power over…