Microsoft Research Blog

English

  1. FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation 

    May 12, 2024

    Singing Accompaniment Generation (SAG), which generates instrumental music to accompany input vocals, is crucial to developing human-AI symbiotic art creation systems. The state-of-the-art method, SingSong, utilizes a multi-stage autoregressive (AR) model for SAG, however, this method is extremely slow as it generates semantic and acoustic…

  2. Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia 

    May 11, 2024

    AI tools are increasingly deployed in community contexts. However, datasets used to evaluate AI are typically created by developers and annotators outside a given community, which can yield misleading conclusions about AI performance. How might we empower communities to drive the intentional design and curation…

  3. CodeAid: Evaluating a Classroom Deployment of an LLM-based Programming Assistant that Balances Student and Educator Needs 

    May 11, 2024

    Timely, personalized feedback is essential for students learning programming. LLM-powered tools like ChatGPT offer instant support, but reveal direct answers with code, which may hinder deep conceptual engagement. We developed CodeAid, an LLM-powered programming assistant delivering helpful, technically correct responses, without revealing code solutions. CodeAid…

  4. Understanding the Role of Large Language Models in Personalizing and Scaffolding Strategies to Combat Academic Procrastination 

    May 11, 2024

    Traditional interventions for academic procrastination often fail to capture the nuanced, individual-specific factors that underlie them. Large language models (LLMs) hold immense potential for addressing this gap by permitting open-ended inputs, including the ability to customize interventions to individuals' unique needs. However, user expectations and…

  5. PhotoScout: Synthesis-Powered Multi-Modal Image Search 

    May 11, 2024 | Celeste Barnaby, Qiaochu Chen, Chenglong Wang, and Isil Dillig

    Due to the availability of increasingly large amounts of visual data, there is a growing need for tools that can help users find relevant images. While existing tools can perform image retrieval based on similarity or metadata, they fall short in scenarios that necessitate semantic…

  6. A Design Space for Intelligent and Interactive Writing Assistants 

    May 11, 2024

    In our era of rapid technological advancement, the research landscape for writing assistants has become increasingly fragmented across various research communities. We seek to address this challenge by proposing a design space as a structured way to examine and explore the multidimensional space of intelligent…

  7. background pattern

    Microsoft at CHI 2024 

    May 11, 2024 | Emily Maryatt and Tu Ong

    Microsoft Research is proud to be a sponsor of the ACM Computer Human Interaction (CHI) 2024 Conference on Human Factors in Computing Systems (opens in new tab). CHI brings together researchers and practitioners from all over the world and from diverse cultures, backgrounds, and positionalities,…

  8. Beyond Theory: A UX Outcomes Casebook for HCI Education 

    May 11, 2024

    The CHI community has expressed a growing interest in creating and sharing educational materials related to User Experience (UX) outcomes, particularly emphasizing summative research. Based on insights gathered at a CSCW 2003 workshop on understanding and evaluating UX outcomes at scale, we identified two areas…

  9. MAIDR: Making Statistical Visualizations Accessible with Multimodal Data Representation 

    May 11, 2024

    This paper investigates new data exploration experiences that enable blind users to interact with statistical data visualizations−bar plots, heat maps, box plots, and scatter plots−leveraging multimodal data representations. In addition to sonification and textual descriptions that are commonly employed by existing accessible visualizations, our MAIDR…