Publication Challenges in Human-Agent Communication Gagan Bansal, Jennifer Wortman Vaughan, Saleema Amershi, Eric Horvitz, Adam Fourney, Hussein Mozannar, Victor Dibia, Daniel S. Weld MSR-TR-2024-53 | December 2024 Published by Microsoft Project
Publication Microsoft New Future of Work Report 2024 Jenna Butler, Mihaela Vorvoreanu, Rebecca Janssen, Abigail Sellen, Nicole Immorlica, Adam Troy, Advait Sarkar, Alex Farach, Alex Chouldechova, Alexandra Olteanu, Alexia Cambon, Arjun Radhakrishna, Asta Roseway, Ben Zorn, Brent Hecht, Daniel G. Goldstein, Dhruv Joshi, Ed Cutrell, Emre Kiciman, Gonzalo Ramos, Gustavo Soares, Hanna Wallach, Ian Drosos, Jack Williams (johnwilliams), Jacki O'Neill, Jake Hofman, Jaime Teevan, Javier Hernandez, Jennifer Wortman Vaughan, Jina Suh, John Tang, Justin Edwards, Kalika Bali, Kori Inkpen, Krishna Madhavan, Laylah Bulman, Leon Reicherts, Lev Tankelevitch, Longqi Yang, Martez Mott, Millicent Ochieng, Mercy Muchai, Nancy Baym, Najeeb Abdulhamid, Nicolai Marquardt, Ken Hinckley, Michael Bentley, Dave Brown, Hugo Romat, Nathalie Henry Riche, Samuel Maina, Shamsi Iqbal, Siân Lindley, Stephanie Nyairo, Su Lin Blodgett, Sumit Gulwani, Sunayana Sitaram, Vu Le MSR-TR-2024-56 | December 2024 Published by Microsoft Project Project
Publication Gaps Between Research and Practice When Measuring Representational Harms Caused by LLM-Based Systems Emma Harvey, Emily Sheng, Su Lin Blodgett, Alex Chouldechova, Jean Garcia-Gathright, Alexandra Olteanu, Hanna Wallach November 2024
Publication Dimensions of Generative AI Evaluation Design Alex Dow, Jennifer Wortman Vaughan, Solon Barocas, Chad Atalla, Alex Chouldechova, Hanna Wallach November 2024
Publication Evaluating Generative AI Systems is a Social Science Measurement Challenge Hanna Wallach, Meera Desai, Nick Pangakis, A. Feder Cooper, Angelina Wang, Solon Barocas, Alex Chouldechova, Chad Atalla, Su Lin Blodgett, Emily Corvi, Alex Dow, Jean Garcia-Gathright, Alexandra Olteanu, Stefanie Reed, Emily Sheng, Dan Vann, Jennifer Wortman Vaughan, Matthew Vogel, Hannah Washington, Abigail Z. Jacobs November 2024
Publication (De)Noise: Moderating the Inconsistency Between Human Decision-Makers Nina Grgić-Hlača, Junaid Ali, Krishna P. Gummadi, Jennifer Wortman Vaughan ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW 2024) | November 2024
Publication ASL STEM Wiki: Dataset and Benchmark for Interpreting STEM Articles Kayo Yin, Chinmay Singh, Fyodor O. Minakov, Vanessa Milan, Hal Daumé, Cyril Zhang, Alex Lu, Danielle Bragg Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing | November 2024 Project
Publication Easy2Hard-Bench: Standardized Difficulty Labels for Profiling LLM Performance and Generalization Mucong Ding, Chenghao Deng, Jocelyn Choo, Zichu Wu, Aakriti Agrawal, Avi Schwarzschild, Tianyi Zhou, Tom Goldstein, John Langford, A. Anandkumar, Furong Huang NeurIPS 2024 | September 2024
Publication Understanding the Impacts of Language Technologies’ Performance Disparities on African American Language Speakers Jay Cunningham, Su Lin Blodgett, Hal Daumé III, Christina Harrington, Hanna Wallach, Michael Madaio Findings of the Association for Computational Linguistics: ACL 2024 | August 2024