Publication SureMap: Simultaneous mean estimation for single-task and multi-task disaggregated evaluation Lester Mackey, Miro Dudík, Alex Chouldechova October 2024
Publication The Power of Resets in Online Reinforcement Learning Zakaria Mhammedi, Dylan Foster, Alexander Rakhlin NeurIPS 2024 | October 2024
Publication ReXTime: A Benchmark Suite for Reasoning-Across-Time in Videos Jr-Jen Chen, Yu-Chien Liao, Hsi-Che Lin, Yu-Chu Yu, Yen-Chun Chen, Yu-Chiang Frank Wang 2024 Neural Information Processing Systems | October 2024
Publication RedCode: Risky Code Execution and Generation Benchmark for Code Agents Chengquan Guo, Xun Liu, Chulin Xie, Andy Zhou, Yi Zeng, Zinan Lin, Dawn Song, Bo Li NeurIPS 2024 | October 2024
Publication MedImageInsight: An Open-Source Embedding Model for General Domain Medical Imaging Noel Codella, Yu Gu, Shrey Jain, Ho Hin Lee, Asma Ben Abacha, Alberto Santamaria-Pang, Will Guyman, Natieek Sangani, Sheng Zhang, Hoifung Poon, Stephanie Hyland, Shruthi Bannur, Javier Alvarez-Valle, Xue Li, John Garett, Alan McMillan, Gaurav Rajguru, Madhu Maddi, Nilesh Vijayrania, Reehan Bhimai, Nick Mecklenburg, Rupal Jain, Daniel Holstein, Naveen Gaur, Vijay Aski, Jenq-Neng Hwang, Thomas Lin, Ivan Tarapov, Matthew P Lungren, Mu Wei October 2024
Publication AI Should Challenge, Not Obey Advait Sarkar October 2024 CACM Cover Story Video Project Project Project
Publication Modeling health risks using neural network ensembles Brandon M. Smith, Antonio Criminisi, Noam Sorek, Yaar Harari, Neeraj Sood, Steven B. heymsfield Plos One | October 2024
Publication Not All Tokens Are What You Need for Pretraining Yeyun Gong, Xiao Liu, Yelong Shen, Ruochen Xu, Jian Jiao, Nan Duan, Weizhu Chen 2024 Neural Information Processing Systems | October 2024 Best Paper Runner Up
Publication Motion Graph Unleashed: A Novel Approach to Video Prediction Luming Liang, Ilya Zharkov October 2024