Publication TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Ronen Eldan, Yuanzhi Li May 2023 Project
Publication Automatic Prompt Optimization with “Gradient Descent” and Beam Search Reid Pryzant, Dan Iter, Jerry Li, Yin Tat Lee, Chenguang Zhu, Michael Zeng May 2023
Publication Logical Transformers: Infusing Logical Structures into Pre-Trained Language Models Borui Wang, Qiuyuan Huang, Budhaditya Deb, Aaron L Halfaker, Liqun Shao, Daniel McDuff, Ahmed Awadallah, Dragomir Radev, Jianfeng Gao Proceedings of ACL 2023 | May 2023 Project Project
Publication AutoMoE: Neural Architecture Search for Efficient Sparsely Activated Transformers Ganesh Jawahar, Subhabrata (Subho) Mukherjee, Xiaodong Liu, Young Jin Kim, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan, Ahmed Awadallah, Sébastien Bubeck, Jianfeng Gao ACL | May 2023 Project
Publication Relational Attention: Generalizing Transformers for Graph-Structured Tasks Cameron Diao, Ricky Loynd ICLR 2023 | May 2023 Spotlight
Publication Benchmarking Spatial Relationships in Text-to-Image Generation Tejas Gokhale, Hamid Palangi, Besmira Nushi, Vibhav Vineet, Eric Horvitz, Ece Kamar, Chitta Baral, Yezhou Yang MSR-TR-2023-44 | May 2023 Published by Microsoft Github
Publication The AI Revolution in Medicine: GPT-4 and Beyond Peter Lee, Carey Goldberg, Isaac S. Kohane, Sébastien Bubeck Published by Pearson Education | April 2023 ISBN: 9780138200138 Project
Publication Derivative Based Nonbacktracking Real-World Regex Matching with Backtracking Semantics Dan Moseley, Mario Nishio, Jose Perez Rodriguez, Olli Saarikivi, Stephen Toub, Margus Veanes, Tiki Wan, Eric Xu MSR-TR-2023-15 | April 2023 Published by Microsoft Extended version of paper that appears in PLDI 2023. Project
Publication A Large-scale Robustness Analysis of Video Action Recognition Models Madeline Chantry Schiappa, Naman Biyani, Prudvi Kamtam, Shruti Vyas, Hamid Palangi, Vibhav Vineet, Yogesh Rawat April 2023
Publication What do Compressed Large Language Models Forget? Robustness Challenges in Model Compression Mengnan Du, Subhabrata (Subho) Mukherjee, Yu Cheng, Milad Shokouhi, Xia Hu, Ahmed Awadallah EACL | April 2023 Project Project Project