Publication Survival Instinct in Offline Reinforcement Learning and Implicit Human Bias in Data Anqi Li, Dipendra Misra, Andrey Kolobov, Ching-An Cheng ICML 2023 – Interactive Learning with Implicit Human Feedback Workshop | June 2023 ORAL
Publication Orca: Progressive Learning from Complex Explanation Traces of GPT-4 Subhabrata (Subho) Mukherjee, Arindam Mitra, Ganesh Jawahar, Sahaj Agarwal, Hamid Palangi, Ahmed Awadallah arXiv: Computation and Language | June 2023 Video Project
Publication Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning Yu Yang, Besmira Nushi, Hamid Palangi, Baharan Mirzasoleiman ICML 2023 | June 2023
Publication MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations Anqi Li, Byron Boots, Ching-An Cheng ICML 2023 | June 2023
Publication DSEE: Dually Sparsity-embedded Efficient Tuning of Pre-trained Language Models Xuxi Chen, Tianlong Chen, Yu Cheng, Wei Chen, Zhangyang Wang, Ahmed Awadallah ACL 2023 | May 2023 Project
Publication The probability flow ODE is provably fast Sitan Chen, Sinho Chewi, Holden Lee, Yuanzhi Li, Jianfeng Lu, Adil Salim NeurIPS 2023 | May 2023
Publication TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Ronen Eldan, Yuanzhi Li May 2023 Project
Publication Automatic Prompt Optimization with “Gradient Descent” and Beam Search Reid Pryzant, Dan Iter, Jerry Li, Yin Tat Lee, Chenguang Zhu, Michael Zeng May 2023
Publication Logical Transformers: Infusing Logical Structures into Pre-Trained Language Models Borui Wang, Qiuyuan Huang, Budhaditya Deb, Aaron L Halfaker, Liqun Shao, Daniel McDuff, Ahmed Awadallah, Dragomir Radev, Jianfeng Gao Proceedings of ACL 2023 | May 2023 Project
Publication Relational Attention: Generalizing Transformers for Graph-Structured Tasks Cameron Diao, Ricky Loynd ICLR 2023 | May 2023 Spotlight