Sample-Efficient Online Learning in LM Agents via Hindsight Trajectory Rewriting
Michael Y. Hu, Ben Van Durme, Jacob Andreas, Harsh Jhamtani
October 2025
Harsh Jhamtani, Jacob Andreas, Ben Van Durme
ACL | July 2025
Yifei Xu, Tusher Chakraborty, Srinagesh Sharma, Leonardo Nunes, Emre Kiciman, Songwu Lu, Ranveer Chandra
arXiv | June 2025
Katie Matton, Robert Osazuwa Ness, John Guttag, Emre Kiciman
2025 International Conference on Learning Representations | April 2025
Yongchao Chen, Harsh Jhamtani, Srinagesh Sharma, Chuchu Fan, Chi Wang
ICLR | April 2025
Yifei Xu, Tusher Chakraborty, Emre Kiciman, Bibek Aryal, Eduardo Rodrigues, Srinagesh Sharma, Roberto Estevao, Maria Angels de Luis Balaguer, Jessica Wolk, Rafael Padilha, Leonardo Nunes, Shobana Balakrishna, Songwu Lu, Ranveer Chandra
ICML’25 | February 2025
Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Ben Van Durme, Jason Eisner, Jacob Andreas
EMNLP | November 2024
Margaret Capetz, Swati Sharma, Rafael Padilha, Peder Olsen, Jessica Wolk, Emre Kiciman, Ranveer Chandra
arXiv | November 2024, Vol abs/2411.16872
Maya Petersen, Ahmed Alaa, Emre Kiciman, Chris Holmes, Mark van der Laan
NEJM AI | November 2024
Nathaniel Weir, Ryan Thomas, Randolph D'Amore, Kellie Hill, Ben Van Durme, Harsh Jhamtani
EMNLP | November 2024
Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Ben Van Durme
EMNLP | November 2024
Outstanding Paper Award
Tong Chen, Hao Fang, Patrick Xia, Xiaodong Liu, Ben Van Durme, Luke Zettlemoyer, Jianfeng Gao, Hao Cheng
ICLR 2025 | November 2024
Millicent Li, Tongfei Chen, Ben Van Durme, Patrick Xia
ICLR 2025 | October 2024
Spotlight
Emre Kiciman, Robert Osazuwa Ness, Amit Sharma, Chenhao Tan
Transactions on Machine Learning Research (TMLR) | August 2024
Outstanding Certification Finalist
Selected for presentation at ICLR 2025
Harsh Jhamtani, Hao Fang, Patrick Xia, Eran Levy, Jacob Andreas, Ben Van Durme
IJCAI 2024 | August 2024
Ahmed Alaa, Rachael V. Phillips, Emre Kiciman, Laura B. Balzer, M. V. D. Laan, Maya Petersen
arXiv | July 2024, Vol abs/2407.19118
Nikita Moghe, Patrick Xia, Jacob Andreas, Jason Eisner, Ben Van Durme, Harsh Jhamtani
NAACL | June 2024
Keegan Hines, Gary Lopez, Matthew Hall, Federico Zarfati, Yonatan Zunger, Emre Kiciman
arXiv | March 2024, Vol abs/2403.14720
Jingwei Yi, Yueqi Xie, Bin Benjamin Zhu, Emre Kiciman, Guangzhong Sun, Xing Xie, Fangzhao Wu
arXiv | December 2023, Vol abs/2312.14197
Bruno Silva, Leonardo Nunes, Roberto Estevão, Vijay Aski, Ranveer Chandra
October 2023
Kumar Shridhar, Harsh Jhamtani, Hao Fang, Ben Van Durme, Jason Eisner, Patrick Xia
arXiv: Computation and Language | September 2023
Hao Fang, Anusha Balakrishnan, Harsh Jhamtani, John Bufe, Jean Crawford, Jayant Krishnamurthy, Adam Pauls, Jason Eisner, Jacob Andreas, Dan Klein
Findings of ACL 2023 | July 2023