{"id":1155379,"date":"2025-12-04T04:13:04","date_gmt":"2025-12-04T12:13:04","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&#038;p=1155379"},"modified":"2025-12-15T03:14:42","modified_gmt":"2025-12-15T11:14:42","slug":"towards-robust-generalization-in-agentic-ai-via-environment-scaling","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/towards-robust-generalization-in-agentic-ai-via-environment-scaling\/","title":{"rendered":"Towards Robust Generalization in Agentic AI via Environment Scaling"},"content":{"rendered":"<section class=\"mb-3 moray-highlight\">\n\t<div class=\"card-img-overlay mx-lg-0\">\n\t\t<div class=\"card-background  has-background- card-background--full-bleed\">\n\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"1920\" height=\"721\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/11\/Towards-Robust-Generalization-in-Agentic-AI-via-Environment-Scaling_Banner-1920x721-1.jpg\" class=\"attachment-full size-full\" alt=\"background pattern\" style=\"\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/11\/Towards-Robust-Generalization-in-Agentic-AI-via-Environment-Scaling_Banner-1920x721-1.jpg 1920w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/11\/Towards-Robust-Generalization-in-Agentic-AI-via-Environment-Scaling_Banner-1920x721-1-300x113.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/11\/Towards-Robust-Generalization-in-Agentic-AI-via-Environment-Scaling_Banner-1920x721-1-1024x385.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/11\/Towards-Robust-Generalization-in-Agentic-AI-via-Environment-Scaling_Banner-1920x721-1-768x288.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/11\/Towards-Robust-Generalization-in-Agentic-AI-via-Environment-Scaling_Banner-1920x721-1-1536x577.jpg 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/11\/Towards-Robust-Generalization-in-Agentic-AI-via-Environment-Scaling_Banner-1920x721-1-1600x600.jpg 1600w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/11\/Towards-Robust-Generalization-in-Agentic-AI-via-Environment-Scaling_Banner-1920x721-1-240x90.jpg 240w\" sizes=\"auto, (max-width: 1920px) 100vw, 1920px\" \/>\t\t<\/div>\n\t\t<!-- Foreground -->\n\t\t<div class=\"card-foreground d-flex mt-md-n5 my-lg-5 px-g px-lg-0\">\n\t\t\t<!-- Container -->\n\t\t\t<div class=\"container d-flex mt-md-n5 my-lg-5 \">\n\t\t\t\t<!-- Card wrapper -->\n\t\t\t\t<div class=\"w-100 w-lg-col-5\">\n\t\t\t\t\t<!-- Card -->\n\t\t\t\t\t<div class=\"card material-md-card py-5 px-md-5\">\n\t\t\t\t\t\t<div class=\"card-body \">\n\t\t\t\t\t\t\t\n\t\t\t\t\t\t\t\n\n<h1 class=\"wp-block-heading\" id=\"towards-robust-generalization-in-agentic-ai-via-environment-scaling\">Towards Robust Generalization in Agentic AI via Environment Scaling<\/h1>\n\n\n\n<p><\/p>\n\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/section>\n\n\n\n\n\n<p>This project addresses the challenge of enabling AI agents to operate effectively in complex, realistic environments such as web navigation, computer use, and mobile interfaces. While current models excel in structured domains like mathematics and coding, they struggle with tasks requiring long-horizon planning, exploration, and error recovery. The research focuses on building a scalable infrastructure capable of hosting hundreds of virtual environments and advancing reinforcement learning with world-model techniques to improve agents\u2019 understanding and decision-making. The outcome will include an extensible training platform, enhanced RL frameworks, and AI agents that outperform state-of-the-art models on benchmarks for browser, computer, and phone use\u2014laying the foundation for robust, general-purpose agentic AI.<\/p>\n\n\n\n<p>This research is conducted via\u00a0<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/academic-program\/agentic-ai-research-and-innovation\/\">The Agentic AI Research and Innovation\u00a0<\/a>(AARI) Initiative which focuses on the next frontier of agentic systems through\u00a0<em>Grand Challenges<\/em>\u00a0with the academic community and Microsoft Research.<\/p>\n\n\n","protected":false},"excerpt":{"rendered":"<p>This project addresses the challenge of enabling AI agents to operate effectively in complex, realistic environments such as web navigation, computer use, and mobile interfaces. While current models excel in structured domains like mathematics and coding, they struggle with tasks requiring long-horizon planning, exploration, and error recovery. The research focuses on building a scalable infrastructure [&hellip;]<\/p>\n","protected":false},"featured_media":1155698,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"research-area":[13556],"msr-locale":[268875],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-1155379","msr-project","type-msr-project","status-publish","has-post-thumbnail","hentry","msr-research-area-artificial-intelligence","msr-locale-en_us","msr-archive-status-active"],"msr_project_start":"","related-publications":[],"related-downloads":[],"related-videos":[],"related-groups":[],"related-events":[],"related-opportunities":[],"related-posts":[],"related-articles":[],"tab-content":[],"slides":[],"related-researchers":[{"type":"user_nicename","display_name":"Baolin Peng","user_id":43779,"people_section":"Section name 0","alias":"baolinpeng"},{"type":"guest","display_name":"Zhou  Yu","user_id":1157408,"people_section":"Section name 0","alias":""},{"type":"guest","display_name":"Xiao Yu","user_id":1157409,"people_section":"Section name 0","alias":""}],"msr_research_lab":[],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/1155379","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-project"}],"version-history":[{"count":5,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/1155379\/revisions"}],"predecessor-version":[{"id":1158521,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/1155379\/revisions\/1158521"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/1155698"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1155379"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=1155379"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1155379"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=1155379"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=1155379"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}