微软研究院博客

Multimodal reinforcement learning with agentic verifier for AI agents 

2026年1月20日
Argos improves multimodal RL by evaluating wh…

最新文章

Explore More

  • Events & conferences

    Events & conferences 

    Meet our community of researchers, learn about exciting research topics, and grow your network

  • Podcasts

    Podcasts 

    Ongoing conversations at the cutting edge of research

  • Microsoft Research Forum

    Microsoft Research Forum 

    Join us for a continuous exchange of ideas about research in the era of general AI