Multimodal reinforcement learning with agentic verifier for AI agents
Argos improves multimodal RL by evaluating whether an agent’s reasoning aligns with what it observes over time. The approach reduces visual hallucinations and produces more reliable, data-efficient agents for real-world applications.
Microsoft Research India 2025 Highlights
Highlights of the research and impact at Microsoft Research India in 2025, along with glimpses of life at the lab.
Research Intern – AI Evaluation and Alignment
Microsoft Research and the Copilot Studio team are seeking Research Interns to help advance the quality, reliability and evaluation of Large Language Model (LLM)-based systems. Research Interns will collaborate with applied scientists and engineers to explore…
Rick Rashid & Founding Microsoft Research
We were delighted to host a fabulous talk from the founder of Microsoft Research, Rick Rashid! Rick shared lessons learned throughout his career, his motivation and inspiration for creating and designing Microsoft Research, as well…
Research Intern – AI/ML for Electricity Infrastructure Planning
Microsoft Research (MSR) is seeking Research Interns to work at the intersection of artificial intelligence (AI) and electricity infrastructure planning. Research Interns will join ongoing research efforts that explore how AI—especially large language models (LLMs)…
OptiMind: A small language model with optimization expertise
OptiMind is a small language model that converts business operation challenges, described in natural language, into mathematical formulations that optimization software can solve. It reduces formulation time and errors, and it enables fast, privacy-preserving local use.
Research Intern – Security, Privacy and AI
This Research Internship will include (1) conducting threat modeling for Large Language Model (LLM)‑enabled agentic designs and identifying practical mitigations for Windows OS; (2) developing or refining formal security frameworks that support scalable designs and…
Research Intern – Foundations of GenAI
The mission of the AI Frontiers Lab is to expand the Pareto frontier of AI capabilities, efficiency, and safety through innovations in foundation models and learning agent platforms. We are seeking a Research Intern to join…
KDD ’25 AI Reasoning Day keynote: Improving AI Reasoning through Intent, Interaction, and Inspection
https://ai-reasoning.github.io/
AI models are increasingly capable of solving sophisticated tasks that require reasoning. But how do we improve the quality of that reasoning, especially when the models operate as black boxes? In this talk, Sumit…
Agent-Pex: Automated Evaluation and Testing of AI Agents
AI agents are rapidly transforming software, with projections of over a billion agents in operation by 2028. These agents, embedded in products like VS Code and M365 Copilot,…