photo of the Redmond lab building entrance

Microsoft Research Lab – Redmond

News & features

AsgardBench | Three white icons on a blue-to-purple gradient background: the first shows a laptop screen with an eye in the upper right corner, the second shows relational nodes, and the third is a security shield with a checkmark.

Microsoft Research Blog

AsgardBench: A benchmark for visually grounded interactive planning

March 26, 2026 | Andrea Tupini, Lars Liden, Reuben Tan, Yu Wang, and Jianfeng Gao

Imagine a robot tasked with cleaning a kitchen. It needs to observe its environment, decide what to do, and adjust when things don’t go as expected, for example, when the mug it was tasked to wash is already clean, or…

V2GP framework | Three white line icons, showing a target within a rounded square, a checklist, and a robotic arm, on a blue‑to‑green gradient background.

Microsoft Research Blog

GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation

March 26, 2026 | Sehun Jung, HyunJee Song, Dong-Hee Kim, Reuben Tan, Jianfeng Gao, Yong Jae Lee, and Donghyun Kim

Vision-language models (VLMs) use images and text to plan robot actions, but they still struggle to decide what actions to take and where to take them. Most systems split these decisions into two steps: a VLM generates a plan in…

Three white line icons, showing a network, a workflow, and bug analysis, on a blue‑to‑purple gradient background.

Microsoft Research Blog

Systematic debugging for AI agents: Introducing the AgentRx framework

March 12, 2026 | Shraddha Barke, Arnav Goyal, Alind Khare, and Chetan Bansal

As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-step API workflows, a new challenge has emerged: transparency. When a human makes a mistake, we can usually trace…

In the news | National Academy of Engineering

Doug Burger elected to National Academy of Engineering

February 10, 2026

Academy membership honors individuals who have made outstanding contributions to engineering research, practice, or education. Burger was elected for accelerating cloud-scale computing and networking infrastructures with field-programmable systems.

Smart Replay - flowchart diagram showing the flow between Encoder, State Predictor, and Policy

Microsoft Research Blog

Rethinking imitation learning with Predictive Inverse Dynamics Models

February 5, 2026 | Pallavi Choudhury, Lukas Schäfer, Chris Lovett, Katja Hofmann, and Sergio Valcarcel Macua

This research looks at why Predictive Inverse Dynamics Models often outperform standard Behavior Cloning in imitation learning. By using simple predictions of what happens next, PIDMs reduce ambiguity and learn from far fewer demonstrations.

Three white icons on a blue‑green gradient: a ribcage scan, a circuit‑style document, and a neural network diagram

Microsoft Research Blog

UniRG: Scaling medical imaging report generation with multimodal reinforcement learning

January 27, 2026 | Sheng Zhang, Flora Liu, Guanghui Qin, Mu Wei, and Hoifung Poon

AI can help generate medical image reports, but today’s models struggle with varying reporting schemes. Learn how UniRG uses reinforcement learning to boost performance of medical vision-language models.

In the news | Association for Computing Machinery

Madanlal Musuvathi named ACM Fellow

January 21, 2026

Madanlal was selected by his peers for the development of methods in concurrency verification and testing, and machine learning systems design.

Diagram showing visual, audio, and document icons feeding into a central network icon of connected people, which then leads to a checkmark symbol, all on a blue‑to‑purple gradient background.

Microsoft Research Blog

Argos: Multimodal reinforcement learning with agentic verifier for AI agents

January 20, 2026 | Reuben Tan, Baolin Peng, Zhengyuan Yang, Oier Mees, and Jianfeng Gao

Argos improves multimodal RL by evaluating whether an agent’s reasoning aligns with what it observes over time. The approach reduces visual hallucinations and produces more reliable, data-efficient agents for real-world applications.

A flowchart with three horizontal sections on a blue-to-green gradient background. The first section, labeled “Classification,” shows icons of a computer, an arrow pointing to a robot face, and another arrow pointing to a box labeled “TSP.” The second section, labeled “Inference,” displays a robot icon connected by arrows to two document icons, one of which includes a magnifying glass. The third section, labeled “Test-time scaling,” shows a document with a checkmark connected by an arrow to a circular refresh icon. Arrows indicate the flow between sections, starting from Classification to Inference and then to Test-time scaling.

Microsoft Research Blog

OptiMind: A small language model with optimization expertise

January 15, 2026 | Xinzhi Zhang, Zeyi Chen, Humishka Hope, Hugo Barbalho, Konstantina Mellou, Marco Molinaro, Janardhan (Jana) Kulkarni, Ishai Menache, and Sirui Li

OptiMind is a small language model that converts business operation challenges, described in natural language, into mathematical formulations that optimization software can solve. It reduces formulation time and errors, and it enables fast, privacy-preserving local use.