AsgardBench: A benchmark for visually grounded interactive planning
Imagine a robot tasked with cleaning a kitche…
Imagine a robot tasked with cleaning a kitche…
Vision-language models (VLMs) use images and …
As AI agents transition from simple chatbots …
It seems counterintuitive: giving AI agents m…
We are pleased to announce Phi-4-reasoning-vi…
As synthetic media grows, verifying what’s re…
Project Silica introduces new techniques for …
This research looks at why Predictive Inverse…
Microsoft Research unveils Paza, a human-cent…
AI can help generate medical image reports, b…
Argos improves multimodal RL by evaluating wh…
Meet our community of researchers, learn about exciting research topics, and grow your network
Ongoing conversations at the cutting edge of research
Join us for a continuous exchange of ideas about research in the era of general AI