News & features
AsgardBench: A benchmark for visually grounded interactive planning
| Andrea Tupini, Lars Liden, Reuben Tan, Yu Wang, and Jianfeng Gao
Imagine a robot tasked with cleaning a k…
GroundedPlanBench: Spatially grounded long-horizon task planning for robot manipulation
| Sehun Jung, HyunJee Song, Dong-Hee Kim, Reuben Tan, Jianfeng Gao, Yong Jae Lee, and Donghyun Kim
Vision-language models (VLMs) use images…
Systematic debugging for AI agents: Introducing the AgentRx framework
| Shraddha Barke, Arnav Goyal, Alind Khare, and Chetan Bansal
As AI agents transition from simple chat…
News | National Academy of Engineering
Doug Burger elected to National Academy of Engineering
Academy membership honors individuals wh…
Rethinking imitation learning with Predictive Inverse Dynamics Models
| Pallavi Choudhury, Lukas Schäfer, Chris Lovett, Katja Hofmann, and Sergio Valcarcel Macua
This research looks at why Predictive In…
UniRG: Scaling medical imaging report generation with multimodal reinforcement learning
| Sheng Zhang, Flora Liu, Guanghui Qin, Mu Wei, and Hoifung Poon
AI can help generate medical image repor…
News | Association for Computing Machinery
Madanlal Musuvathi named ACM Fellow
Madanlal was selected by his peers for t…
Argos: Multimodal reinforcement learning with agentic verifier for AI agents
| Reuben Tan, Baolin Peng, Zhengyuan Yang, Oier Mees, and Jianfeng Gao
Argos improves multimodal RL by evaluati…
OptiMind: A small language model with optimization expertise
| Xinzhi Zhang, Zeyi Chen, Humishka Hope, Hugo Barbalho, Konstantina Mellou, Marco Molinaro, Janardhan (Jana) Kulkarni, Ishai Menache, and Sirui Li
OptiMind is a small language model that …