Agent-Pex: Automated Evaluation and Testing of AI Agents
Automated evaluation and testing of AI agents AI agents are rapidly transforming software, with projections of over a billion agents in operation by 2028. These agents, embedded in products like VS Code and M365 Copilot,…
Senior Researcher – AI Systems – Microsoft Research
Microsoft Research seeks a Senior Researcher specializing in one or more areas of Artificial Intelligence (AI) infrastructure, Machine Learning (ML) systems, and high-performance computing (HPC) systems. The position involves working collaboratively to advance state-of-the art…
Research Intern – System Modeling for Medical Imaging
Medical imaging instruments such as MRI scanners are complex dynamic systems whose non-ideal electrical and physical behavior significantly impacts image quality. This Research Internship focuses on developing models of MRI system behavior through calibration experiments,…
GridFM
Small foundation models for the electric grid GridFM is a Microsoft Research initiative to build a foundation model (FM) for electric power grids, applying modern AI methods—similar to large language/weather models—to complex grid physics. Traditional…
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
Geospatial Foundation Models – deep learning architectures trained on large-scale Earth observation and remote sensing data – have proved useful for object detection, semantic segmentation, and other prediction tasks on geospatial and geographic data. However,…
Senior Research Engineer Machine Learning, AI for Science
At Microsoft Research AI for Science, we believe deep learning has the potential to transform scientific modelling and discovery crucial for solving the most pressing problems facing society, including sustainable materials and discovery of new…
Member of Technical Staff – Data Scientist
We’re looking for data scientists to help build the next generation of post-training methods for frontier models at Microsoft AI. You’ll join a small, high-impact team working across all stages of post-training, with a focus on evaluation design,…