Multi-Object Advertisement Creative Generation
Applied Scientist II
Are you interested in solving large‑scale search problems and building the next generation of Query Understanding solutions powered by LLMs and SLMs? The Bing Orca team is a research‑driven applied science and engineering group building the next…
SafeAgents
A unified framework for building and evaluating safe multi-agent systems. SafeAgents provides a simple, framework-agnostic API for creating multi-agent systems with built-in safety evaluation, attack detection, and support for multiple agentic frameworks (Autogen, LangGraph, OpenAI…
BusyBox
BusyBox is a physical 3D-printable device for benchmarking affordance generalization in robot foundation models. It features… Please check out our website for more details. For fully building an instrumented BusyBox capable…
Principal Applied Scientist – CoreAI
You’re joining Core AI, the team at the forefront of redefining how software is built and experienced. You will be a technical contributor driving the applied science foundation for observability in AI agents and multi-agent…
Member of Technical Staff, Principal Tech Lead Manager, Image Generation
We are hiring a Principal Tech Lead Manager to own and grow Copilot’s image generation capabilities. You will set the technical direction for image generation, lead a team of Applied AI engineers and platform engineers,…
Member of Technical Staff, Senior Applied AI Engineer, Image Generation
We’re hiring a Senior Applied AI Engineer, Image Generation to join a fast‑moving, high‑ownership team building next‑generation AI assistant and productivity capabilities. This role blends LLM product engineering, evaluation science, hill climbing, and internal tool building…
TestExplora
This repository is the official implementation of the paper “TestExplora: Benchmarking LLMs for Proactive Bug Discovery via Repository-Level Test Generation”. It can be used for baseline evaluation using the prompts mentioned in the paper. TestExplora…