Member of Technical Staff, Senior Applied AI Engineer, Image Generation
We’re hiring a Senior Applied AI Engineer, Image Generation to join a fast‑moving, high‑ownership team building next‑generation AI assistant and productivity capabilities. This role blends LLM product engineering, evaluation science, hillclimbing, and internal tool building…
TestExplora
This repository is the official implementation of the paper “TestExplora: Benchmarking LLMs for Proactive Bug Discovery via Repository-Level Test Generation” It can be used for baseline evaluation using the prompts mentioned in the paper. TestExplora…
Systematic debugging for AI agents: Introducing the AgentRx framework
As AI agents transition from simple chatbots to autonomous systems capable of managing cloud incidents, navigating complex web interfaces, and executing multi-step API workflows, a new challenge has emerged: transparency. When a human makes a…
Senior Data Scientist – Business and Industry Solutions (BIS) team
Do you want to shape the future of the autonomous enterprise and lead the development of intelligent, agent-first experiences that transform how businesses operate? The Business and Industry Solutions (BIS) team is looking for a Senior Applied Scientist…
PlugMem: Transforming raw agent interactions into reusable knowledge
It seems counterintuitive: giving AI agents more memory can make them less effective. As interaction logs accumulate, they grow large, fill with irrelevant content, and become increasingly difficult to use. More memory means that agents must…