RadEdit model
RadEdit is a latent diffusion model trained to generate and edit chest X-rays from medical reports. It is described in detail in RadEdit: Stress-Testing Biomedical Vision Models via Diffusion Image Editing (F. Pérez-García, S. Bond-Taylor, et…
Advancing Reasoning Capabilities in Agentic AI Systems
This project aims to push the boundaries of agentic AI by addressing three critical challenges: long-horizon memory, safe and aligned tool usage, and adaptive reasoning. Current language models excel at text generation but lack agency—the…
Towards Autonomous and Reliable Supply Chains
This project explores how Generative AI can transform supply chain management from rule-based automation to true autonomy. Building on the MIT autonomous supply chain testbed, it integrates multiple AI agents that learn, adapt, and coordinate…
Physics-Guided Vision-Language World Models for Agentic 4D Scene Understanding
This project develops a unified framework for physically grounded world modelling that combines video-based temporal prediction with Gaussian Splatting for photorealistic 3D representation. A Physics Vision-Language Model translates natural-language instructions into transformations that respect physical…
AgentGuard: Early-Warning and Routing for Predictable Agentic AI on Azure
This project introduces AgentGuard, a monitoring and routing system designed to improve reliability and cost-efficiency in Azure-based agent workflows. By analysing early trajectory signals—such as reasoning patterns and tool usage—within the first 10–30% of an…
Towards Robust Generalization in Agentic AI via Environment Scaling
This project addresses the challenge of enabling AI agents to operate effectively in complex, realistic environments such as web navigation, computer use, and mobile interfaces. While current models excel in structured domains like mathematics and…
Visual episodic memory and use in agentic systems
Human intelligence is defined by the interplay of semantic and episodic memory. AI research almost exclusively develops semantic memory systems, but episodic memory has many open challenges, especially for vision. Episodic memory is the ability…