Giorgio Severi posts

Senior AI Safety Researcher

Giorgio is a Senior AI Safety Researcher at Microsoft, working on the AI Red Team to assess the security and safety of large, multimodal, and agentic AI systems. His work spans multiple areas of adversarial machine learning, with a particular focus on risks related to poisoning and long-term memory.

Research

February 9

3 min read

A one-prompt attack that breaks LLM safety alignment

As LLMs and diffusion models power more applications, their safety alignment becomes critical.

Research

February 4

7 min read

Detecting backdoored language models at scale

We’re releasing new research on detecting backdoors in open-weight language models and highlighting a practical scanner designed to detect backdoored models at scale and improve overall trust in AI systems.
