Provenance for Trustworthy Digital Media
Cryptographically securing the authenticity of digital assets As generative AI accelerates the creation of realistic synthetic media (e.g., photos, videos, audio, documents), it is increasingly difficult for people and platforms to distinguish authentic content from…
Principal Applied Science Manager
Microsoft Bing’s RAI Defensives team is focused on keeping Bing safe for our customers by detecting queries and content that require due diligence for the best user experience. All this using the state-of-the-art deep learnt…
SABER: Scaling-Aware Best-of-N Estimation of Risk
Scaling-Aware Best-of-N Estimation of Risk A Python package for predicting large-scale adversarial risk in Large Language Models under Best-of-N sampling. Paper: https://arxiv.org/pdf/2601.22636 (opens in new tab) Standard LLM safety evaluations use single-shot (ASR@1) metrics,…