The goal of our research is to make AI systems more efficient, reliable, and secure across all software and hardware layers. Our current projects span agentic AI operations, intelligent model serving, hardware introspection, and hardware security. We publish in top academic conferences including ISCA, SOSP, SIGCOMM, NSDI, and USENIX Security. Our work has been productized across several Microsoft groups (Azure Core, Azure Networking, Azure Hardware, Windows, Office, AI Core) and has been adopted by industry (JEDEC).
Together, our work reflects our group’s vision:
co-designing novel and intelligent software systems and hardware-aware techniques to improve the efficiency, reliability, scalability, and trustworthiness of next-generation AI systems.
Our group is working on three main projects: LLexus, Switchcraft, and Project Sapphire.
Recent highlights:
- State-of-the-art Rowhammer defense built in silicon for Azure Cobalt 200.
- Switchcraft model router shipped with Microsoft Foundry.
- LLexus is powering incident response for several internal groups, including Azure PhyNet, Office for the Web, and Azure EngOps.
Coming soon: Hiring full-time researchers and summer interns
Stay tuned for upcoming opportunities to join our research team.