About
I am a Principal Researcher at Microsoft Research. My recent research focuses on large-scale natural language processing and multimodal learning, which includes:
- LLM distillation and adaptation [1 (opens in new tab), 2 (opens in new tab), 3 (opens in new tab), 4 (opens in new tab)]
- LLM test-time scaling [5 (opens in new tab), 6 (opens in new tab), 7 (opens in new tab)]
- Building specialized foundation models [8 (opens in new tab), 9 (opens in new tab), 10 (opens in new tab), 11 (opens in new tab), 12 (opens in new tab)]
Personal webpage: https://sheng-z.github.io/ (opens in new tab)