About
I am a Principal Researcher on the Efficient AI team within M365 Research, where I focus on systems for AI and on realizing a full-stack vision for global-scale AI inference. My work bridges research and production systems, advancing Microsoft’s AI inference infrastructure toward greater efficiency and scalability, lower costs, and predictable latency.