In the news | VentureBeat
Microsoft launched a new artificial intelligence model today that achieves remarkable mathematical reasoning capabilities while using far fewer computational resources than its larger competitors. The 14-billion-parameter Phi-4 frequently outperforms much larger models like Google’s Gemini Pro 1.5, marking a significant…
In the news | TechCrunch
Microsoft has revealed the newest addition to its Phi family of generative AI models. Called Phi-4, the model improves in several areas over its predecessors, Microsoft claims, particularly in solving math problems. That’s partly the result of better training data…
In the news | Tech Brew
One of the key questions driving Ece Kamar’s research as managing director of Microsoft’s AI Frontiers Lab is how to coordinate networks of these agents—AI systems that can perform autonomous tasks beyond the scope of chatbots. Late last year, her…
Microsoft Research Asia is pioneering innovations in Media Foundation to advance AI's ability to process real-world media. As one of the key focuses of the 2025 StarTrack Scholars Program, this research aims to provide new insights into multimodal large models.…
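Editor's note: Welcome to the "Research Highlights" (科研上新) column! "Research Highlights" gathers the latest innovations and research news from Microsoft Research Asia. Here you can quickly browse the lab's highlights, stay attuned to frontier fields, and find advanced, practical open-source tools. From December 10 to December 15, NeurIPS, one of the world's most prestigious AI conferences, will be held in Vancouver, Canada. Over three issues of "Research Highlights," we will therefore present a selection of Microsoft Research Asia papers accepted to NeurIPS 2024...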
We’re excited to be a part of #NeurIPS2024! Explore the future of AI with over 100 groundbreaking papers, including oral and spotlight sessions, on reinforcement learning, advanced language model training, and multilingual, culturally inclusive benchmarks.
In the news | Yahoo Finance
According to Ashley Llorens, corporate vice president and managing director at Microsoft Research, AI models will soon be able to handle far more complex tasks, such as triaging customer requests or tracking employee expenses. Additionally, AI will become increasingly energy…
Amber Tingle and Weizhu Chen
Next-token prediction trains a language model on all tokens in a sequence. VP Weizhu Chen discusses his team’s 2024 NeurIPS paper on how distinguishing between useful and “noisy” tokens in pretraining can improve token efficiency and model performance.
Amber Tingle and Dylan Foster
Can existing algorithms designed for simple reinforcement learning problems be used to solve more complex RL problems? Researcher Dylan Foster discusses the modular approach he and his coauthors explored in their 2024 NeurIPS paper on RL under latent dynamics.