在 AI 时代,当文生图早已是秒出“大片”,文生视频也能复刻好莱坞级特效时,3D 生成却仍停留在难以令人满意的阶段——细节模糊、结构失真,缺乏立体感。 当你满怀期待地输入“一个透明的玻璃瓶”,AI 却只给出了一个实心的“泥疙瘩”。当你想要一座椰林摇曳、白沙碧海的海滨小镇,得到的却是橡皮泥捏成的模糊雕塑。你希望生成一棵枝叶轻盈飘逸的枫树,AI 却无法完整还原枝叶的自然形态与立体结构。 以上种种都是当...
Rho-alpha, which translates natural language commands into control signals for robotic systems doing bimanual manipulation tasks, aims to make physical systems more adaptable by using physical sensing modalities like touch and continuous learning from human feedback.
In the news | Association for Computing Machinery
Madanlal was selected by his peers for the development of methods in concurrency verification and testing, and machine learning systems design.
In the news | Microsoft Signal
As AI transforms how we work, live and learn, higher education is more than another player — it needs to lead the way. Higher education must strike a balance as it prepares the next generation for a world being reshaped…
| Reuben Tan, Baolin Peng, Zhengyuan Yang, Oier Mees, and Jianfeng Gao
Argos improves multimodal RL by evaluating whether an agent’s reasoning aligns with what it observes over time. The approach reduces visual hallucinations and produces more reliable, data-efficient agents for real-world applications.
In the news | Ragon Institute
The project represents a significant step forward in understanding immune responses at the tissue level. By generating and analyzing large datasets using artificial intelligence and machine learning, the team will develop tools to measure and predict how immune cells interact…
Imagine an AI assistant that can navigate a computer the same way humans do—clicking buttons, filling out forms, and moving between applications—all by simply interpreting what's on the screen. This vision is becoming a reality through computer use agents—AI systems…
| Xinzhi Zhang, Zeyi Chen, Humishka Hope, Hugo Barbalho, Konstantina Mellou, Marco Molinaro, Janardhan (Jana) Kulkarni, Ishai Menache, and Sirui Li
OptiMind is a small language model that converts business operation challenges, described naturally, into mathematical formulations that optimization software can solve. It reduces formulation time & errors & enables fast, privacy-preserving local use.
《AI Next》是微软亚洲研究院推出的一档利用 AI 技术制作的播客,内容聚焦 AI 前沿技术、科研趋势与社会影响。第一季主要围绕当今智能发展的核心议题,探索前沿趋势。 在《AI Next》第四期中,我们邀请到微软亚洲研究院首席科学家韦福如,从第一性原理出发,与大家探讨当前 AI 发展中最核心、具有争议的前沿问题。为何 Scaling 仍是 AI 的第一性原理,但必须走向“科学规模化”;为什么...