Microsoft Research India – The evolution
Learn about Microsoft Research India’s journey from its inception to becoming a leading computer science research center in India.
MindJourney: Test-Time Scaling with World Models for Spatial Reasoning
The video introduces MindJourney, a framework that enhances Vision-Language Models (VLMs), which excel at interpreting single images but struggle to infer the underlying three-dimensional world. By allowing the VLM to “imagine” moving through the scene…
MindJourney enables AI to explore simulated 3D worlds to improve spatial interpretation
MindJourney could let AI navigate and interpret 3D environments from limited visual input, which may improve performance in planning, navigation, and safety-critical tasks.
MindJourney
MindJourney is a framework that equips AI agents with a “simulation loop” to explore hypothetical 3D viewpoints before answering spatial reasoning questions—tackling a core limitation of vision-language models (VLMs), which recognize objects well in 2D…
TRELLIS
TRELLIS is a large 3D asset generation model that creates high-quality 3D assets from simple text or image inputs. Using a unified latent space (SLAT), it delivers detailed, textured 3D models in formats like meshes,…