Event
Computer Vision in the Wild Workshop at CVPR 2026
Full workshop title: The 5th Workshop on Computer Vision in the Wild (CVinW): Towards Unified Multimodal Agents for Reasoning in the Wild Host conference: The Conference on Computer Vision and Pattern Recognition (CVPR) (opens in…
Video
GeoMind: A Multi-Agent Framework for Geospatial Decision Support
Rapid access to actionable geospatial insights is essential during disasters such as floods, wildfires, or earthquakes, where timely decisions can save lives and resources. In many scenarios, especially in low-resource settings or when GIS experts…
Microsoft Research Blog
UniRG: Scaling medical imaging report generation with multimodal reinforcement learning
AI can help generate medical image reports, but today’s models struggle with varying reporting schemes. Learn how UniRG uses reinforcement learning to boost performance of medical vision-language models.