Microsoft Research Blog
Frontiers of multimodal learning: A responsible AI approach
New evaluation methods and a commitment to continual improvement are musts if we’re to build multimodal AI systems that advance human goals. Learn about cutting-edge research into the responsible development and use of multimodal AI…
Publication
Kosmos-2.5: A Multimodal Literate Model
Project
DragNUWA
DragNUWA is a video generation model that utilizes text, images, and trajectory as three essential control factors to facilitate highly controllable video generation. DragNUWA is a video generation model that utilizes text, images, and trajectory…