Kahani
“Kahani” is a research prototype that allows the user to create stories with visually striking and culturally nuanced images (visuals/scenes) just by describing them in their local languages. This web-based tool will also allow users…
Discover an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community below. These experimental technologies—available through Azure AI Foundry Labs (opens in new tab)—offer a glimpse into the future of AI innovation.
“Kahani” is a research prototype that allows the user to create stories with visually striking and culturally nuanced images (visuals/scenes) just by describing them in their local languages. This web-based tool will also allow users…
This repository contains code to replicate the experiments in “Do Language Models Know When They’re Hallucinating References?”
BridgeData V2-compatible set of robotic manipulation trajectories collected at Microsoft Research. This download contains a set of object manipulation trajectories collected at Microsoft Research on a WidowX-250 robot in a setup and format compatible with…
Rats is a collection of tools to help researchers define and run experiments. It is designed to be a modular and extensible framework currently supporting building and running pipelines, integrating configs and services.
Official code and data repository of “BatteryML: An Open-Source Tool for Machine Learning on Battery Degradation” (ICLR 2024). Please star, watch, and fork BatteryML for the active updates! We appreciate any questions and suggestions! Our…
MOFDiff is a diffusion model for generating coarse-grained MOF structures. This codebase also contains the code for deconstructing/reconstructing the all-atom MOF structures to train MOFDiff and assemble CG structures generated by MOFDiff.
The AI Controller Interface is a system design and implementation that enables customer user code (AI Controllers, implemented as light-weight virtual machines) to tightly, efficiently, and securely integrate with LLM decoding in a cloud service.…
UDOP adopts an encoder-decoder Transformer architecture based on T5 for document AI tasks like document image classification, document parsing and document visual question answering. You can use the model for document image classification, document parsing…
Node Engine is a Python service that executes a computational flow. It is designed for rapid prototyping of services and applications, e.g. used as a chatbot service in a larger system. Each call to the service…