Agent Lightning
The absolute trainer to light up AI agents. We present Agent Lightning, a flexible and extensible framework that enables seamless agent optimization for any existing agent framework.
Below is an index of datasets, SDKs, APIs and open-source tools developed by Microsoft researchers and shared with the global academic community. These experimental technologies, available through Azure AI Foundry Labs, offer a glimpse into the future of AI innovation.
AI Behavioral Science for Anthropomorphic Agents: toward human‑centric, symbiotic, and autonomous AI. This project centers on evaluating and promoting anthropomorphic intelligence—AI agents with a human-like mindset and a degree of awareness, capable of acting proactively…
Source code for the paper: Echoes in AI: Quantifying Lack of Plot Diversity in LLM Outputs
ChatBench Interactive Benchmark Simulator enables automated, realistic evaluation of AI models through simulated user-AI conversations. We release fine-tuned user simulators (model weights) and supporting infrastructure, allowing the community to assess models in interactive, user-in-the-loop scenarios.
This repository contains the code and data for SimulatorArena, a framework that enables: (1) benchmarking AI assistants through multi-turn conversations with user simulators, and (2) evaluating the reliability of user simulators as proxies for human…
This repository contains all scripts for reproducing the results of our paper “Lost in Transmission: When and Why LLMs Fail to Reason Globally”.