工具
Fara-7B
2025年11月
Microsoft’s first agentic small language model specifically designed for computer use. With only 7 billion parameters, Fara-7B achieves state-of-the-art performance within its size class and is competitive with larger, more resource-intensive agentic systems that depend on prompting multiple large models.
Magentic Marketplace
2025年11月
Magentic Marketplace is an open-source simulation environment for exploring the numerous possibilities of agentic markets and their societal implications at scale. It provides a foundation for studying these markets and guiding them toward outcomes that benefit everyone.
MCP Interviewer
2025年10月
MCP Interviewer is a Python CLI tool that helps you catch MCP server issues before your agents do. The MCP Interviewer ensures compliance with provider constraints and offers warnings when recommended guidance is not followed, while also supporting optional functional testing…
Dion: Distributed Orthonormal Updates
2025年9月
Dion is a scalable optimizer that accelerates neural network training by applying orthonormal weight updates using amortized power iteration, which works efficiently on sharded matrices. It reduces communication overhead through low-rank compression and error feedback, offering faster convergence compared to…
Phi-4
2025年6月
Phi-4-multimodal and Phi-4-mini, the newest models in Microsoft’s Phi family of small language models (SLMs) are now available. These models are designed to empower developers with advanced AI capabilities. Phi-4-multimodal, with its ability to process speech, vision, and text simultaneously,…
Magentic-UI
2025年4月
Magentic-UI is a research prototype of an agentic web interface for solving complex web tasks. An Orchestrator coordinates four AutoGen agents—WebSurfer, Coder, FileSurfer, and UserProxy—to handle browsing, coding, file management, and user feedback, etc. It is designed with user-agent collaboration…
Steering LLMs for better instruction following
2025年3月
This repository contains the code for the paper “Improving Instruction-Following in Language Models through Activation Steering,” presented at ICLR 2025.
Skill Slice Insights
2025年1月
This is the official code repository for the paper “Unearthing Skill-level Insights for Understanding Tradeoffs of Foundation Models”. All rationales, localized skills, and skill-slices for the 12 datasets studied in the paper can also be accessed through this repo.
Magentic-One
2024年11月
Magentic-One is a generalist multi-agent system created to address intricate web and file-based tasks. By utilizing an intelligent Orchestrator alongside specialized agents, it facilitates the automation of complex, multi-step activities across various environments.