Orca-AgentInstruct: Agentic flows can be effective synthetic-data generators
Orca-AgentInstruct, from Microsoft Research, can generate diverse, high-quality synthetic data at scale to post-train and fine-tune base LLMs for expanded capabilities, continual learning, and increased performance.
Abstracts: November 14, 2024
The efficient simulation of molecules has the potential to change how the world understands biological systems and designs new drugs and biomaterials. Tong Wang discusses AI2BMD, an AI-based system designed to simulate large biomolecules with…
STAC: Sociotechnical Alignment Center
STAC is a team of researchers, applied scientists, and linguists in Microsoft Research NYC who represent part of the research pillar of Microsoft’s Responsible AI (RAI) investments. Founded within Microsoft Research by Hanna Wallach in…
KBLaM: Knowledge Base augmented Language Model
KBLaM is a new method for augmenting LLMs with external knowledge. Unlike Retrieval-Augmented Generation, KBLAM eliminates external retrieval modules, and unlike in-context learning, its computational overhead scales linearly with KB size rather than quadratically.
Toward modular models: Collaborative AI development enables model accountability and continuous learning
Modular models can democratize AI development while unlocking new benefits and use cases. Modularized AI can be more flexible, more compliant, and cheaper to develop—requiring less data and fewer compute resources to train expert models.
Research Focus: Week of November 11, 2024
Holistic motion-capture calibration technique without calibration, manual intervention or custom hardware; Research on AI agents for autonomous clouds; Automating proof-oriented program construction; One-to-many testing for natural language code generation.
Hearable devices with sound bubbles
Magentic-One
Magentic-One is a generalist multi-agent system created to address intricate web and file-based tasks. By utilizing an intelligent Orchestrator alongside specialized agents, it facilitates the automation of complex, multi-step activities across various environments.