News & features
MedFuzz: Exploring the robustness of LLMs on medical challenge problems
| Robert Osazuwa Ness
Medfuzz tests LLMs by breaking benchmark assumptions, exposing vulnerabilities to bolster real-world accuracy.
This talk introduces Phi-3-Vision, an advanced and economical open-source multimodal model. As a member of the Phi-3 model family, Phi-3-Vision enhances language models by integrating multisensory skills, seamlessly combining language and vision capabilities.
Research Focus: Week of August 26, 2024
Learn what’s next for AI at Research Forum on Sept. 3; WizardArena simulates human-annotated chatbot games; MInference speeds pre-filling for long-context LLMs via dynamic sparse attention; Reef: Fast succinct non-interactive zero-knowledge regex proofs.
What’s Your Story: Lex Story
| Johannes Gehrke and Lex Story
Model maker and fabricator Lex Story helps bring research to life through prototyping. He discusses his take on failure; the encouragement and advice that has supported his pursuit of art and science; and the sabbatical that might inspire his next…
Research Focus: Week of July 29, 2024
In this issue: Skeleton Posterior-guided OpTimization (SPOT) exhibits potential in various causal discovery tasks; Using visual imagery for an EEG-based brain–computer interface; Developing human-centered AI systems to assist creative professionals.
In the news | The Stack
‘Enormous business potential’: Microsoft on why GraphRAG outperforms naive RAG
Redmond opens up to discuss its new tool, which can extract data from unstructured text using large language models. Microsoft’s GraphRAG is a new approach to Retrieval-Augmented Generation (RAG) that Redmond has described a “significant advance in enhancing the capability…
Tracing the path to self-adapting AI agents
| Ching-An Cheng, Adith Swaminathan, and Allen Nie
Introducing Trace, Microsoft and Stanford University’s novel AI optimization framework, now available as a Python library. Trace adapts dynamically and optimizes a wide range of applications from language models to robot control.
Awards | MistyWest
Tusher Chakraborty named a Misties Top 20 Winner
Tusher Chakraborty was recognized for the contributions to enabling data-driven farming, influencing FCC to adopt regulations on IoT in TV White Spaces, and pioneering research in satellite-based IoT communications. The Misties Awards are for top 20 individual leaders who are…
Abstracts: July 18, 2024
| Gretchen Huizinga and Arindam Mitra
Senior Researcher Arindam Mitra introduces AgentInstruct. Using raw data sources, the automated multi-agent framework can create diverse, high-quality synthetic data at scale for the post-training of small and large language models.