Microsoft Research Blog

  1. m3: Accurate Flow-Level Performance Estimation using Machine Learning 

    August 1, 2024

    Data center network operators often need accurate estimates of aggregate network performance, such as the frequency of poor tail latency events, to guide network configuration -- when and where to add capacity as a function of increased load, which network congestion control algorithm to use…

  2. ECBD: Evidence-Centered Benchmark Design for NLP 

    August 1, 2024

    Benchmarking is seen as critical to assessing progress in NLP. However, creating a benchmark involves many design decisions (e.g., which datasets to include, which metrics to use) that often rely on tacit, untested assumptions about what the benchmark is intended to measure or is actually…

  3. Everything of Thoughts: Defying the Law of Penrose Triangle for Thought Generation 

    August 1, 2024

    Recent advancements in Large Language Models (LLMs) have revolutionized decision-making by breaking down complex problems into more manageable language sequences referred to as “thoughts”. An effective thought design should consider three key perspectives: performance, efficiency, and flexibility. However, existing thought designs can at most exhibit two…

  4. Let’s Fix this Together: Conversational Debugging with GitHub Copilot 

    August 1, 2024

    Despite advancements in IDE tooling, code understanding, generation, and automated repair, debugging continues to present significant challenges. Existing debugging strategies available to developers in literature are often too mechanical and rigid for day-to-day issues. Recent advances in Large Language Models (LLMs) promise practical solutions that…

  5. AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation 

    August 1, 2024

    We present AutoGen, an open-source framework that allows developers to build LLM applications by composing multiple agents to converse with each other to accomplish tasks. AutoGen agents are customizable, conversable, and can operate in various modes that employ combinations of LLMs, human inputs, and tools.…

  6. Efficient Policy-Rich Rate Enforcement with Phantom Queues 

    August 1, 2024

    Rate enforcement is routinely employed in modern networks (e.g., ISPs rate-limiting users' traffic to the subscribed rates). In addition to correctly enforcing the desired rates, rate-limiting mechanisms must be able to support rich rate-sharing policies within each traffic aggregate (e.g., per-flow fairness, weighted fairness, and…