Loading...
Orca-2 blog hero | abstract waves of data
Microsoft Research Blog

Orca 2: Teaching Small Language Models How to Reason 

November 20, 2023 | Ahmed Awadallah, Andres Codas, Luciano Del Corro, Hamed Khanpour, Shweti Mahajan, Arindam Mitra, Hamid Palangi, Corby Rosset, Clarisse Simoes Ribeiro, and Guoqing Zheng

At Microsoft, we’re expanding AI capabilities by training small language models to achieve the kind of enhanced reasoning and comprehension typically found only in much larger models.

Illustrated figure of lifelong model editing with GRACE. On the left is a question and the model’s existing answer to it (which is incorrect). Editing method needs to update it the correct answer. In the middle the architecture is shown where the language model is frozen and embeddings are extracted to retrieve appropriate values (new embeddings) from the codebook. On the right the codebook is shown which includes a set of trainable embeddings.
Microsoft Research Blog

Lifelong model editing in large language models: Balancing low-cost targeted edits and catastrophic forgetting 

November 20, 2023 | Tom Hartvigsen and Hamid Palangi

Lifelong model editing fixes mistakes discovered after model deployment. This work could expand sequential editing to model properties like fairness and privacy and enable a new class of solutions for adapting LLMs over long deployment lifetimes.

Microsoft Research Podcast - Abstracts hero with a microphone icon
Microsoft Research Podcast

Abstracts: November 20, 2023 

November 20, 2023 | Gretchen Huizinga and Shrey Jain

Today I'm talking to Shrey Jain, an applied scientist at Microsoft Research, and Dr. Zoë Hitzig, a junior fellow at the Harvard Society of Fellows.

In the news | VentureBeat

Microsoft releases Orca 2, a pair of small language models that outperform larger counterparts 

November 20, 2023

Even as the world bears witness to the power struggle and mass resignation at OpenAI, Microsoft, the long-time backer of the AI major, is not slowing down its own AI efforts. Today, the research arm of the Satya Nadella-led company…

Articles

铸星闪耀 | 张扶桑:开展以人为本的研究,结识惺惺相惜的伙伴 

November 19, 2023

编者按:微软亚洲研究院“铸星计划”向全球杰出的青年学者发出邀请,提供在微软亚洲研究院进行为期三个月研究访问的机会。无论是与领域内顶尖研究员合作的机会,还是丰富的数据集和强大的支持资源,抑或是产业界独有的实际应用场景,都吸引着青年才俊们来到微软亚洲研究院探索领域内的前沿新知。 本文讲述了 2022 年度“铸星计划”访问学者、中国科学院软件研究所副研究员张扶桑的“铸星”故事。人在哪里,场景在哪里,无线...

Articles

铸星闪耀 | 郑伟龙:通过脑电波,我想看到抑郁症患者眼中的世界 

November 17, 2023

编者按:微软亚洲研究院“铸星计划”向全球杰出的青年学者发出邀请,提供在微软亚洲研究院进行为期三个月研究访问的机会。无论是与领域内顶尖研究员合作的机会,还是丰富的数据集和强大的支持资源,抑或是产业界独有的实际应用场景,都吸引着青年才俊们来到微软亚洲研究院探索领域内的前沿新知。 本文讲述了 2022 年度“铸星计划”访问学者、上海交通大学副教授郑伟龙的“铸星”故事。如果能知道抑郁症患者眼中的世界与普通...

Skeleton of Thought blog hero - flow diagram
Microsoft Research Blog

Skeleton-of-Thought: Parallel decoding speeds up and improves LLM output 

November 17, 2023 | Xuefei Ning and Zinan Lin

This research was accepted by the 2024 International Conference on Learning Representations. Large language models (LLMs) such as LLaMA and OpenAI’s GPT-4 are revolutionizing technology. However, one of the common complaints about LLMs is their speed, or lack thereof. In…

MSR Podcast "What's your story" | Desney Tan
Microsoft Research Podcast

What’s Your Story: Desney Tan 

November 16, 2023 | Johannes Gehrke and Desney Tan

From service in the Singapore Armed Forces to autonomous navigation with NASA & VR with Disney, Desney Tan’s life journey hasn’t been linear. Learn how Tan landed at Microsoft & about the purpose guiding his work in the podcast series…

In the news | Microsoft on the Issues

Accelerating Sustainability with AI: A Playbook 

November 16, 2023

Given the urgency of the planetary crisis, society needs to push harder on the AI accelerator while establishing guardrails that steer the world safely, securely, and equitably toward net-zero emissions, climate resilience, and a nature-positive future. This year the world…

  • Previous
  • 1
  • …
  • 93
  • 94
  • 95
  • 96
  • 97
  • …
  • 569
  • Next