Project Gecko

Nouvelles et reportages

Three white line icons on a blue‑to‑purple gradient background: a vertical audio waveform on the left, a globe showing Africa and Europe in the center, and a network on the right.

Blog de recherche Microsoft

Paza: Introducing automatic speech recognition benchmarks and models for low resource languages

février 4, 2026 | Mercy Muchai, Kevin Chege, Nick Mumero, et Stephanie Nyairo

Microsoft Research unveils Paza, a human-centered speech pipeline, and PazaBench, the first leaderboard for low-resource languages. It covers 39 African languages and 52 models and is tested with communities in real settings.

Three white icons on a blue-to-purple gradient background: the first icon shows an image/photo; the second icon depicts a computer monitor with vertical bars; the third icon displays three connected circles with user silhouettes.

Blog de recherche Microsoft

MMCTAgent: Enabling multimodal reasoning over large video and image collections

novembre 12, 2025 | Akshay Nambi, Kavyansh Chourasia, et Tanuja Ganu

MMCTAgent enables dynamic multimodal reasoning with iterative planning and reflection. Built on Microsoft’s AutoGen framework, it integrates language, vision, and temporal understanding for complex tasks like long video and image analysis.

a field of green plants with two people walking in between the rows

Dans l’actualité | Microsoft Research Story

Advancing AI to meet needs of the global majority

November 12, 2025

AI tools can perform poorly in non-Western languages and lack critical cultural context for many populations. Project Gecko uses small language models to bring vital expertise to farmers in underserved areas using local languages and multi-modal content.