
February Seminar
Beyond Swahili: Designing Inclusive AI for Bantu Languages
Join us on Wednesday February 18 at 3pm EAT for a talk by Alfred Malengo Kondoro
Swahili has become one of the most consistently represented African languages in modern AI benchmarks, spanning machine translation, language modeling, and multilingual evaluation suites, far exceeding the coverage of any other Bantu language. This prominence reflects its scale, standardization, and regional reach, but it also exposes the structural challenges of building AI for Bantu languages, including rich morphology, pervasive code-switching, and highly uneven data availability.
In this talk, Alfred will outline how these factors have shaped Swahili’s development within contemporary AI systems, showing why direct transfer from dominant global languages often fails to capture Bantu linguistic structure. Drawing on work in benchmarking, dataset creation, and cross‑lingual modelling, he will illustrate how Swahili provides a technically viable bridge for Bantu languages in machine translation, representation learning, and multilingual evaluation—an approach less tractable through non‑Bantu pivot languages. The talk shall conclude with a discussion on how Swahili can be used responsibly as a bridge rather than a proxy. This would enable scalable cross-language transfer while avoiding the erasure of linguistic diversity across the Bantu language family.
Past seminars
-
Anaximander: Interactive Orchestration and Evaluation of Geospatial Foundation Models
December 2025 Seminar
-
Building Better Language Models Through Global Understanding
July 2025 Seminar
-
AI for Africa's Future: Innovation, Equity, and Impact
April 2025 Grand Seminar
-
A Fever Dream of Machine Learning Framework Composability
December 2024 Seminar
-
Making Sentence Embeddings Robust to User-Generated Content
May 2024 Seminar
-
AI For All: Embracing Equity for All
April 2024 Seminar
-
Wildlife Conflict Resolution: Boma & Cattle Detection in the Masai Mara using AI
September 2023 Seminar
-
MEGA: Multi-lingual Evaluation of Generative AI
June 2023 Seminar
-
Large Language Models and Low Resource Languages
April 2023 Seminar
-
Behind the label: Glimpses of data labelling labours for AI
February 2023 Seminar
-
Fighting the Global Social Media Infodemic: from Fake News to Harmful Content
January 2023 Seminar
-
Detecting and mitigating bias in voice activated technologies
May 2022 Seminar
-
Overview of AI Research at the VUB Artificial Intelligence Lab
March 2022 Seminar
-
Social Media and Elections in Africa: A blessing or a curse?
February 2022 Seminar