Evaluating the Cultural Relevance of AI Models and Products: Learnings on Maternal Health ASR, Data Augmentation and User Testing Methods
- Oche Ankeli, YUX; Ertony Bashil, YUX; Dhananjay Balakrishnan, YUX; Serigne Fall, YUX; Oluchi Audu, YUX; Melissah Weya, YUX; Yann Le Beux, YUX
How do we ensure that AI systems are not only technologically advanced but also deeply resonant and beneficial for the specific communities they aim to serve? As AI becomes more integrated into critical sectors like healthcare, this question has never been more important.
This session by YUX explores that question through key learnings on evaluating the cultural relevance of AI models and products. Drawing from their work fine-tuning an ASR model in Wolof for maternal health, YUX shares insights on social norms, evaluation datasets, and user testing methods for assessing AI systems. They also introduce LOOKA, a platform designed to scale human evaluation in AI development.
-
-
Oche Ankeli
AI Engineer
YUX
-
Ertony Bashil
AI Engineer
YUX
-
Dhananjay Balakrishnan
Researcher
YUX
-
Serigne Fall
Lead Looka
YUX
-
Oluchi Audu
Senior Design Researcher
YUX
-
Melissah Weya
Design Researcher
YUX
-
Yann Le Beux
Co-founder & AI Lead
YUX
-
-
-
接下来观看
-
-
-
-
-
-
-
-
-
MEGA: Multi-lingual Evaluation of Generative AI
- Kabir Ahuja,
- Millicent Ochieng
-
Behind the label: Glimpses of data labelling labours for AI
- Srravya Chandhiramowuli