
Kathleen Siminyu
Fellow @ Mozilla Foundation
corpus development
language dialect
language variants
historical change
1
presentations
SHORT BIO
Kathleen Siminyu is an AI Researcher who has focused on Natural Language Processing for African Languages. She works at Mozilla Foundation as a Machine Learning Fellow to support the development of a Kiswahili Common Voice dataset and to build speech recognition models for end use cases in the agricultural and financial domains. In her NLP research, Kathleen has previously worked on speech transcription for Luhya languages and contributed to machine translation for Kenyan languages as part of Masakhane. Before joining Mozilla, Kathleen was Regional Coordinator of AI4D Africa, where she worked with ML and AI communities in Africa to run various programs. She has vast experience as a community organiser having co-organised the Nairobi Women in Machine Learning and Data Science community for three years and continues to organise as part of the committees of the Deep Learning Indaba and the Masakhane Research Foundation.
Presentations

Corpus Development of Kiswahili Speech Recognition Test and Evaluation sets, Preemptively Mitigating Demographic Bias Through Collaboration with Linguists
Kathleen Siminyu