profile picture

Bonaventure F. P. Dossou

PhD Student @ McGill University, Mila Quebec AI Institute

active learning

low-resource

transfer learning

bert

crf

pos

part-of-speech

data quality

audit

multilingual data

web-mined data

language id

africanlp

afrolm

5

presentations

12

number of views

SHORT BIO

I am a Computer Science Ph.D. student at McGill University & Mila, in the NLP group specializing in Natural Language Processing (NLP) applied to low-resource languages (specifically African) and Healthcare. I hold a Bachelor of Science with honors in Mathematics, from Kazan Federal University, Russia, and a Master of Science with honors in Computer Science and Data Engineering from Jacobs University Bremen, Germany.

My interests are in Natural Language Processing (Machine Translation, Large Language Modeling, Speech Recognition, Information Retrieval) for low-resourced languages and Machine Learning for Healthcare (Drug Discovery, small molecule generations, gene therapy). I am the creator of many Afro-centric NLP systems like the FFRTranslate, AfroLM, and Okwugbe ASR Python library.

Before my PhD, I was a research intern at the Mila Quebec AI Institute, working on Drug Discovery projects using Deep Learning (and Generative Flow Networks aka GFlowNets) under the supervision of Yoshua Bengio and Dianbo Lui. I was also an NLP Researcher at Google Research, an NLP Data Scientist at Roche Canada, and a Research Scientist at ModelisLabs, working on Health/Pharma-related challenges.

Presentations

MasakhaPOS: Part-of-Speech Tagging for Typologically Diverse African languages

David Ifeoluwa Adelani and 15 other authors

AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages

Bonaventure F. P. Dossou

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation

David Ifeoluwa Adelani and 35 other authors

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

Julia Kreutzer and 51 other authors

MMTAfrica: Multilingual Machine Translation for African Languages

Chris Chinenye Emezue and 1 other author

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved