My name is Bonaventure Dossou.
I have a Bachelor of Science in Mathematics from Kazan Federal University, Russia, and a Master of Science in Data Engineering from Jacobs University Bremen, Germany. I am an incoming Ph.D. student at McGill University, Canada, at the Research Center of Intelligent Machines. I will be working specifically in the Probabilistic Vision Group with Prof. Tal Arbel.

My interests are in Deep Learning for Computer Vision, and in NLP with a focus on low-resource languages and healthcare.

I am working on Drug Discovery projects using Deep Learning (and GFlowNets) at Mila Quebec AI Institute, under the supervision of Yoshua Bengio and Dianbo Liu. Previously, I was an NLP Data Scientist Intern at Roche Canada, working on Health/Pharma-related challenges.

In parallel, I am working on NLP technologies, with a focus on low-resourced Sub-Saharan languages, at Masakhane (and previously at Google Research).

I am the co-creator of FFRTranslate and of the Okwugbe ASR (Automatic Speech Recognition for low-resourced languages) Python library.

Read about my inspirational personal story and how I got into research (it also includes a short list of the scientific talks I have given). Here is my most recent CV.

Work and Research Experiences

1. NLP & AI Graduate Student Researcher, Google Research
2. Visiting Student Researcher in Deep Learning for Drug Discovery, Mila Quebec AI Institute
3. NLP Data Scientist, Roche Canada
4. Scientist in Residence - Deep Learning for Chemical Compound Discovery, Modelis
5. Senior Machine Learning Engineer, Omdena
6. African NLP Researcher & Core Member, Masakhane
7. Part-time Senior Data Scientist, Speeqo

Research and Scientific Publications

All publications can be accessed through my Semantic Scholar and Google Scholar pages. Here is a short list:
1. GFlowOut: Dropout with Generative Flow Networks (under review)
2. MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition (EMNLP 2022)
3. GraphCC for Diverse and Novel Antimicrobial Peptides Generation and Selection (preprint, under review)
4. Self-Active Learning for Multilingual Language Models: Case Study of 23 African Languages (EMNLP 2022)
5. A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation, NAACL 2022
6. Biological Sequence Design with GFlowNets (ICML 2022)
7. MeSH2Matrix: Machine learning-driven biomedical relation classification based on the MeSH keywords of PubMed scholarly publications - BIR, ECIR 2022
8. MMTAfrica: Multilingual Machine Translation for African Languages - WMT, EMNLP 2021
9. FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition, Affective Behavior Analysis In-the-Wild (ABAW) - ICCV 2021
10. OkwuGbé: End-to-End Speech Recognition for Fon and Igbo, WideningNLP - EMNLP 2021 & AfricaNLP - EACL 2021
11. Crowdsourced Phrase-Based Tokenization for Low-Resourced Neural Machine Translation: The Case of Fon Language, AfricaNLP - EACL 2021
12. MasakhaNER: Named Entity Recognition for African Languages, TACL 2021
13. Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets, AfricaNLP - EACL 2021
14. AfriVEC: Word Embedding Models for African Languages. Case Study of Fon and Nobiin, AfricaNLP - EACL 2021
15. An Approach to Intelligent Pneumonia Detection and Integration
16. Participatory Research for Low-resourced Machine Translation: A Case Study in African Languages, Findings of EMNLP 2021
17. Lanfrica: A Participatory Approach to Documenting Machine Translation Research on African Languages
18. Masakhane -- Machine Translation For Africa, AfricaNLP - ICLR 2021
19. FFR v1.1: Fon-French Neural Machine Translation, WideningNLP - ACL 2021
20. FFR v1.0: Fon-French Neural Machine Translation, AfricaNLP - ICLR 2021

Awards, Honours and Grants

1. Mila Quebec AI Institute's 2021-2022 Impact Annual Report
2. McGill Engineering Doctoral Award (MEDA)
3. Innovation Award 2022 of the German African Diaspora
4. Dean's Prize for outstanding Master's Thesis
5. Shuttleworth Flash Grant
6. Winner of the ViVaTech-Unesco Challenge for Cracking Language Barriers through Data and AI
7. Wikimedia Foundation Research of the Year Award 2021, with the Masakhane Community
8. Lacuna Fund grant for Named Entity Recognition for Fon, with the Masakhane Community
9. Jacobs University Community Award 2021 for Innovation, Cultural Understanding, and Diversity
10. Jacobs University Hall of Fame
11. Jacobs University Mobility Area’s Scholarship & Jacobs University Faces
12. Academic and scientific paper reviewer at the AfricaNLP workshop, EACL 2021
13. Global Nominee and Benin’s finalist with «Afro Num» - NASA's 2020 World Space Apps Challenge
14. Winner of the National Russian AI Hackathon 2019
15. International interviews and articles on BBC, Voice of America, and German and Russian newspapers and TV channels
16. Scientific presentations and publications, workshop organization, and reviewing services at ACL, EACL, NAACL, AACL, EMNLP, ICML, ICLR, NeurIPS (2020, 2021, 2022)