1
presentations
1
number of views
SHORT BIO
I am a postdoctoral researcher in Dirk Hovy‘s MilaNLP Lab. My work is located at the intersection of computation, language and society. Right now, I am particularly interested in evaluating and aligning social values in large generative language models.
In May 2023, I completed my PhD at the University of Oxford, where I was supervised by Janet Pierrehumbert and Helen Margetts. In my PhD, I worked on improving the evaluation and effectiveness of natural language processing models for hate speech detection. I also worked on general language modelling challenges like language change and annotator subjectivity. The HateCheck project that I led won the Stanford AI Audit Challenge.