
8
presentations
18
number of views
SHORT BIO
Paul’s research focuses on natural language processing (NLP) methods for detecting online hate speech. He has worked on new techniques for evaluating hate speech detection models, on accounting for language change in NLP models and on managing data annotation for subjective NLP tasks. His wider research interests are in incorporating context (e.g. social, temporal) as well as subjective perspectives into language modelling.
Presentations

Beyond Flesch-Kincaid: Prompt-based Metrics Improve Difficulty Classification of Educational Texts
Donya Rooein and 3 other authors

The Past, Present and Better Future of Feedback Learning in Large Language Models for Subjective Human Preferences and Values | VIDEO
Hannah Rose Kirk and 4 other authors

Data-Efficient Strategies for Expanding Hate Speech Detection into Under-Resourced Languages
Paul Röttger and 3 other authors

Multilingual HateCheck: Functional Tests for Multilingual Hate Speech Detection Models
Paul Röttger

Two Contrasting Data Annotation Paradigms for Subjective NLP Tasks
Paul Röttger and 3 other authors

Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-Based Hate
Hannah Rose Kirk and 4 other authors

Temporal Adaptation of BERT and Performance on Downstream Document Classification: Insights from Social Media
Paul Röttger and 1 other author

HateCheck: Functional Tests for Hate Speech Detection Models
Paul Röttger and 5 other authors