
Willem Zuidema
interpretability
context mixing
transformers
probing
gender bias
consistency
circuits
chatgpt
causal interventions
feature-attribution
speech transformers
pcfgs
targeted fine-tuning
model interpretability for spoken language
structural priming
6
presentations
13
number of views
SHORT BIO
Associate Professor of Computational Linguistics and Cognitive Science at the University of Amsterdam. PhD (2005), University of Edinburgh. Former Marie Curie, VENI, NIAS and Language in Interaction fellow.
Presentations

Do Language Models Exhibit Human-like Structural Priming Effects?
Jaap Jumelet and 2 other authors

Homophone Disambiguation Reveals Patterns of Context Mixing in Speech Transformers
Hosein Mohebbi and 3 other authors

Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model | VIDEO
Michael Hanna and 4 other authors

Transparency at the Source: Evaluating and Interpreting Language Models With Access to the True Distribution
Jaap Jumelet and 1 other author

Quantifying Context Mixing in Transformers
Hosein Mohebbi and 3 other authors

Language, Brains & Interpretability
Willem Zuidema