
Ryan Cotterell
Assistant Professor @ ETH Zurich
language models
information theory
probing
rnn
transformer
language modeling
llms
concepts
preference optimization
interpretability
language model
recurrent neural networks
bias
uniform information density
qa
42 presentations
29 views
1 citation
SHORT BIO
Ryan is an assistant professor of computer science at ETH Zürich, where he has been since 2020. Before that, he spent a year as a lecturer at the University of Cambridge. He defended his PhD in 2019 at Johns Hopkins University, where his advisor was Jason Eisner. Ryan likes probability, word formation, and hierarchical structure, among other things.
Presentations

Measuring Susceptibility to Irrelevant Context in Language Models
Tianyu Liu and 3 other authors

Activation Scaling for Attribution and Intervention in Language Models
Niklas Stoehr and 5 other authors

Can Transformer Language Models Learn n-gram Language Models?
Anej Svete and 4 other authors

Reverse-Engineering the Reader
Samuel Kiegeland and 4 other authors

On the Proper Treatment of Tokenization in Psycholinguistics
Mario Giulianelli and 5 other authors

Surprisal Curves of Discourse
Eleftheria Tsipidi and 5 other authors

Generalized Measures of Anticipation and Responsivity in Online Language Processing
Mario Giulianelli and 2 other authors

Direct Preference Optimization with an Offset
Afra Amini and 2 other authors

On Efficiently Representing Regular Languages as RNNs
Anej Svete and 2 other authors

On the Representational Capacity of Neural Language Models with Chain-of-Thought Reasoning
Franz Nowak and 3 other authors

Context versus Prior Knowledge in Language Models
Kevin Du and 5 other authors

A Transformer with Stack Attention
Jiaoda Li and 3 other authors

The Role of n-gram Smoothing in the Age of Neural Networks
Luca Malagutti and 5 other authors

Transformers Can Represent n-gram Language Models
Anej Svete and 1 other author

On the Relationship Between Non-deterministic FSLMs and RNN LMs
Anej Svete and 3 other authors

Efficient Algorithms for Recognizing Weighted Tree-Adjoining Languages
Alexandra Butoi and 3 other authors