Yulia Tsvetkov

large language models

stress test

summarization

political communication

llm

agenda-setting

social media

language generation

framing

text generation

survey

politics

bias

information warfare

diffusion language models

Presentations: 42

Number of views: 41

Citations: 2

SHORT BIO

I am an assistant professor in the Paul G. Allen School of Computer Science & Engineering at the University of Washington. I'm also an adjunct professor at the Language Technologies Institute at CMU. I work on Natural Language Processing, a subfield of computer science focusing on computational processing of human languages. I am particularly interested in hybrid solutions at the intersection of machine learning and theoretical or social linguistics, i.e., solutions that combine interesting learning/modeling methods and insights about human languages or about the people speaking these languages.

Much of my research group's work focuses on NLP for social good, multilingual NLP, and language generation. This research is motivated by a unified goal: to extend the capabilities of human language technology beyond individual populations and across language boundaries, thereby enabling NLP for diverse and disadvantaged users, the users who need it most.

Previously, I was an assistant professor in the Language Technologies Institute, School of Computer Science at Carnegie Mellon University, and before that a postdoc in the Stanford NLP Group. I got my PhD from CMU.

Presentations

On the Importance of Nuanced Taxonomies for LLM-Based Understanding of Harmful Events: A Case Study on Antisemitism

Karina Halevy and 5 other authors

ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions

Chan Young Park and 6 other authors

Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration

Shangbin Feng and 6 other authors

Teaching LLMs to Abstain across Languages via Multilingual Feedback

Shangbin Feng and 8 other authors

Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia

Farhan Samir and 4 other authors

Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects

Orevaoghene Ahia and 7 other authors

Can LLM Graph Reasoning Generalize beyond Pattern Memorization?

Yizhuo Zhang and 6 other authors

Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

Shangbin Feng and 5 other authors

Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Yichen Wang and 7 other authors

Knowledge Crosswords: Geometric Knowledge Reasoning with Large Language Models

Wenxuan Ding and 6 other authors

DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

Herun Wan and 5 other authors

What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Shangbin Feng and 5 other authors

Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers

Roy Xie and 3 other authors

SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation

Abe Bohan Hou and 9 other authors

BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer

Akari Asai and 8 other authors

P^3Sum: Preserving Author’s Perspective in News Summarization with Diffusion Language Models

Yuhan Liu and 6 other authors