
Robin Jia
Assistant Professor @ University of Southern California
large language models
benchmarking
nli
in-context learning
evaluation
question answering
long-tail
generalization
few-shot learning
semantic parsing
continual learning
zero-shot learning
pretraining
neuro-symbolic
qa
18
presentations
13
number of views
SHORT BIO
I am an assistant professor in the Department of Computer Science at the University of Southern California. I am interested broadly in natural language processing and machine learning, with a particular focus on building NLP systems that are robust to distribution shift at test time.
Presentations

When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models
Ting-Yun Chang and 2 other authors

Proving membership in LLM pretraining data via data watermarks
Johnny Tian-Zheng Wei and 2 other authors

How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench
Qinyuan Ye and 3 other authors

Estimating Large Language Model Capabilities without Labeled Test Data
Harvey Fu and 4 other authors

Data Curation Alone Can Stabilize In-context Learning
Ting-Yun Chang and 1 other author

Contrastive Novelty-Augmented Learning: Anticipating Outliers with Large Language Models
Albert Xu and 2 other authors

Benchmarking Long-tail Generalization with Likelihood Splits
Ameya Godbole and 1 other author

Benchmarking Long-tail Generalization with Likelihood Splits
Ameya Godbole and 1 other author

Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants
Max Bartolo and 5 other authors

On the Robustness of Reading Comprehension Models to Entity Renaming
Jun Yan and 5 other authors

Analyzing Dynamic Adversarial Training Data in the Limit
Eric Wallace and 3 other authors

On Continual Model Refinement in Out-of-Distribution Data Streams
Bill Yuchen Lin and 6 other authors

Question Answering Infused Pre-training of General-Purpose Contextualized Representations
Robin Jia and 2 other authors

The statistical advantage of automatic NLG metrics at the system level
Johnny Tian-Zheng Wei and 1 other author

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Koustuv Sinha and 5 other authors

Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation
Max Bartolo and 5 other authors