profile picture

Robin Jia

Assistant Professor @ University of Southern California

large language models

benchmarking

nli

in-context learning

evaluation

question answering

long-tail

generalization

few-shot learning

semantic parsing

continual learning

zero-shot learning

pretraining

neuro-symbolic

qa

18

presentations

13

number of views

SHORT BIO

I am an assistant professor in the Department of Computer Science at the University of Southern California. I am interested broadly in natural language processing and machine learning, with a particular focus on building NLP systems that are robust to distribution shift at test time.

Presentations

When Parts are Greater Than Sums: Individual LLM Components Can Outperform Full Models

Ting-Yun Chang and 2 other authors

Proving membership in LLM pretraining data via data watermarks

Johnny Tian-Zheng Wei and 2 other authors

How Predictable Are Large Language Model Capabilities? A Case Study on BIG-bench

Qinyuan Ye and 3 other authors

Estimating Large Language Model Capabilities without Labeled Test Data

Harvey Fu and 4 other authors

Data Curation Alone Can Stabilize In-context Learning

Ting-Yun Chang and 1 other author

Contrastive Novelty-Augmented Learning: Anticipating Outliers with Large Language Models

Albert Xu and 2 other authors

Benchmarking Long-tail Generalization with Likelihood Splits

Ameya Godbole and 1 other author

Benchmarking Long-tail Generalization with Likelihood Splits

Ameya Godbole and 1 other author

Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants

Max Bartolo and 5 other authors

On the Robustness of Reading Comprehension Models to Entity Renaming

Jun Yan and 5 other authors

Analyzing Dynamic Adversarial Training Data in the Limit

Eric Wallace and 3 other authors

On Continual Model Refinement in Out-of-Distribution Data Streams

Bill Yuchen Lin and 6 other authors

Question Answering Infused Pre-training of General-Purpose Contextualized Representations

Robin Jia and 2 other authors

The statistical advantage of automatic NLG metrics at the system level

Johnny Tian-Zheng Wei and 1 other author

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little

Koustuv Sinha and 5 other authors

Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation

Max Bartolo and 5 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved