profile picture

Jesse Dodge

Senior Research Scientist @ Allen Institute for AI

evaluation

benchmarking

pretraining

multilinguality

reproducibility

hallucination

computational sociolinguistics

large language model

data quality

language id

grammaticality

intrinsic subspace

multilingualism and cross-lingual nlp

evaluation methodologies

machine translation

7

presentations

4

number of views

SHORT BIO

Jesse Dodge is a research scientist at the Allen Institute for AI. His research focuses on reproducibility and efficiency. He earned his PhD from the LTI at CMU in spring of 2020.

Presentations

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging

Clara Na and 6 other authors

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging

Jacob Morrison and 5 other authors

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters

Lucy Li and 6 other authors

Language Models Hallucinate, but May Excel at Fact Verification

Jian Guan and 4 other authors

Expected Validation Performance and Estimation of a Random Variable's Maximum

Jesse Dodge and 4 other authors

Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus

Jesse Dodge and 7 other authors

Expected Validation Performance and Estimation of a Random Variable's Maximum

Jesse Dodge and 4 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved