
Jesse Dodge
Senior Research Scientist @ Allen Institute for AI
evaluation
benchmarking
pretraining
multilinguality
reproducibility
hallucination
computational sociolinguistics
large language model
data quality
language id
grammaticality
intrinsic subspace
multilingualism and cross-lingual nlp
evaluation methodologies
machine translation
7
presentations
4
number of views
SHORT BIO
Jesse Dodge is a research scientist at the Allen Institute for AI. His research focuses on reproducibility and efficiency. He earned his PhD from the LTI at CMU in spring of 2020.
Presentations

Scalable Data Ablation Approximations for Language Models through Modular Training and Merging
Clara Na and 6 other authors

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging
Jacob Morrison and 5 other authors

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters
Lucy Li and 6 other authors

Language Models Hallucinate, but May Excel at Fact Verification
Jian Guan and 4 other authors

Expected Validation Performance and Estimation of a Random Variable's Maximum
Jesse Dodge and 4 other authors

Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus
Jesse Dodge and 7 other authors

Expected Validation Performance and Estimation of a Random Variable's Maximum
Jesse Dodge and 4 other authors