
Leon Weber
dataset
active learning
evaluation
biomedical
language modeling
trust
human-in-the-loop
trustworthiness
training dynamics
annotation error detection
data-centric
llms
instruction tuning
noise detection
SHORT BIO
Leon Weber is a postdoctoral researcher at MaiNLP, where he works with Barbara Plank on human-centric Natural Language Processing (NLP). He currently focuses on how human-in-the-loop approaches and data-centric machine learning can improve the performance of NLP models. He is also interested in applying NLP to the biomedical domain.
Prior to his role at MaiNLP, Leon pursued his PhD at Humboldt University of Berlin and MDC Berlin, where he worked on biomedical NLP under the guidance of Ulf Leser and Jana Wolf.
Before that, Leon studied in Berlin, where he earned his Bachelor's and Master's degrees in Computer Science from Humboldt University and Free University. In addition to his Computer Science degrees, he holds a Bachelor of Arts in Philosophy from Bamberg University.
Presentations

Donkii: Characterizing and Detecting Errors in Instruction-Tuning Datasets
Leon Weber and 3 other authors

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
Robert Litschko and 4 other authors

ActiveAED: A Human in the Loop Improves Annotation Error Detection
Leon Weber and 1 other author

Dataset Debt in Biomedical Language Modeling
Jason Fries and 11 other authors

Extend, don’t rebuild: Phrasing conditional graph modification as autoregressive sequence labelling
Leon Weber and 3 other authors