
Barbara Plank
evaluation
human label variation
ner
computational job market analysis
llms
nlp4hr
uncertainty
language modeling
disagreement
language models
representation learning
relation classification
dependency parsing
cross-lingual
pragmatics
25
presentations
22
number of views
1
citations
SHORT BIO
Full Prof, Chair for AI and Computational Linguistics
Presentations

To Know or Not To Know? Analyzing Self-Consistency of Large Language Models under Ambiguity
Anastasiia Sedova and 4 other authors

Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models
Philipp Mondorf and 1 other author

The Potential and Challenges of Evaluating Attitudes, Opinions, and Values in Large Language Models
Bolei Ma and 6 other authors

Seeing the Big through the Small: Can LLMs Approximate Human Judgment Distributions on NLI from a Few Explanations?
Beiduo Chen and 5 other authors

"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models
Xinpeng Wang and 7 other authors

What Do Dialect Speakers Want? A Survey of Attitudes Towards Language Technology for German Dialects
Verena Blaschke and 3 other authors

Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning
Philipp Mondorf and 1 other author

What's wrong with your model? A Quantitative Analysis of Relation Classification
Barbara Plank and 2 other authors

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark
Stephen Mayhew and 12 other authors

Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties
Ekaterina Artemova and 2 other authors

Entity Linking in the Job Market Domain
Mike Zhang and 2 other authors

NNOSE: Nearest Neighbor Occupational Skill Extraction
Mike Zhang and 3 other authors

Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?
Joris Baan and 3 other authors

What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability
Mario Giulianelli and 4 other authors

Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training
Max Müller-Eberstein and 3 other authors

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation
Robert Litschko and 4 other authors