
Taja Kuzman
causal commonsense reasoning; copa; south slavic dialects; natural language understanding; large language models
english varieties
english variety classifier
automatic genre identification
web corpus
south slavic languages
bcms
hbs
discrimination between closely related languages
corpora linguistics
4
presentations
SHORT BIO
Taja Kuzman is a computational linguist with a MA degree in translation, pursuing a PhD in IT at the Jožef Stefan International Postgraduate School, Ljubljana, Slovenia. Her PhD research is focused on automatic genre identification. Taja works as a research assistant, developing language resources and technologies at the Department for Knowledge Technologies at the Jožef Stefan Institute, and co-leading the CLARIN Knowledge Centre for South Slavic languages CLASSLA. She is epecially active in text categorization tasks, web corpora collection and curation and machine translation.
Presentations

DIALECT-COPA: Extending the Standard Translations of the COPA Causal Commonsense Reasoning Dataset to South Slavic Dialects
Nikola Ljubešić and 6 other authors

JSI and W\"{u}NLP at the DIALECT-COPA Shared Task: In-Context Learning From Just a Few Dialectal Examples Gets You Quite Far
Nikola Ljubešić and 5 other authors

Get to Know Your Parallel Data: Performing English Variety and Genre Classification over MaCoCu Corpora
Taja Kuzman and 2 other authors

BENCHic ́-lang: A Benchmark for Discriminating between Bosnian, Croatian, Mon- tenegrin and Serbian
Peter Rupnik and 2 other authors