profile picture

Taja Kuzman

causal commonsense reasoning; copa; south slavic dialects; natural language understanding; large language models

english varieties

english variety classifier

automatic genre identification

web corpus

south slavic languages

bcms

hbs

discrimination between closely related languages

corpora linguistics

4

presentations

SHORT BIO

Taja Kuzman is a computational linguist with a MA degree in translation, pursuing a PhD in IT at the Jožef Stefan International Postgraduate School, Ljubljana, Slovenia. Her PhD research is focused on automatic genre identification. Taja works as a research assistant, developing language resources and technologies at the Department for Knowledge Technologies at the Jožef Stefan Institute, and co-leading the CLARIN Knowledge Centre for South Slavic languages CLASSLA. She is epecially active in text categorization tasks, web corpora collection and curation and machine translation.

Presentations

DIALECT-COPA: Extending the Standard Translations of the COPA Causal Commonsense Reasoning Dataset to South Slavic Dialects

Nikola Ljubešić and 6 other authors

JSI and W\"{u}NLP at the DIALECT-COPA Shared Task: In-Context Learning From Just a Few Dialectal Examples Gets You Quite Far

Nikola Ljubešić and 5 other authors

Get to Know Your Parallel Data: Performing English Variety and Genre Classification over MaCoCu Corpora

Taja Kuzman and 2 other authors

BENCHic ́-lang: A Benchmark for Discriminating between Bosnian, Croatian, Mon- tenegrin and Serbian

Peter Rupnik and 2 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved