
Premium content
Access to this content requires a subscription. You must be a premium user to view this content.

workshop paper
ELYADATA at NADI 2024 shared task: Arabic Dialect Identification with Similarity-Induced Mono-to-Multi Label Transformation.
keywords:
msa
nadi2024
multi labled dialect identification
staged fine-tuning
binary relavence
simmt
dialectal arabic
ensemble models
dialect identification
preprocessing
fine-tuning
bert
This paper describes our submissions to the Multi-label Country-level Dialect Identification subtask of the NADI2024 shared task, organized during the second edition of the ArabicNLP conference. Our submission is based on the ensemble of fine-tuned BERT-based models, after implementing the Similarity-Induced Mono-to-Multi Label Transformation (SIMMT) on the input data. Our submission ranked first with a Macro-Average (MA) F1 score of 50.57%.