
Premium content
Access to this content requires a subscription. You must be a premium user to view this content.

workshop paper
Alson at NADI 2024 shared task: Alson - A fine-tuned model for Arabic Dialect Translation
keywords:
and modern standard arabic.
fine-tuned
pre-trained
dialects
DA-MSA Machine Translation is a recent challenge due to the multitude of Arabic dialects and their variations. In this paper, we present our results within the context of Subtask 3 of the NADI-2024 Shared Task(Abdul- Mageed et al., 2024) that is DA-MSA Machine Translation . We utilized the DIALECTS008 MSA MADAR corpus (Bouamor et al., 2018), the Emi-NADI corpus for the Emirati dialect (Khered et al., 2023), and we augmented the Palestinian and Jordanian datasets based on NADI 2021. Our approach involves develop013 ing sentence-level machine translations from Palestinian, Jordanian, Emirati, and Egyptian dialects to Modern Standard Arabic (MSA).To 016 address this challenge, we fine-tuned mod els such as (Nagoudi et al., 2022)AraT5v2- msa-small, AraT5v2-msa-base, and (Elmadany et al., 2023)AraT5v2-base-1024 to compare their performance. Among these, the AraT5v2- base-1024 model achieved the best accuracy, with a BLEU score of 0.1650 on the develop023 ment set and 0.1746 on the test set.