Anas Amchaar
Abdellah Oumida
Mohammed Sbaihi
Abdeljalil El Majjodi

Creating your custom Ghibli Text-to-Image model

How we built a Ghibli-style text-to-image model that authentically represents Moroccan culture, using LoRA fine-tuning and a curated dataset.

Read more
Abdeljalil El Majjodi
Abdelaziz Bounhar

AL Atlas: Moroccan Darija Pretraining

We present a comprehensive dataset for Moroccan darija, addressing the lack of resources for this widely spoken dialect. We detail our collection methodology, provide thorough data analysis, and demonstrate performance improvements in both masked and causal language models after training on this dataset.

Read more
Abdeljalil El Majjodi
Aymane El Firdoussi
Ihssane Nedjaoui

Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect

We introduce Darija Chatbot Arena, an innovative platform designed to facilitate the comparison of responses from various Large Language Models (LLMs) on a diverse set of prompts in Darija, the Moroccan Arabic dialect.

Read more
Imane Momayiz
ao
Ali Nirheche
Choukrani

TerjamaBench: A Cultural Benchmark for English-Darija Machine Translation

We introduce TerjamaBench, an evaluation benchmark for English-Darija machine translation.

Read more