QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard

Post Details

Company

HuggingFace

Date Published

April 21, 2026

Author

Leen AlQadi, Ahmed Alzubaidi, Mohammed Alyafeai, Maitha Alhammadi, Shaikha Alsuwaidi, Omar saif alkaabi, Basma Boussaha, and Hakim Hacid

Word Count

1,731

Company Posts That Month

61

Language

-

Hacker News Points

-

Source URL

huggingface.co/blog/tiiuae/qimma-arabic-leaderboard

Summary

QIMMA is an Arabic Language Model Leaderboard that addresses the challenges in evaluating Arabic NLP by implementing a rigorous quality validation pipeline before model evaluation. It highlights systematic quality issues in existing benchmarks and provides a unified evaluation suite covering over 52,000 samples across seven domains, ensuring 99% native Arabic content. QIMMA uniquely integrates code evaluation, applies a multi-stage validation process involving both automated assessments and human review to maintain cultural and dialectal accuracy, and releases transparent, per-sample inference outputs. The leaderboard demonstrates that model performance does not solely depend on size, as smaller, Arabic-specialized models often outperform larger, multilingual ones in specific domains.

Trends Found in this Post

Trend	Post Mentions	Total Month Mentions	Posts	Companies	MoM
LLM	9	5,932	1,046	223	-2%
AI Guardrails	3	362	123	45	+1%