ELAB: Extensive LLM Alignment Benchmark in Persian Language

Pourbahman, Zahra; Rajabi, Fatemeh; Sadeghi, Mohammadhossein; Ghahroodi, Omid; Bakhshaei, Somaye; Amini, Arash; Kazemi, Reza; Baghshah, Mahdieh Soleymani

Computer Science > Computation and Language

arXiv:2504.12553 (cs)

[Submitted on 17 Apr 2025]

Title:ELAB: Extensive LLM Alignment Benchmark in Persian Language

Authors:Zahra Pourbahman, Fatemeh Rajabi, Mohammadhossein Sadeghi, Omid Ghahroodi, Somaye Bakhshaei, Arash Amini, Reza Kazemi, Mahdieh Soleymani Baghshah

View PDF HTML (experimental)

Abstract:This paper presents a comprehensive evaluation framework for aligning Persian Large Language Models (LLMs) with critical ethical dimensions, including safety, fairness, and social norms. It addresses the gaps in existing LLM evaluation frameworks by adapting them to Persian linguistic and cultural contexts. This benchmark creates three types of Persian-language benchmarks: (i) translated data, (ii) new data generated synthetically, and (iii) new naturally collected data. We translate Anthropic Red Teaming data, AdvBench, HarmBench, and DecodingTrust into Persian. Furthermore, we create ProhibiBench-fa, SafeBench-fa, FairBench-fa, and SocialBench-fa as new datasets to address harmful and prohibited content in indigenous culture. Moreover, we collect extensive dataset as GuardBench-fa to consider Persian cultural norms. By combining these datasets, our work establishes a unified framework for evaluating Persian LLMs, offering a new approach to culturally grounded alignment evaluation. A systematic evaluation of Persian LLMs is performed across the three alignment aspects: safety (avoiding harmful content), fairness (mitigating biases), and social norms (adhering to culturally accepted behaviors). We present a publicly available leaderboard that benchmarks Persian LLMs with respect to safety, fairness, and social norms at: this https URL.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2504.12553 [cs.CL]
	(or arXiv:2504.12553v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.12553

Submission history

From: Mahdieh Soleymani Baghshah [view email]
[v1] Thu, 17 Apr 2025 00:50:41 UTC (3,296 KB)

Computer Science > Computation and Language

Title:ELAB: Extensive LLM Alignment Benchmark in Persian Language

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ELAB: Extensive LLM Alignment Benchmark in Persian Language

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators