Sarikoli Alphabet at a Glance

  • 32 letters written right to left using the Perso-Arabic script, an abjad where short vowels are not written in everyday text
  • Sarikoli is an Eastern Iranian Pamiri language spoken in the Tashkurgan Tajik Autonomous County of Xinjiang, China, with approximately 20,000–25,000 speakers — the only Pamiri language spoken in China [1]
  • Uses the Perso-Arabic script, following the Persian/Dari writing tradition adapted for Pamiri phonology [2]
  • Sarikoli is the closest relative of Shughni within the Shughni-Rushani branch of Eastern Iranian — together they form the core of the Pamiri language group [3]
  • Belongs to the Shughni-Rushani group within Eastern Iranian (Pamiri branch) — its closest relatives are Shughni, Rushani, Bartangi, Oroshori, and Yazghulami [4]
  • Sarikoli speakers in China are officially recognised as the Tajik ethnic minority (塔吉克族), though linguistically Sarikoli is distinct from Tajik (a Western Iranian language) [5]
  • Includes 4 letters unique to the Persian/Dari script, not found in Arabic: پ (pe), چ (che), ژ (zhe), and گ (gaf); short vowels are indicated by optional diacritical marks (harakat) in educational texts

Sarikoli Consonant Letters

The Sarikoli alphabet contains 32 consonant letters based on the Perso-Arabic script. This includes the 28 letters of the Arabic alphabet plus 4 letters unique to Persian/Dari: پ (pe /p/), چ (che /tʃ/), ژ (zhe /ʒ/), and گ (gaf /ɡ/). Each letter changes shape depending on its position within a word — initial, medial, final, or isolated. Sarikoli uses this script as the closest relative of Shughni within the Eastern Iranian Pamiri language family, spoken in Tashkurgan, Xinjiang, China.

Sarikoli Consonants

ب
[BEH]
پ
[PEH]
ت
[TEH]
ث
[SEH]
ج
[JIM]
چ
[CHEH]
ح
[HEH]
خ
[KHEH]
د
[DAL]
ذ
[ZAL]
ر
[REH]
ز
[ZEH]
ژ
[ZHEH]
س
[SIN]
ش
[SHIN]
ص
[SAD]
ض
[ZAD]
ط
[TAH]
ظ
[ZAH]
ع
[AIN]
غ
[GHAIN]
ف
[FEH]
ق
[QAHF]
ک
[KAF]
گ
[GAF]
ل
[LAM]
م
[MIM]
ن
[NUN]
و
[VAV]
ه
[HEH]
ی
[YEH]
ء
[HAMZA]

Sarikoli Vowel Diacritics (Harakat)

Like all Perso-Arabic scripts, Sarikoli is an abjad — a consonantal alphabet where short vowels are not written by default but indicated by optional diacritical marks (harakat). These six marks are used in educational materials to aid correct pronunciation of Sarikoli, whose Eastern Iranian Pamiri vowel system includes phonemic contrasts preserved from Old Iranian that are absent from standard Persian.

Sarikoli Vowel Diacritics

َ
[FAT-ha]
ِ
[KAS-ra]
ُ
[DAM-ma]
ّ
[SHAD-da]
ً
[tan-WIN]
ْ
[SOO-kun]

All Alphabet

The complete Sarikoli alphabet with all 32 Perso-Arabic letters in traditional order, from ا (alef) to ی (ye). These letters form the foundation of the Sarikoli writing system, the Eastern Iranian Pamiri language of Tashkurgan Tajik Autonomous County, Xinjiang, China — the only Pamiri language spoken within China.

ا
ب
پ
ت
ث
ج
چ
ح
خ
د
ذ
ر
ز
ژ
س
ش
ص
ض
ط
ظ
ع
غ
ف
ق
ک
گ
ل
م
ن
و
ه
ی

Eastern Arabic Digits (۰–۹)

Sarikoli texts use Eastern Arabic-Indic numerals (۰–۹) following the Persian numeral tradition. These digits are written left to right even within otherwise right-to-left Sarikoli text.

Sarikoli Digits

۰
۱
۲
۳
۴
۵
۶
۷
۸
۹

Special Characters & Punctuation

Sarikoli and Persian texts use Arabic punctuation marks that are mirrored versions of their Western Latin equivalents. The Arabic comma (،) and Arabic question mark (؟) are reflected horizontally to suit right-to-left reading direction, while guillemets (« ») serve as quotation marks in formal Sarikoli writing.

،
؛
؟
«
»
٪
ـ

Frequently Asked Questions (FAQs)

References:

Sambhu Raj SinghSambhu Raj Singh · LinkedIn · GitHub · Npm

Updated:


Shughni Cyrillic — Ӣ ӣ, Ӯ ӯ & Pamiri letters...
Endangered Pamiri language of Gorno-Badakhshan, Tajikistan...
Farsi — ب پ ت, ی ک گ & Perso-Arabic letters...
The 32-letter Perso-Arabic script of Takestani, an endangered Northwest Iranian language of Qazvin Province...
Semnani — ب پ ت, ی ک گ & Perso-Arabic letters...