Wakhi Alphabet at a Glance

  • 32 Perso-Arabic letters of the Wakhi writing system, following the Dari/Afghan and Urdu/Pakistani orthographic convention
  • Wakhi is an Eastern Iranian Pamiri language spoken by approximately 58,000–100,000 people across Afghanistan (Wakhan Corridor), Pakistan (Hunza-Gilgit), Tajikistan, and China (Tashkurgan/Xinjiang) — the most geographically widespread Pamiri language [1]
  • Wakhi belongs to the Wakhi branch of Eastern Iranian — distinct from both the Shughni-Yazghulami group and the Munji-Yidgha subgroup; historically the language of the Wakhan kingdom on the ancient Silk Road [4]
  • ISO 639-3 code: wbl; uses the Perso-Arabic abjad in Afghanistan and Pakistan (Cyrillic is used in Tajikistan) [3]
  • Preserves the Old Iranian /w/ in و — an archaic East Iranian feature shared with Yidgha and Munji, contrasting with the /v/ of surrounding Dari and Urdu [2]
  • Wakhi speakers are typically multilingual in Dari (Afghanistan), Urdu/Burusho/Shina (Pakistan), or Tajik (Tajikistan) depending on their country of residence
  • Includes 4 letters unique to Persian/Dari script — پ (pe), چ (che), ژ (zhe), گ (gaf) — not found in classical Arabic

Wakhi is an Eastern Iranian Pamiri language spoken by approximately 58,000–100,000 people across four countries: the Wakhan Corridor of northeastern Afghanistan, the Hunza-Gilgit region of northern Pakistan, parts of the Gorno-Badakhshan Autonomous Oblast of Tajikistan, and the Tashkurgan Tajik Autonomous County of Xinjiang, China [1]. It uses the 32-letter Perso-Arabic script in Afghanistan and Pakistan, following Dari/Afghan and Urdu/Pakistani orthographic conventions [2]. Wakhi is the most geographically widespread of all the Pamiri languages, stretching across the high mountain valleys that connect Central Asia, South Asia, and East Asia.

Wakhi belongs to the Wakhi branch of Eastern Iranian, forming a distinct subgroup separate from both the Shughni-Yazghulami languages (Shughni, Bartangi, Yazgulami, Sarikoli) and the Munji-Yidgha subgroup [3]. Wakhi is thus the sole member of its own branch within Eastern Iranian — a unique position reflecting its long geographic isolation in the high-altitude valleys of the Pamirs and Karakoram. The language was historically spoken in the Wakhan kingdom, an ancient polity that controlled a key corridor of the Silk Road connecting the cultures of Persia, India, China, and Central Asia.

Wakhi preserves the archaic /w/ pronunciation of و (vav) inherited directly from Old Iranian *w — a feature shared with Yidgha and Munji but contrasting sharply with the /v/ of surrounding Dari, Urdu, and Persian [4]. This archaic bilabial approximant is one of the key phonological markers of the Eastern Iranian heritage of Wakhi. Despite the geographic dispersal of its speakers across four countries and the pressure of multiple dominant languages, Wakhi maintains a strong spoken tradition and continues to be used as a community language in the high mountain valleys of the Pamirs and Karakoram.

Wakhi Consonant Letters

The Wakhi alphabet contains 32 consonant letters based on the Perso-Arabic script — 28 Arabic letters plus 4 Persian/Dari-specific letters: پ (pe /p/), چ (che /tʃ/), ژ (zhe /ʒ/), and گ (gaf /g/).

Wakhi is an Eastern Iranian Pamiri language of the Wakhan Corridor (Afghanistan), Hunza-Gilgit (Pakistan), Tajikistan, and Tashkurgan (China). Each letter changes shape by position.

Wakhi Consonants

ا
[AH-lef]
ب
[BEH]
پ
[PEH]
ت
[TEH]
ث
[SEH]
ج
[JEEM]
چ
[CHEH]
ح
[HEH-JEE-mee]
خ
[KHEH]
د
[DAHL]
ذ
[ZAHL]
ر
[REH]
ز
[ZEH]
ژ
[ZHEH]
س
[SEEN]
ش
[SHEEN]
ص
[SAWD]
ض
[DAWD]
ط
[TAW]
ظ
[ZAW]
ع
[AYN]
غ
[GHAYN]
ف
[FEH]
ق
[QAHF]
ک
[KAHF]
گ
[GAHF]
ل
[LAHM]
م
[MEEM]
ن
[NOON]
و
[VAWV]
ه
[HEH]
ی
[YEH]

Wakhi Vowel Diacritics (Harakat)

Wakhi is an abjad — short vowels are not written by default but indicated by optional diacritical marks (harakat).

Wakhi has a rich vowel system inherited from Old Iranian. Fully-vocalised texts with harakat are especially valuable for documenting this Pamiri language of the Silk Road region.

Wakhi Vowel Diacritics (Harakat)

َ
[FAH-tah]
ِ
[KAS-rah]
ُ
[DAM-mah]
ّ
[SHAD-dah]
ْ
[SOO-koon]
ً
[TAN-ween]

All Alphabet

The complete Wakhi alphabet with all 32 Perso-Arabic letters in traditional order, from ا (alef) to ی (ye). These letters form the writing system of Wakhi, an Eastern Iranian Pamiri language of the Wakhan Corridor (Afghanistan), Hunza-Gilgit (Pakistan), Tajikistan, and Tashkurgan (China), following the Dari/Afghan and Urdu/Pakistani script conventions.

ا
ب
پ
ت
ث
ج
چ
ح
خ
د
ذ
ر
ز
ژ
س
ش
ص
ض
ط
ظ
ع
غ
ف
ق
ک
گ
ل
م
ن
و
ه
ی

Eastern Arabic Digits (۰–۹)

Wakhi texts use Eastern Arabic-Indic numerals (۰–۹) rather than Western Arabic numerals (0–9). These digits follow the standard Dari/Urdu convention used across Afghanistan and Pakistan, reflecting the influence of Dari and Urdu on Wakhi written conventions in the Wakhan Corridor and Hunza-Gilgit, written left to right even in otherwise right-to-left text.

Eastern Arabic Digits (۰–۹)

۰
۱
۲
۳
۴
۵
۶
۷
۸
۹

Frequently Asked Questions (FAQs)

References:

Sambhu Raj SinghSambhu Raj Singh · LinkedIn · GitHub · Npm

Updated:


Shughni Cyrillic — Ӣ ӣ, Ӯ ӯ & Pamiri letters...
Munji: 32-letter Perso-Arabic script of the endangered East Iranian language of Badakhshan, Afghanistan...
Endangered Eastern Iranian language of Chitral, Pakistan...
Endangered Pamiri language of Gorno-Badakhshan, Tajikistan...
Endangered Pamiri language of Gorno-Badakhshan, Tajikistan...