Tangut at a Glance

  • Tangut is an extinct Sino-Tibetan language of the Western Xia dynasty, last attested around 1502 CE — it was the official language of a powerful state that rivalled the Song and Jin dynasties of medieval China [1]
  • Tangut belongs to the Qiangic branch of Sino-Tibetan, making it a distant relative of Classical Tibetan and the Qiang languages of the Sichuan-Gansu border region [2]
  • The Tangut script was created around 1036 CE by order of Emperor Jing Zong and encoded in Unicode in 2016 — it contains over 6,000 unique logographic characters [3]
  • The Tangut script was intentionally designed to be visually distinct from Chinese characters, though it uses a similar stroke-based construction method and similar structural complexity
  • The primary Tangut dictionary — the Sea of Characters — lists over 6,000 characters, each representing a syllable-morpheme combination in the Tangut language
  • The Russian Kozlov expedition (1908–1909) recovered thousands of Tangut manuscripts from the ruins of Khara-Khoto in Inner Mongolia, providing the main corpus for modern decipherment
  • Tangut characters are built from about 756 distinct structural components — similar to Chinese radicals — which can appear as semantic classifiers or phonetic indicators within characters

Tangut Characters

A representative selection of Tangut logographic characters from the Western Xia dynasty corpus. Each character represents a syllable-morpheme of the Tangut language.

The Tangut script contains over 6,000 unique characters, each built from structural components similar to Chinese radicals. The script was created in 1036 CE and was used for Buddhist scriptures, administrative documents, and dictionaries.

𗀀
[lhi]
𗀁
[lhjij]
𗀂
[ljij]
𗀃
[lhjij]
𗀄
[lhjij]
𗀅
[lew]
𗀆
[lhjij]
𗀇
[lhjij]
𗀈
[lhjij]
𗀉
[lhjij]
𗀊
[lhjij]
𗀋
[lhjij]
𗀌
[lhjij]
𗀍
[lhjij]
𗀎
[lhjij]
𗀏
[lhjij]
𗀐
[lhjij]
𗀑
[lhjij]
𗀒
[lhjij]
𗀓
[lhjij]

Tangut Components

Tangut components are the structural building blocks used to construct Tangut characters — similar to radicals in Chinese script. The Tangut Yinchuan dictionary identified 756 distinct components.

Components can function as semantic classifiers (indicating meaning category) or phonetic indicators. Unlike Chinese radicals, Tangut components do not always appear as standalone characters outside of their role as building blocks.

𘠀
𘠁
𘠂
𘠃
𘠄
𘠅

Digits

The Tangut script did not use a positional numeral system. Arabic numerals are used in modern scholarly editions of Tangut for line numbering and catalogue references.

Quantities in Tangut texts were expressed using logographic number characters within the main Tangut character set, following conventions similar to Classical Chinese.

0
1
2
3
4
5
6
7
8
9

Special Characters

Tangut manuscripts used an Iteration Mark to indicate repetition of the preceding character or syllable, following conventions also found in Chinese and Japanese texts of the same period.

Modern scholarly editions of Tangut additionally use square brackets to indicate lacunae, damaged readings, or uncertain text in manuscript transcriptions.

𘟬
[
]

Frequently Asked Questions (FAQs)

References:

Sambhu Raj SinghSambhu Raj Singh · LinkedIn · GitHub · Npm

Updated:


Explore the Tangut logographic script — the extinct writing system of the Western Xia dynasty...
Learn the Tibetan script — 30 consonants, vowel diacritics and Tibetan digits...
Chinese uses Hanzi characters with Pinyin romanisation and Bopomofo phonetics...
Learn Hangul, Korea's scientific and elegant 40-letter writing system
Explore thousands of Kanji characters, Japan's beautiful logographic writing