Assamese (অসমীয়া, ISO 639-3: asm) is an Eastern Indo-Aryan language spoken by approximately 15 million people in Assam and Arunachal Pradesh, India [1]. It uses the Bengali-Assamese script (U+0980–U+09FF) with two distinct letters — ৰ and ৱ — not found in Bengali [2].
Assamese belongs to the Bengali-Assamese branch of Eastern Indo-Aryan, closely related to Bengali, Sylheti, and Chittagonian [3].
A hallmark of Assamese phonology is the merger of all three sibilants (শ, ষ, স) into the velar fricative /x/ — a feature unique among major Indo-Aryan languages.
Assamese has approximately 35 consonant letters, including the two letters unique to Assamese: ৰ (ra, U+09F0) and ৱ (wa, U+09F1) — not used in Bengali. Each carries an inherent /ɔ/ vowel.
A key feature: the palatal stops (চ, জ) are pronounced as sibilants (/s/, /z/) in Assamese, not affricates as in Bengali — and all three sibilants (শ, ষ, স) merge into the velar fricative /x/.
Assamese has 11 independent vowels — standalone characters used when a vowel begins a syllable without a preceding consonant. Shared with Bengali in the same Unicode Block U+0980–U+09FF.
The inherent vowel of Assamese consonants is /ɔ/ (as in Bengali), not /ə/ as in Hindi. Assamese also has a distinct open vowel /a/ contrasting with /ɔ/ in many words.
Vowel signs (কাৰ, kar) are diacritical marks written around Assamese consonants — before, after, above, below, or as two-part signs (ো, ৌ) on both sides of the consonant.
The hasanta (্) suppresses the inherent vowel for consonant clusters; anusvara (ং) marks the velar nasal /ŋ/; chandrabindu (ঁ) marks nasalisation of vowels. Assamese Block: U+0980–U+09FF.
Assamese uses Bengali-Assamese numerals (০–৯, Unicode U+09E6–U+09EF) — the same digit forms as Bengali, corresponding to Arabic numerals 0–9. Part of the Bengali-Assamese Unicode Block.
Both Assamese digits and Western Arabic numerals (0–9) are widely used in contemporary Assamese writing, printing, and digital media across Assam.
Updated: