== Unicode ==
The basic [[Khmer (Unicode block)|Khmer block]] was added to the [[Unicode]] Standard in version 3.0, released in September 1999. It then contained 103 defined code points; this was extended to 114 in version 4.0, released in April 2003. Version 4.0 also introduced an additional block, called [[Khmer Symbols]], containing 32 signs used for writing [[lunar calendar|lunar dates]].

The Unicode block for basic Khmer characters is U+1780–U+17FF:
{{Unicode chart Khmer}}

The first 35 characters are the [[#Consonants|consonant letter]]s (including two obsolete). The symbols at U+17A3 and U+17A4 are deprecated: they were intended for use in Pali and Sanskrit transliteration, but are identical in appearance to the consonant {{lang|km|អ}}, written alone or with the ''a'' vowel. These are followed by the 15 [[#Independent vowels|independent vowels]] (including one obsolete and one variant form). The code points U+17B4 and U+17B5 are invisible combining marks for inherent vowels, intended for use only in special applications.

Next come the 16 [[#Dependent vowels|dependent vowel signs]] and the 12 [[#Diacritics|diacritics]] (excluding the ''kbiĕh kraôm'', which is identical in form to the ''ŏ'' dependent vowel). In the chart these are shown with a dotted circle, but in running text they should be displayed in combination with the preceding Khmer letter.

The code point U+17D2, called {{lang|km|ជើង}} ''{{transliteration|km|ceung}}'', meaning "foot", indicates that the following consonant is to be written in subscript form. It is not normally rendered as a visible character. U+17D3 was originally intended for use in writing lunar dates, but its use is now discouraged (see the Khmer Symbols block below).
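
The role of U+17D2 can be illustrated with a minimal Python sketch. It assumes the standard encoding order in which the subscript marker sits between the base consonant and the consonant to be subscripted; the function names <code>with_subscript</code> and <code>subscripted_consonants</code> are hypothetical and are not part of any library.

```python
import unicodedata

COENG = "\u17D2"  # the "foot" sign described above; invisible on its own


def with_subscript(base: str, sub: str) -> str:
    """Encode `base` with `sub` written beneath it (illustrative helper)."""
    return base + COENG + sub


def subscripted_consonants(text: str) -> list[str]:
    """Return every character that immediately follows a U+17D2 in `text`."""
    return [text[i + 1] for i, ch in enumerate(text[:-1]) if ch == COENG]


# The word "Khmer": KHA + (U+17D2 + MO) + vowel sign AE + RO
khmer = with_subscript("\u1781", "\u1798") + "\u17C2" + "\u179A"
print(khmer, [f"U+{ord(c):04X}" for c in khmer])
print(unicodedata.name(COENG))
```

The string holds five code points, although a renderer with Khmer support displays fewer visible glyphs, because U+17D2 only changes the shape of the consonant that follows it.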

The next seven characters are the [[#Spacing and punctuation|punctuation marks]] listed above; these are followed by the [[Cambodian riel|riel]] currency symbol, a rare sign corresponding to the Sanskrit [[avagraha]], and a mostly obsolete version of the ''vĭréam'' diacritic. The U+17Ex series contains the [[#Numerals|Khmer numerals]], and the U+17Fx series contains variants of the numerals used in [[divination]] lore.

The block with additional lunar date symbols is U+19E0–U+19FF:
{{Unicode chart Khmer Symbols}}

The symbols at U+19E0 and U+19F0 represent the first and second "eighth month" in a lunar year containing a leap-month (see [[Khmer calendar]]). The remaining symbols in this block denote the days of a lunar month: those in the U+19Ex series for waxing days, and those in the U+19Fx series for waning days.