Malayalam script

Revision as of 04:53, 24 May 2025 by imported>Naveen Sankar (→‎Malayalam)
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Template:Short description Template:Distinguish Template:Expand Malayalam Template:Use dmy dates Template:Infobox Writing system

Template:Contains special characters Template:Malayalam transliteration Template:Brahmic

Malayalam script (Template:Transliteration; {{#invoke:IPA|main}}<ref name="Ref_a"/><ref name="Grammatical Sketch">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> / Template:Langx) is a Brahmic script used commonly to write Malayalam, which is the principal language of Kerala, India, spoken by 45 million people in the world. It is a Dravidian language spoken in the Indian state of Kerala and the union territories of Lakshadweep and Puducherry (Mahé district) by the Malayali people. It is one of the official scripts of the Indian Republic.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref><ref name="Ref_2009">Template:Cite book</ref> Malayalam script is also widely used for writing Sanskrit texts in Kerala.

The Malayalam script bears high similarity with Tulu Script and Tigalari script, which was used for writing the Tulu language, spoken in coastal Karnataka (Dakshina Kannada and Udupi districts) and the northernmost Kasargod district of Kerala.<ref name="Tulu Unicode 2017">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> Like many other Indic scripts, it is an alphasyllabary (abugida), a writing system that is partially "alphabetic" and partially syllable-based. The modern Malayalam alphabet has 15 vowel letters, 42 consonant letters, and a few other symbols. The Malayalam script is a Vatteluttu alphabet extended with symbols from the Grantha alphabet to represent Indo-Aryan loanwords.<ref>Template:Cite book</ref> The script is also used to write several minority languages such as Paniya, Betta Kurumba, and Ravula.<ref name="Ref_">Ethnologue (16th ed.): "Paniya", "Kurumba, Betta", and "Ravula".</ref> The Malayalam language itself was historically written in several different scripts.

HistoryEdit

Malayalam was first written in the Vattezhuthu script, an ancient script of Tamil and Malayalam languages. However, the modern Malayalam script evolved from the Grantha alphabet, and Vattezhuthu, both of which evolved from the Tamil-Brahmi, but independently.

Vatteluttu (Template:Langx) is a script that had evolved from Tamil-Brahmi and was once used extensively in the southern part of present-day Tamil Nadu and in Kerala. The Vazhappally inscription issued by Rajashekhara Varman is the earliest example, dating from about 830 CE.<ref name="Omniglot">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref><ref name="Vazhapally Temple">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> During the medieval period, the Tigalari script that was used for writing Tulu in South Canara, and Sanskrit in the adjacent Malabar region, was very similar to the modern Malayalam script.<ref name="Tulu Unicode 2017"/> In the Tamil state, the modern Tamil script had supplanted Vattezhuthu by the 15th century, but in the Malabar region, Vattezhuthu remained in general use up to the 17th century,<ref name="Ref_b">Burnell (1874), p. 39.</ref> or the 18th century.<ref name="s">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> A variant form of this script, Kolezhuthu, was used until about the 19th century mainly in the Malabar-Cochin area.<ref name="l">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> Another variant form, Malayanma, was used in the south of Thiruvananthapuram.<ref name="l"/>

According to Arthur Coke Burnell, one form of the Grantha alphabet, originally used in the Chola dynasty, was imported into the southwest coast of India in the 8th or 9th century, which was then modified in course of time in this secluded area, where communication with the east coast was very limited.<ref name="b35">Burnell (1874), p. 35.</ref> It later evolved into Tigalari-Malayalam script was used by the Malayali, Havyaka Brahmins and Tulu Brahmin people, but was originally only applied to write Sanskrit. This script split into two scripts: Tigalari and Malayalam. While Malayalam script was extended and modified to write vernacular language Malayalam, the Tigalari was written for Sanskrit only.<ref name="b35"/><ref name="Ref_2009a">Template:Cite encyclopedia</ref> In Malabar, this writing system was termed Arya-eluttu ({{#invoke:Lang|lang}}, Ārya eḻuttŭ),<ref name="Ref_c">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> meaning "Arya writing" (Sanskrit is Indo-Aryan language while Malayalam is a Dravidian language).

Vatteluttu was in general use, but was not suitable for literature where many Sanskrit words were used. Like Tamil-Brahmi, it was originally used to write Tamil, and as such, did not have letters for voiced or aspirated consonants used in Sanskrit but not used in Tamil. For this reason, Vatteluttu and the Grantha alphabet were sometimes mixed, as in the Manipravalam. One of the oldest examples of the Manipravalam literature, Vaishikatantram ({{#invoke:Lang|lang}}, Vaiśikatantram), dates back to the 12th century,<ref name="Ref_d">Nampoothiri, N. M. (1999), "Cultural Traditions in Medieval Kerala"Template:Dead link (PDF) in Cherian, P. J., Perspectives on Kerala History: The Second Millennium, Kerala Council for Historical Research, Template:ISBN, retrieved 2009-11-20.</ref><ref name="Ref_e">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> where the earliest form of the Malayalam script was used, which seems to have been systematised to some extent by the first half of the 13th century.<ref name="Omniglot"/><ref name="s"/>

It is Thunchaththu Ezhuthachan who is also credited with the development of Malayalam script into the current form through the intermixing and modification of the erstwhile scripts of Vatteluttu, Kolezhuthu, and Grantha script, which were used to write the inscriptions and literary works of Old and Middle Malayalam.<ref name="mlm"/> He further eliminated excess and unnecessary letters from the modified script.<ref name="mlm">Template:Cite book</ref> Hence, Ezhuthachan is also known as The Father of modern Malayalam.<ref name="mlm"/> The development of modern Malayalam script was also heavily influenced by the Tigalari script, which was used to write the Tulu language, due to the influence of Tuluva Brahmins in Kerala.<ref name="mlm"/>

Thunchaththu Ezhuthachan, a poet from around the 16th century,<ref name=":10">Template:Cite book</ref> used Arya-eluttu to write his Malayalam poems based on Classical Sanskrit literature.<ref name="b35"/> For a few letters missing in Arya-eluttu (ḷa, ḻa, ṟa), he used Vatteluttu. His works became unprecedentedly popular to the point that the Malayali people eventually started to call him the father of the Malayalam language, which also popularised Arya-eluttu as a script to write Malayalam. However, Grantha did not have distinctions between e and ē, and between o and ō, as it was as an alphabet to write a Sanskrit language. The Malayalam script as it is today was modified in the middle of the 19th century when Hermann Gundert invented the new vowel signs to distinguish them.<ref name="b35"/>

By the 19th century, old scripts like Kolezhuthu had been supplanted by Arya-eluttu – that is the current Malayalam script. Nowadays, it is widely used in the press of the Malayali population in Kerala.<ref name="Ref_f">Andronov, Mikhail Sergeevich. A Grammar of the Malayalam Language in Historical Treatment. Wiesbaden : Harrassowitz, 1996.</ref>

Orthography reformEdit

In 1971, the Government of Kerala reformed the orthography of Malayalam by a government order to the education department.<ref name="Ref_1971">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref><ref name="Ref_g">Template:Cite book</ref> The objective was to simplify the script for print and typewriting technology of that time, by reducing the number of glyphs required. In 1967, the government appointed a committee headed by Sooranad Kunjan Pillai, who was the editor of the Malayalam Lexicon project. It reduced number of glyphs required for Malayalam printing from around 1000 to around 250. Above committee's recommendations were further modified by another committee in 1969.

This proposal was later accepted by major newspapers in January 1971. The reformed script came into effect on 15 April 1971 (the Kerala New Year), by a government order released on 23 March 1971.

Recommendations by the committeesEdit

Use non-ligating vowel signs for u, ū, and Edit

In the traditional orthography that had been taught in the primary education system before the reforms, any consonant or consonant ligature followed by the vowel sign u, ū, or were represented by a cursive consonant-vowel ligature. The glyph of each consonant had its own way of ligating with these vowel signs. This irregularity was simplified in the reformed script.<ref>Manohar, Kavya & Thottingal, Santhosh. (2018). "Malayalam Orthographic Reforms: Impact on Language and Popular Culture". Presented at the Graphematik 2018.</ref> Thus, a vowel sign or consonant sign would always have a disconnected symbol that did not fuse with the base consonant.

Examples:

  • ku: → {{#invoke:Lang|lang}}
  • kū: → {{#invoke:Lang|lang}}
  • kr̥: → {{#invoke:Lang|lang}}
  • nu: → {{#invoke:Lang|lang}}
  • śu: → {{#invoke:Lang|lang}}

Split uncommon conjuncts with ChandrakkalaEdit

Also, most of traditional consonant-consonant ligatures, especially the less common ones only used to write words of Sanskrit origin, were split into non-ligated forms with explicit chandrakkala. For example:

  • {{#invoke:Lang|lang}} g + {{#invoke:Lang|lang}} da = gda: -> {{#invoke:Lang|lang}}
  • {{#invoke:Lang|lang}} l + {{#invoke:Lang|lang}} ta = lta: -> {{#invoke:Lang|lang}}
  • {{#invoke:Lang|lang}} ś + {{#invoke:Lang|lang}} na = śna: -> {{#invoke:Lang|lang}}
  • {{#invoke:Lang|lang}} ś + {{#invoke:Lang|lang}} ma = śma: -> {{#invoke:Lang|lang}}
  1. The ligature ({{#invoke:Lang|lang}}) śma is required as an additional letter. For examples, {{#invoke:Lang|lang}}, śmaśanam, is the word for cemetery.

Use non-ligating sign for conjoining raEdit

Any consonant or consonant ligature followed by the conjoining ra is represented by a cursive tail attached to the consonant or the consonant-ligature. In the reformed script, this consonant sign would be disconnected from the base and represented as a left-bracket like symbol placed on the left side of the cluster.

  • kra:{{#invoke:Lang|lang}} → {{#invoke:Lang|lang}}
  • kru: {{#invoke:Lang|lang}} → {{#invoke:Lang|lang}}

Current statusEdit

Today the reformed orthography, is commonly called put̪iya lipi (Template:Langx) and traditional system, pazhaya lipi (Template:Langx).<ref name="John">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> Current print media almost entirely uses reformed orthography. The state run primary education introduces the Malayalam writing to the pupils in reformed script only and the books are printed accordingly. However, the digital media uses both traditional and reformed in almost equal proportions as the fonts for both the orthographies are commonly available.

DescriptionEdit

CharacteristicsEdit

The basic characters can be classified as follows:

  • Vowels ({{#invoke:Lang|lang}}, svaram)
    1. Independent vowel letters
    2. Dependent vowel signs ({{#invoke:Lang|lang}},svarachinnam)
  • Consonant letters ({{#invoke:Lang|lang}}, vyañjanam)

An independent vowel letter is used as the first letter of a word that begins with a vowel. A consonant letter, despite its name, does not represent a pure consonant, but represents a consonant + a short vowel /a/ by default. For example, {{#invoke:Lang|lang}} is the first consonant letter of the Malayalam alphabet, which represents /ka/, not a simple /k/. A vowel sign is a diacritic attached to a consonant letter to indicate that the consonant is followed by a vowel other than /a/. If the following vowel is /a/, no vowel sign is needed. The phoneme /a/ that follows a consonant by default is called an inherent vowel. In Malayalam, its phonetic value is unrounded Template:IPAblink,<ref name="Ref_a">Canepari (2005), pp. 396, 140.</ref> or Template:IPAblink as an allophone. To denote a pure consonant sound not followed by a vowel, a special diacritic virama is used to cancel the inherent vowel. The following are examples where a consonant letter is used with or without a diacritic.

  • {{#invoke:Lang|lang}} /k/ = /k/ which is a consonant sound
  • {{#invoke:Lang|lang}} ka = {{#invoke:Lang|lang}} /k/ + {{#invoke:Lang|lang}} vowel sign a
  • {{#invoke:Lang|lang}} ki = {{#invoke:Lang|lang}} /k/ + {{#invoke:Lang|lang}} vowel sign i
  • {{#invoke:Lang|lang}} ku = {{#invoke:Lang|lang}} /k/ + {{#invoke:Lang|lang}} vowel sign u
  • {{#invoke:Lang|lang}} kai = {{#invoke:Lang|lang}} /k/ + {{#invoke:Lang|lang}} vowel sign ai

Malayalam alphabet is unicase, or does not have a case distinction. It is written from left to right, but certain vowel signs are attached to the left (the opposite direction) of a consonant letter that it logically follows. In the word {{#invoke:Lang|lang}} (Kēraḷam), the vowel sign {{#invoke:Lang|lang}} (ē) visually appears in the leftmost position, though the vowel ē logically follows the consonant k.

Malayalam lettersEdit

VowelsEdit

Vowel letters and vowel signsEdit

The following tables show the independent vowel letters and the corresponding dependent vowel signs (diacritics) of the Malayalam script, with romanizations in ISO 15919, transcriptions in the International Phonetic Alphabet (IPA).

Monophthongs
  Short Long
Independent Dependent Indep. Dependent
Vowel sign Example Vowel sign Example
a lang}} a
{{#invoke:IPA|main}}
(none) lang}} pa
{{#invoke:IPA|main}}
lang}} ā
{{#invoke:IPA|main}}
lang}}
 
lang}}
{{#invoke:IPA|main}}
i lang}} i
{{#invoke:IPA|main}}
lang}}
 
lang}} pi
{{#invoke:IPA|main}}
lang}} ī
{{#invoke:IPA|main}}
lang}}
 
lang}}
{{#invoke:IPA|main}}
u lang}} u
{{#invoke:IPA|main}}
lang}}
 
lang}} pu
{{#invoke:IPA|main}}
lang}} ū
{{#invoke:IPA|main}}
lang}}
 
lang}}
{{#invoke:IPA|main}}
lang}}
{{#invoke:IPA|main}}
lang}}
 
lang}} pr̥
{{#invoke:IPA|main}}
{{#invoke:Lang|lang}} r̥̄
{{#invoke:IPA|main}}
{{#invoke:Lang|lang}}
 
{{#invoke:Lang|lang}} pr̥̄
{{#invoke:IPA|main}}
lang}}
{{#invoke:IPA|main}}
lang}}
 
lang}} pl̥
{{#invoke:IPA|main}}
lang}} l̥̄
{{#invoke:IPA|main}}
lang}}
 
lang}} pl̥̄
{{#invoke:IPA|main}}
e lang}} e
{{#invoke:IPA|main}}
lang}}
 
lang}} pe
{{#invoke:IPA|main}}
lang}} ē
{{#invoke:IPA|main}}
lang}}
 
lang}}
{{#invoke:IPA|main}}
o lang}} o
{{#invoke:IPA|main}}
lang}}
 
lang}} po
{{#invoke:IPA|main}}
lang}} ō
{{#invoke:IPA|main}}

 
lang}}
{{#invoke:IPA|main}}

, r̥̄, , l̥̄, used to write Sanskrit words, are treated as vowels. They are called semi-vowels and are phonetically closer to vowels in Malayalam and in Classical Sanskrit where Panini, the Sanskrit grammarian, groups them with vowel sounds in his sutras. (see Proto-Indo-European language and Vedic Sanskrit). The letters and signs for r̥̄, , l̥̄ are very rare, and are not considered as part of the modern orthography.<ref name="K">Template:Cite journal See also the May 2001 version Template:Webarchive (PDF).</ref>

The vowel signs ā, i, ī are placed to the right of a consonant letter to which it is attached. The vowel signs e, ē, ai are placed to the left of a consonant letter. The vowel signs o and ō consist of two parts: the first part goes to the left of a consonant letter and the second part goes to the right of it. In the reformed orthography, the vowel signs u, ū, are simply placed to the right of the consonant letter, while they often make consonant-vowel ligatures in the traditional orthography.

Diphthongs
  Independent Dependent
Vowel sign Example
ai lang}} ai
{{#invoke:IPA|main}}
lang}}
 
lang}} pai
{{#invoke:IPA|main}}
au {{#invoke:Lang|lang}} au
{{#invoke:IPA|main}}
{{#invoke:Lang|lang}}
(archaic)
{{#invoke:Lang|lang}} pau
{{#invoke:IPA|main}}
lang}}
(modern)
lang}} pau
{{#invoke:IPA|main}}

It is important to note the vowel duration as it can be used to differentiate words that would otherwise be the same. For example, {{#invoke:IPA|main}} means "earthenware pot" while {{#invoke:IPA|main}} means "time" or "season".<ref name="Ref_h">Asher, R. E. Malayalam. Ed. T. C. Kumari 1934-. London; New York : Routledge, 1997.</ref>

AnusvaramEdit

Anusvaram
aṁ lang}} aṁ
{{#invoke:IPA|main}}
lang}} m̐
{{#invoke:IPA|main}}
lang}} paṁ
{{#invoke:IPA|main}}

An anusvaram ({{#invoke:Lang|lang}} anusvāram), or an anusvara, originally denoted the nasalization where the preceding vowel was changed into a nasalised vowel, and hence is traditionally treated as a kind of vowel sign. In Malayalam, however, it simply represents a consonant {{#invoke:IPA|main}} after a vowel, though this {{#invoke:IPA|main}} may be assimilated to another nasal consonant. It is a special consonant letter, different from a "normal" consonant letter, in that it is never followed by an inherent vowel or another vowel. In general, an anusvara at the end of a word in an Indian language is transliterated as in ISO 15919, but a Malayalam anusvara at the end of a word is transliterated as m without a dot.

VisargamEdit
Visargam ഃ
aḥ lang}} aḥ
{{#invoke:IPA|main}}
lang}}
{{#invoke:IPA|main}}
lang}} paḥ
{{#invoke:IPA|main}}

A visargam ({{#invoke:Lang|lang}}, visargam), or visarga, represents a consonant {{#invoke:IPA|main}} after a vowel, and is transliterated as . Like the anusvara, it is a special symbol, and is never followed by an inherent vowel or another vowel.

ConsonantsEdit

Basic consonant lettersEdit

The following tables show the basic consonant letters of the Malayalam script, with romanizations in ISO 15919, transcriptions in IPA, and Unicode Template:Mono. The character names used in the report of the Government of Kerala committee (2001) are shown in lowercase italics when different from Unicode character names.<ref name="K"/> Those alternative names are based on the traditional romanization used by the Malayali people. For example, tha in "Thiruvananthapuram" is neither ISO tha nor Unicode Template:Mono, but tha in this sense ({{#invoke:Lang|lang}}). The ISCII (IS 13194:1991) character names are given in parentheses when different from the above.

Varga consonants
  Voiceless Voiced
Unaspirated Aspirated Unaspirated Aspirated Nasal
Velar Template:Letter Template:Letter Template:Letter Template:Letter Template:Letter
Postalveolar
or
Alveolo-palatal
Template:Letter Template:Letter Template:Letter Template:Letter Template:Letter
Retroflex Template:Letter Template:Letter Template:Letter Template:Letter Template:Letter
Dental Template:Letter Template:Letter Template:Letter Template:Letter Template:Letter
Labial Template:Letter Template:Letter Template:Letter Template:Letter Template:Letter
Other consonants
Template:Letter Template:Letter Template:Letter Template:Letter
Template:Letter Template:Letter Template:Letter Template:Letter
Template:Letter Template:Letter Template:Letter Template:Letter Template:Letter

Template:Reflist

ChillusEdit

Template:See also A chillu, or a chillaksharam ({{#invoke:Lang|lang}}, cillakṣaram), is a special consonant letter that represents a pure consonant independently, without help of a virama. Unlike a consonant represented by an ordinary consonant letter, this consonant is never followed by an inherent vowel. Anusvara and visarga fit this definition but are not usually included. ISCII and Unicode 5.0 treat a chillu as a glyph variant of a normal ("base") consonant letter.<ref name="TUS50">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> In Unicode 5.1 and later, however, chillu letters are treated as independent characters, encoded atomically.<ref name="Unicode5.1.0"/>

Six independent chillu letters (0D7A..0D7F) had been encoded in Unicode 5.1.,<ref name="Unicode5.1.0"/> three additional chillu letters (0D54..0D56) were encoded with the publication of Unicode 9.0.<ref>Unicode 12.1.0 Derived Age. Published 2019-04-01, Retrieved 2019-09-15.</ref>

Chillu letters
Letter Unicode name Base Remarks Examples
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} lang}} (kūṇ, "mushroom")
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} Chillu of alveolar nasal ṉa. lang}} (avaṉ, "he")
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} Historically stood for Template:Transliteration {{#invoke:Lang|lang}}, not Template:Transliteration {{#invoke:Lang|lang}}. lang}} (avar̠, "they")
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} lang}} (oṭuvil, "finally")
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} lang}} (avaḷ, "she")
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} Not in modern use lang}} (does not occur word finally.)
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} Not in modern use
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} Not in modern use
{{#invoke:Lang|lang}} Template:Mono Template:Transliteration {{#invoke:Lang|lang}} Not in modern use

ChandrakkalaEdit

The virama in Malayalam is called candrakkala (chandrakkala), it has two functions:<ref>Cibu Johny; Shiju Alex; Sunil V S. (2015). L2/14-014R Proposal to encode Malayalam Sign Circular Virama.</ref><ref>Cibu Johny; Shiju Alex; Sunil V S. (2015). L2/14-015R Proposal to encode Malayalam Sign Vertical Bar Virama.</ref>Template:Efn

  • As virama: used to suppress the inherent vowel
  • As samvruthokaram: represent the "half-u" sound /ə̆/
As viramaEdit

Chandrakkala {{#invoke:Lang|lang}} ({{#invoke:Lang|lang}}, candrakkala) is a diacritic attached to a consonant letter to show that the consonant is not followed by an inherent vowel or any other vowel (for example, {{#invoke:Lang|lang}} ka → {{#invoke:Lang|lang}} k). This kind of diacritic is common in Indic scripts, generically called virama in Sanskrit, or halant in Hindi.

Half-uEdit

At the end of a word, the same symbol sometimes represents a very short vowel, known as "half-u", or "samvruthokaram" ({{#invoke:Lang|lang}}, Template:Transliteration), or Template:Transliteration ({{#invoke:Lang|lang}}).<ref name="ChitrajakumarEtGangadharan2005">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> The exact pronunciation of this vowel varies from dialect to dialect, but it is approximately {{#invoke:IPA|main}}<ref name="N3126"/> or {{#invoke:IPA|main}}, and transliterated as ŭ (for example, {{#invoke:Lang|lang}} na → {{#invoke:Lang|lang}} ). Optionally, a vowel sign u is inserted, as in {{#invoke:Lang|lang}} (= {{#invoke:Lang|lang}} + {{#invoke:Lang|lang}} + {{#invoke:Lang|lang}}). According to one author, this alternative form is historically more correct, though the simplified form without a vowel sign u is common nowadays.<ref name="S"/> This means that the same spelling {{#invoke:Lang|lang}} may represent either n or depending on the context. Generally, it is at the end of a word, and n elsewhere; {{#invoke:Lang|lang}} always represents .

The virama of Tigalari script behave similarly to Malayalam. Virama has three functions: to suppress the inherent vowel (as the halant of Devanagari); to form conjunct consonants; to represent the half-u.<ref>Murthy, Vaishnavi & Rajan, Vinodh. (2017). L2/17-378 Preliminary proposal to encode Tigalari script in Unicode (pp. 12-15).</ref><ref>Srinidhi, A. & Sridatta, A. (2017). L2/17-182 Comments on encoding the Tigalari script (pp. 9-11).</ref> Devanagari supports half-u for Kashmiri; for example {{#invoke:Lang|lang}} is written as {{#invoke:Lang|lang}}. Template:Notelist

LigaturesEdit

Consonant ligaturesEdit

Like in other Indic scripts, a virama is used in the Malayalam script to cancel—or "kill"—the inherent vowel of a consonant letter and represent a consonant without a vowel, so-called a "dead" consonant. For example,

  1. {{#invoke:Lang|lang}} is a consonant letter na,
  2. {{#invoke:Lang|lang}} is a virama; therefore,
  3. {{#invoke:Lang|lang}} (na + virama) represents a dead consonant n.

If this n {{#invoke:Lang|lang}} is further followed by another consonant letter, for example, ma {{#invoke:Lang|lang}}, the result may look like {{#invoke:Lang|lang}}, which represents nma as na + virama + ma. In this case, two elements n {{#invoke:Lang|lang}} and ma {{#invoke:Lang|lang}} are simply placed one by one, side by side. Alternatively, nma can be also written as a ligature {{#invoke:Lang|lang}}.

Generally, when a dead consonant letter C1 and another consonant letter C2 are conjoined, the result may be either:

  1. A fully conjoined ligature of C1+C2;
  2. Half-conjoined—
    • C1-conjoining: a modified form (half form) of C1 attached to the original form (full form) of C2
    • C2-conjoining: a modified form of C2 attached to the full form of C1; or
  3. Non-ligated: full forms of C1 and C2 with a visible virama.<ref name="Constable2004">{{#invoke:citation/CS1|citation

|CitationClass=web }}</ref>

If the result is fully or half-conjoined, the (conceptual) virama which made C1 dead becomes invisible, only logically existing in a character encoding scheme such as Unicode. If the result is non-ligated, a virama is visible, attached to C1. The glyphs for nma has a visible virama if not ligated ({{#invoke:Lang|lang}}), but if ligated, the virama disappears ({{#invoke:Lang|lang}}). Usually the difference between those forms is superficial and both are semantically identical, just like the meaning of the English word palaeography does not change even if it is spelled palæography, with the ligature æ.

Common consonant ligaturesEdit

Several consonant-consonant ligatures are used commonly even in the new orthography.

lang}}
  kka ṅka ṅṅa cca ñca ñña ṭṭa ṇṭa ṇṇa tta nta nna ppa mpa mma
lang}} ക്‌ക ങ്‌ക ങ്‌ങ ച്‌ച ഞ്‌ച ഞ്‌ഞ ട്‌ട ണ്‌ട ണ്‌ണ ത്‌ത ന്‌ത ന്‌ന പ്‌പ മ്‌പ മ്‌മ
lang}} ക്ക ങ്ക ങ്ങ ച്ച ഞ്ച ഞ്ഞ ട്ട ണ്ട ണ്ണ ത്ത ന്ത ന്ന പ്പ മ്പ മ്മ

The ligature mpa {{#invoke:Lang|lang}} was historically derived from npa {{#invoke:Lang|lang}}. The ligatures cca, bba, yya, and vva are special in that a doubled consonant is denoted by a triangle sign below a consonant letter.

  cca bba yya vva
lang}} ച്‌ച ബ്‌ബ യ്‌യ വ്‌വ
lang}} ച്ച ബ്ബ യ്യ വ്വ
Consonant + ya, va, la, raEdit
  1. The consonant letter ya is generally C2-conjoining after a consonant in both orthographies. For example,
  • k {{#invoke:Lang|lang}} + ya {{#invoke:Lang|lang}} = kya {{#invoke:Lang|lang}}
  • p {{#invoke:Lang|lang}} + ya {{#invoke:Lang|lang}} = pya {{#invoke:Lang|lang}}

In kya {{#invoke:Lang|lang}}, a variant form of ya ({{#invoke:Lang|lang}}) is placed after the full form of ka {{#invoke:Lang|lang}}, just like ki {{#invoke:Lang|lang}} is written ka {{#invoke:Lang|lang}} followed by the vowel sign i {{#invoke:Lang|lang}}. In other words, the variant form of ya ({{#invoke:Lang|lang}}) used after a consonant letter can be considered as a diacritic. Since it is placed after the base character, it is sometimes referred to as a post-base form. An exception is yya {{#invoke:Lang|lang}} (see above).

  1. Similarly, va ({{#invoke:Lang|lang}}) after a consonant takes a post-base form:
  • k {{#invoke:Lang|lang}} + va {{#invoke:Lang|lang}} = kva {{#invoke:Lang|lang}}
  • p {{#invoke:Lang|lang}} + va {{#invoke:Lang|lang}} = pva {{#invoke:Lang|lang}}

An exception is vva {{#invoke:Lang|lang}} (see above).

  1. The consonant letter la ({{#invoke:Lang|lang}}) after a consonant traditionally takes a below-base form. These forms are used also in the new orthography, though some fonts do not support them.
  • k {{#invoke:Lang|lang}} + la {{#invoke:Lang|lang}} = kla {{#invoke:Lang|lang}}
  • p {{#invoke:Lang|lang}} + la {{#invoke:Lang|lang}} = pla {{#invoke:Lang|lang}}
  • l {{#invoke:Lang|lang}} + la {{#invoke:Lang|lang}} = lla {{#invoke:Lang|lang}}
  1. A consonant letter ra (്ര) after a consonant usually takes a pre-base form in the reformed orthography, while this combination makes a fully conjoined ligature in the traditional orthography.
  • k {{#invoke:Lang|lang}} + ra {{#invoke:Lang|lang}} = kra {{#invoke:Lang|lang}}
  • p {{#invoke:Lang|lang}} + ra {{#invoke:Lang|lang}} = pra {{#invoke:Lang|lang}}
nṯa and ṯṯaEdit

The ligature nṯa is written as n {{#invoke:Lang|lang}} + ṟa {{#invoke:Lang|lang}} and pronounced {{#invoke:IPA|main}}. The ligature ṯṯa is written as {{#invoke:Lang|lang}} + ṟa {{#invoke:Lang|lang}}.

  nṯa ṯṯa
lang}} ന്‌റ റ്‌റ
lang}} ന്റ റ്റ
lang}} ൻറ ററ

In those two ligatures, a small ṟa {{#invoke:Lang|lang}} is written below the first letter (chillu-n if it is a dead n). Alternatively, the letter ṟa is sometimes written to the right of the first letter, making a digraph (just like {{#invoke:Lang|lang}} used instead of {{#invoke:Lang|lang}} in Greek). The spelling {{#invoke:Lang|lang}} is therefore read either nṟa (two separate letters) or nṯa (digraph) depending on the word like in {{#invoke:Lang|lang}} (en̠r̠ōḷ) 'enroll' or {{#invoke:Lang|lang}} (hen̠r̠i) 'Henry' but {{#invoke:Lang|lang}} is always read nṯa. Similarly, {{#invoke:Lang|lang}} is read either ṟaṟa or ṯṯa.<ref name="Unicode5.1.0"/>

Dot rephEdit

In the traditional orthography, a dead consonant r before a consonant sometimes takes an above-base form, known as a dot reph, which looks like a short vertical line or a dot. Generally, a chillu-r is used instead of a dot reph in the reformed orthography.

  • r {{#invoke:Lang|lang}} + ga {{#invoke:Lang|lang}} = rga {{#invoke:Lang|lang}} (Reformed: {{#invoke:Lang|lang}})
  • r {{#invoke:Lang|lang}} + ja {{#invoke:Lang|lang}} = rja {{#invoke:Lang|lang}} (Reformed: {{#invoke:Lang|lang}})
Consonant-vowel ligaturesEdit

In the pre-1971 orthography, consonant + the vowels u, ū, r̥ were written as ligatures, post-1971 they are written with symbols after the letter. They can be still seen in old signs and used by people who learned to write before 1971.

r̥̄ l̥ l̥̄ (which are not part of modern orthography) were also written as ligatures but there were not any words with l̥̄ even in Sanskrit; r̥̄ was only used grammatically instead of r̥ in Sanskrit so it was not used either; there is only one root with l̥ in Sanskrit {{#invoke:Lang|lang}} which was loaned into Malayalam as {{#invoke:Lang|lang}}.

Consonant-consonant ligaturesEdit

Although there are consonant-consonant ligatures used even now like {{#invoke:Lang|lang}} and {{#invoke:Lang|lang}} almost all clusters were written as ligatures before 1971, most of the time the second consonant was written to the bottom right of the first consonant, in consonant + r clusters the {{#invoke:Lang|lang}} was attached to the main consonant, now its detached and placed to the left.

Archaic signsEdit

{{#invoke:Lang|lang}}Edit

{{#invoke:Lang|lang}} was made by A. R. Raja Raja Varma and it was not used as a single letter, in his orthography {{#invoke:Lang|lang}} (ṯṯ) was written as {{#invoke:Lang|lang}} and {{#invoke:Lang|lang}} (ṉḏ) as {{#invoke:Lang|lang}}.

Archaic viramasEdit

Before chandrakkala was made, there were two other viramas used simultaneously, the vertical bar virama {{#invoke:Lang|lang}} and circular virama {{#invoke:Lang|lang}}. The vertical bar virama was used exclusively for loanwords and circular virama just for native words. Before the vertical bar virama used to cut through the main consonant and it led to the creation of the chillu letters. It was sometimes confused with the dot reph {{#invoke:Lang|lang}} since they look similar but both of them are used for different purposes (see above for dot reph).<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

ChandrabinduEdit

{{#invoke:Lang|lang}} was like the chandrabindu from other scripts and was used to nasalise the vowel; it was only used for writing Sanskrit and Prakrits. It is archaic.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Other AnusvarasEdit

{{#invoke:Lang|lang}} was used like the Devanagari {{#invoke:Lang|lang}} and contrasts with the normal anusvara.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> {{#invoke:Lang|lang}} was not really used in Malayalam but was used in the Grantha script, in it the normal anusvara {{#invoke:Lang|lang}} represents gemination of the next consonant and this anusvara represents an actual linguistic anusvara. Both are archaic.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Numeral systemEdit

{{#invoke:Labelled list hatnote|labelledList|Main article|Main articles|Main page|Main pages}} Malayalam numeral system is archaic and no longer commonly used. Instead, the common Hindu-Arabic numeral system is followed.

0 1 2 3 4 5 6 7 8 9 10 100 1000 Template:1/4 Template:1/2 Template:3/4

Number "11" is written as "{{#invoke:Lang|lang}}" and not "{{#invoke:Lang|lang}}". "32" is written as "{{#invoke:Lang|lang}}" similar to the Tamil numeral system.

11 20 21 30 110 10,099
൰൧ ൨൰ ൨൰൧ ൩൰ ൱൰ ൰൲൯൰൯

Suppose the number is "2013". It is read in Malayalam as "{{#invoke:Lang|lang}}" (raṇḍāyiratti padimūnnu). It is split into :

  • {{#invoke:Lang|lang}} (raṇḍŭ) : 2 - {{#invoke:Lang|lang}}
  • {{#invoke:Lang|lang}} (āyiram) : 1000 - {{#invoke:Lang|lang}}
  • {{#invoke:Lang|lang}} (pattŭ) : 10 - {{#invoke:Lang|lang}}
  • {{#invoke:Lang|lang}} (mūnnŭ) : 3 - {{#invoke:Lang|lang}}

Combine them together to get the Malayalam number "{{#invoke:Lang|lang}}".<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Other symbolsEdit

Praslesham lang}} Corresponds to Devanagari avagraha, used when a Sanskrit phrase containing an avagraha is written in Malayalam script. The symbol indicates the elision of the word-initial vowel a after a word that ends in ā, ē, or ō, and is transliterated as an apostrophe (’), or sometimes as a colon + an apostrophe (:’).
(Template:Indic)
Malayalam date mark lang}} Used in an abbreviation of a date.
Danda lang}} Archaic punctuation marks used as full stops or for delimiting verses.
Double danda lang}}

Sample textEdit

The following text is Article 1 of the Universal Declaration of Human Rights.

EnglishEdit

All human beings are born free and equal in dignity and rights. They are endowed with reason and conscience and should act towards one another in a spirit of brotherhood.

MalayalamEdit

{{#invoke:Lang|lang}}.

Romanisation (ISO 15919)Edit

man̠uṣyarellāvaruṁ tulyāvakāśaṅṅaḷōṭuṁ antassōṭuṁ svātantryattōṭuṅkūṭi jan̠icciṭṭuḷḷavarāṇ‌ŭ. an̠yōn̠yaṁ bhrātr̥bhāvattōṭe perumāṟuvān̠āṇ‌ŭ man̠uṣyan̠ŭ vivēkabuddhiyuṁ man̠asākṣiyuṁ siddhamāyirikkunnat‌ŭ.

IPAEdit

/manuʂjaɾelːaːʋaɾum t̪uljaːʋakaːʃaŋːaɭoːʈum an̪t̪asːoːʈum sʋaːt̪an̪tɾjat̪ːoːʈuŋkuːʈi d͡ʒanit͡ʃːiʈːuɭːaʋaɾaːɳɨ̆ ǁ anjoːnjam bʱraːt̪rɨ̆bʱaːʋat̪ːoːʈe peɾumaːruʋaːnaːɳɨ̆ manuʂjanɨ̆ ʋiʋeːkabud̪ːʱijum manasaːkʂijum sid̪ːʱamaːjiɾikːun̪ːat̪ɨ̆ ǁ/

UnicodeEdit

Malayalam script was added to the Unicode Standard in October, 1991 with the release of version 1.0.

BlockEdit

{{#invoke:Labelled list hatnote|labelledList|Main article|Main articles|Main page|Main pages}} The Unicode block for Malayalam is U+0D00–U+0D7F:

Template:Unicode chart Malayalam

Chillus in UnicodeEdit

For example, avan {{#invoke:Lang|lang}} ("he") is written as a {{#invoke:Lang|lang}} + va {{#invoke:Lang|lang}} + chillu-n {{#invoke:Lang|lang}}, where chillu-n represents the n sound without a vowel. In other Indic scripts, the same word would be possibly written as a + va + na + virama. However, in Malayalam script, that sequence represents a different word, avanŭ {{#invoke:Lang|lang}} ("to him"), and is not interchangeable with avan.<ref name="Johny2005">{{#invoke:citation/CS1|citation |CitationClass=web }} See also L2/05-085 (PDF).</ref> This is because in modern Malayalam script, the sign for a virama also works as the sign for a vowel ŭ at the end of a word, and is not able to cleanly "kill" the inherent vowel in this case.<ref name="N3126">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

To differentiate a pure consonant (chillu) and a consonant with ŭ, zero-width joiner (ZWJ) and zero-width non-joiner (ZWNJ) were used before Unicode 5.1.<ref name="TUS50"/> However, this system was problematic. Among other things, glyph variants specified by ZWJ or ZWNJ are supposed to be non-semantic, whereas a chillu (expressed as letter + virama + ZWJ) and the same consonant followed by a ŭ (expressed as letter + virama + ZWNJ) are often semantically different. After a long debate,<ref name="N3126"/><ref name="S">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> Nine chillu letters now have their own code points since Unicode 9.0 (though only 5 of them are used in modern Malayalam), though applications should also be prepared to handle data in the representation specified in Unicode 5.0.<ref name="Unicode5.1.0">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> This means, fonts should display chillus in both sequences; while an input method should output standard chillus.

The ligature nṯa {{#invoke:Lang|lang}} is very common and supported by most Malayalam fonts in one way or another, but exactly how it should be encoded was not clear in Unicode 5.0 and earlier, and two incompatible implementations are currently in use.<ref name="Ref_2005">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> In Unicode 5.1 (2008), the sequence to represent it was explicitly redefined as chillu-n + virama + ṟa ({{#invoke:Lang|lang}}).<ref name="Unicode5.1.0"/> {{#invoke:Lang|lang}} ligature is often considered to be the correct form to represent n̠d̠ as {{#invoke:Lang|lang}} can also represent n̠r̠ but in many computers it is only shown with {{#invoke:Lang|lang}} + {{#invoke:Lang|lang}} + {{#invoke:Lang|lang}} even though a chandrakkala cannot be after a chillu letter, other computers show it with {{#invoke:Lang|lang}} + {{#invoke:Lang|lang}} + {{#invoke:Lang|lang}}. Some computers display {{#invoke:Lang|lang}} ({{#invoke:Lang|lang}} + {{#invoke:Lang|lang}} + {{#invoke:Lang|lang}}) and {{#invoke:Lang|lang}} ({{#invoke:Lang|lang}} + {{#invoke:Lang|lang}} + {{#invoke:Lang|lang}}) differently.

Image galleryEdit

{{#invoke:Gallery|gallery}}

See alsoEdit

Template:Div col

Template:Div col end

ReferencesEdit

Template:Reflist

SourcesEdit

Further readingEdit

External linksEdit

Template:Sister project

Template:List of writing systems Template:Malayalam Literature Template:Portal bar