Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Devanagari transliteration
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==Alternative transliteration methods== ===Schemes with diacritics=== ====National Library at Kolkata romanisation==== {{main|National Library at Kolkata romanisation}} The [[National Library at Kolkata romanisation]], intended for the romanisation of all [[Brahmic family|Indic scripts]], is an extension of [[IAST]]. It differs from IAST in the use of the symbols ē and ō for {{lang|hi|ए}} and {{lang|hi|ओ}} (e and o are used for the short vowels present in many Indian languages), the use of 'ḷ' for the consonant (in [[Kannada]]) {{lang|kn|ಳ}}, and the absence of symbols for {{lang|hi|ॠ}}, {{lang|hi|ऌ}} and {{lang|hi|ॡ}}. ====ISO 15919==== {{main|ISO 15919}} A standard [[transliteration]] convention not just for Devanagari,<ref>[http://transliteration.eki.ee/pdf/Hindi-Marathi-Nepali.pdf Devanagari comparison: ISO, ALA-LC, Hunterian] in 2005 by Thomas T. Pedersen at Eesti Keele Instituut [http://transliteration.eki.ee/ Transliteration of Non-Roman Scripts]</ref> but for all South-Asian languages was codified in the ISO 15919 standard of 2001, providing the basis for modern digital libraries that conform to International Organization for Standardization (ISO) norms. ISO 15919 defines the common Unicode basis for Roman transliteration of South-Asian texts in a wide variety of languages/scripts. ISO 15919 transliterations are platform-independent texts so that they can be used identically on all modern operating systems and software packages, as long as they comply with ISO norms. This is a prerequisite for all modern platforms so that ISO 15919 has become the new standard for digital libraries and archives for transliterating all South Asian texts.{{Original research inline|date=February 2011}} ISO 15919<ref>[http://homepage.ntlworld.com/stone-catend/trind.htm Transliteration of Indic scripts: how to use ISO 15919] {{Webarchive|url=https://web.archive.org/web/20160414223033/http://homepage.ntlworld.com/stone-catend/trind.htm |date=14 April 2016 }} ntlworld.com.</ref> uses [[diacritic]]s to map the much larger set of [[Brāhmī script|Brahmi]]c graphemes to the Latin script. The Devanagari-specific portion is nearly identical to the academic standard, [[IAST]]: "International Alphabet of Sanskrit Transliteration", and to [[ALA-LC]], the United States Library of Congress standard.<ref>[https://www.loc.gov/catdir/cpso/romanization/hindi.pdf Hindi romanization LC] as pdf by Library of Congress standard, loc.gov.</ref> Another standard, [[United Nations romanization systems for geographical names|United Nations Romanization Systems for Geographical Names]] (UNRSGN), was developed by the United Nations Group of Experts<ref name="Geoname">{{Citation | title=Technical reference manual for the standardization of geographical names | author=United Nations Group of Experts on Geographical Names, United Nations Department of Economic and Social Affairs | year=2007 | publisher=United Nations Publications, 2007 | isbn=978-92-1-161500-5 | url=https://books.google.com/books?id=mh8u32ANQxAC | quote=... ISO 15919 ... There is no evidence of the use of the system either in India or in international cartographic products ... The Hunterian system is the actually used national system of romanization in India ...}}</ref> on Geographical Names (UNGEGN)<ref>{{Cite web|url=http://www.eki.ee/wgrs/|title=UNGEGN Working Group on Romanization Systems|website= United Nations Group of Experts on Geographical Names UNGEGN, www.eki.ee .Eesti Keele Instituut. |access-date=2017-02-14}}</ref> and covers many Brahmic scripts. There are some differences<ref name="diff">{{Cite web|url=http://www.eki.ee/wgrs/rom1_hi.htm|title=Differences between ISO 15919 and UNRSGN|date=March 2016|website=Working group on Romanization systems. United Nations Group of Experts on Geographical Names UNGEGN. www.eki.ee/wgrs/ .Eesti Keele Instituut |access-date=13 February 2017}}</ref> between ISO 15919 and UNRSGN. ===ASCII schemes=== ====Harvard-Kyoto==== {{main|Harvard-Kyoto}} Compared to [[IAST]], [[Harvard-Kyoto]] looks much simpler. It does not contain any of the [[diacritic]] marks that IAST contains. Instead of diacritics, Harvard-Kyoto uses [[capital letters]]. The use of capital letters makes typing in Harvard-Kyoto much easier than in IAST but produces words with capital letters inside them. ==== ITRANS scheme ==== {{main|ITRANS}} {{citation needed span|[[ITRANS]] is an extension of [[Harvard-Kyoto]]. The ITRANS transliteration scheme was developed for the [[ITRANS]] software package, a pre-processor for [[Indic scripts]]. The user inputs in Roman letters and the ITRANS preprocessor converts the Roman letters into Devanāgarī (or other Indic scripts). The latest version of [[ITRANS]] is version 5.30 released in July 2001.|date=September 2022}} ==== Velthuis ==== {{main|Velthuis}} The disadvantage of the above [[ASCII]] schemes is case-sensitivity, implying that transliterated names may not be capitalised. This difficulty is avoided with the system developed in 1996 by Frans Velthuis for [[TeX]], loosely based on IAST, in which case is irrelevant. ==== WX ==== {{main|WX_notation}} WX notation is a transliteration scheme for representing Indian languages in ASCII. This scheme originated at IIT Kanpur for computational processing of Indian languages, and is widely used among the natural language processing (NLP) community in India. The notation (though unidentified) is used, for example, in a textbook on NLP from IIT Kanpur.[1] The salient features of this transliteration scheme are: Every consonant and every vowel has a single mapping into Roman. Hence it is a prefix code,[2] advantageous from a computation point of view. Typically the small case letters are used for un-aspirated consonants and short vowels while the capital case letters are used for aspirated consonants and long vowels. While the retroflexed voiceless and voiced consonants are mapped to 't, T, d and D', the dentals are mapped to 'w, W, x and X'. Hence the name of the scheme "WX", referring to the idiosyncratic mapping. Ubuntu Linux provides a keyboard support for WX notation. ==== SLP1 ==== {{main|SLP1}} SLP1 (Sanskrit Library Phonetic) is a case-sensitive scheme initially used by [http://sanskritlibrary.org Sanskrit Library]<ref>{{cite web | url = http://sanskritlibrary.org/tomcat/sl/ScriptTable | title = The Sanskrit Library » Tools » Indic Script Comparison Table | url-status = dead | archive-url = https://web.archive.org/web/20140808070533/http://sanskritlibrary.org/tomcat/sl/ScriptTable | archive-date = 2014-08-08}}</ref> which was developed by Peter Scharf and (the late) Malcolm Hyman, who first described it in appendix B of their book Linguistic Issues in Encoding Sanskrit.<ref>{{cite book | last1 = Scharf | first1 = Peter M. | last2 = Hyman | first2 = Malcolm D. | title = Linguistic Issues in Encoding Sanskrit | date = 2011 | url = http://www.sanskritlibrary.org/Sanskrit/pub/lies_sl.pdf}}</ref> The advantage of SLP1 over other encodings is that a single ASCII character is used for each Devanagari letter, a peculiarity that eases reverse transliteration.<ref>[http://www.sanskrit-lexicon.uni-koeln.de/monier/help.html Morphology Help<!-- Bot generated title -->]</ref> ==== Hinglish ==== {{main|Hinglish}} Hinglish refers to the non-standardised Romanised Hindi used online, and especially on social media. In India, Romanised Hindi is the dominant form of expression online. In an analysis of [[YouTube]] comments, Palakodety et al., identified that 52% of comments were in Romanised Hindi, 46% in English, and 1% in [[Devanagari]] Hindi.<ref name="Palakodety">{{Citation |last1=Palakodety |first1=Shriphani |title=Low Resource Machine Translation |date=2021 |url=http://dx.doi.org/10.1007/978-981-16-5625-5_5 |work=Low Resource Social Media Text Mining |pages=7–9 |place=Singapore |publisher=Springer Singapore |isbn=978-981-16-5624-8 |access-date=2022-09-24 |last2=KhudaBukhsh |first2=Ashiqur R. |last3=Jayachandran |first3=Guha|doi=10.1007/978-981-16-5625-5_5 |s2cid=244313560 |url-access=subscription }}</ref> ==== Others ==== Other less popular ASCII schemes include [[WX_notation]], Vedatype and the 7-bit ISO 15919. WX notation is a transliteration scheme for representing Indian languages in ASCII. It originated at IIT Kanpur for computational processing of Indian languages and is widely used among the natural language processing (NLP) community in India. This scheme is described in [http://ltrc.iiit.ac.in/downloads/nlpbook/nlp-panini.pdf NLP Panini] {{Webarchive|url=https://web.archive.org/web/20131126005517/http://ltrc.iiit.ac.in/downloads/nlpbook/nlp-panini.pdf |date=26 November 2013 }} (Appendix B). It is similar to, but not as versatile as, SLP1, as far as the coverage of Vedic Sanskrit is concerned. Comparison of WX with other schemes is found in [https://web.archive.org/web/20141222030835/http://yquem.inria.fr/~huet/PUBLIC/Brown.pdf Huet (2009), App A.]. Vedatype is another scheme used for encoding Vedic texts at [[Maharishi University of Management]]. An online transcoding utility across all these schemes is provided at the [https://web.archive.org/web/20121112060730/http://sanskritlibrary.org/tomcat/sl/TranscodeText Sanskrit Library]. [[ISO 15919]] includes a so-called "limited character set" option to replace the diacritics by prefixes, so that it is ASCII-compatible. A pictorial explanation is [https://web.archive.org/web/20140222204023/http://homepage.ntlworld.com/stone-catend/tri7-cnv.gif here] from [https://web.archive.org/web/20120819120416/http://homepage.ntlworld.com/stone-catend/trioprc.htm Anthony Stone].
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)