Persian alphabet

Revision as of 10:52, 24 May 2025 by imported>Peoplic (→‎Diacritics)
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Template:Short description Template:For Template:Multiple issues Template:Infobox writing system Template:Contains special characters Template:Arabic-script sidebar Template:Writing systems in India

The Persian alphabet (Template:Langx), also known as the Perso-Arabic script, is the right-to-left alphabet used for the Persian language. It is a variation of the Arabic script with four additional letters: {{#invoke:Lang|lang}} (the sounds 'g', 'zh', 'ch', and 'p', respectively), in addition to the obsolete {{#invoke:Lang|lang}} that was used for the sound Template:IPAslink. This letter is no longer used in Persian, as the Template:IPAblink-sound changed to Template:IPAblink, e.g. archaic {{#invoke:Lang|lang}} {{#invoke:IPA|main}} > {{#invoke:Lang|lang}} {{#invoke:IPA|main}} 'language'.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref><ref name=":1">Template:Cite journal</ref>

It was the basis of many Arabic-based scripts used in Central and South Asia. It is used for both Iranian and Dari: standard varieties of Persian; and is one of two official writing systems for the Persian language, alongside the Cyrillic-based Tajik alphabet.

The script is mostly but not exclusively right-to-left; mathematical expressions, numeric dates and numbers bearing units are embedded from left to right. The script is cursive, meaning most letters in a word connect to each other; when they are typed, contemporary word processors automatically join adjacent letter forms. Persian is unusual among Arabic scripts because a zero-width non-joiner is sometimes entered in a word, making a letter becoming not connected by others.

HistoryEdit

The Persian alphabet is directly derived and developed from the Arabic alphabet. The Arabic alphabet was introduced to the Persian-speaking world after the Muslim conquest of Persia and the fall of the Sasanian Empire in the 7th century. Following this, the Arabic language became the principal language of government and religious institutions in Persia, which led to the widespread usage of the Arabic script. Classical Persian literature and poetry were affected by this simultaneous usage of Arabic and Persian. A new influx of Arabic vocabulary soon entered the Persian language.<ref name=":0" /> In the 8th century, the Tahirid dynasty and Samanid dynasty officially adopted the Arabic script for writing Persian, followed by the Saffarid dynasty in the 9th century, gradually displacing the various Pahlavi scripts used for the Persian language prior. By the 9th-century, the Perso-Arabic alphabet became the dominant form of writing in Greater Khorasan.<ref name=":0">Template:Cite book</ref><ref name="Lapidus2002">Template:Cite book</ref><ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Under the influence of various Persian Empires, many languages in Central and South Asia that adopted the Arabic script use the Persian Alphabet as the basis of their writing systems. Today, extended versions of the Persian alphabet are used to write a wide variety of Indo-Iranian languages, including Kurdish, Balochi, Pashto, Urdu (from Classical Hindustani), Saraiki, Panjabi, Sindhi and Kashmiri. In the past the use of the Persian alphabet was common amongst Turkic languages, but today is relegated to those spoken within Iran, such as Azerbaijani, Turkmen, Qashqai, Chaharmahali and Khalaj. The Uyghur language in western China is the most notable exception to this.

During the Soviet period many languages in Central Asia, including Persian, were reformed by the government. This ultimately resulted in the Cyrillic-based alphabet used in Tajikistan today. See: Template:Section link.

LettersEdit

File:Persian Nastaʿlīq's proportions.jpg
Example showing the Nastaʿlīq calligraphic style's proportion rulesTemplate:Citation needed

Below are the 32 letters of the modern Persian alphabet. Since the script is cursive, the appearance of a letter changes depending on its position: isolated, initial (joined on the left), medial (joined on both sides) and final (joined on the right) of a word.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> These include 28 letters of the Arabic alphabet, in addition to 4 other letters.

The names of the letters are mostly the ones used in Arabic except for the Persian pronunciation. The only ambiguous name is Template:Transliteration, which is used for both {{#invoke:Lang|lang}} and {{#invoke:Lang|lang}}. For clarification, they are often called Template:Transliteration (literally "Template:Transliteration-like Template:Transliteration" after Template:Transliteration, the name for the letter {{#invoke:Lang|lang}} that uses the same base form) and Template:Transliteration (literally "two-eyed Template:Transliteration", after the contextual middle letterform {{#invoke:Lang|lang}}), respectively. There are eight Persian letters that are mainly used in Arabic or foreign loanwords and not in native words: {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}} and {{#invoke:Lang|lang}}. These eight letters are also common used in proper names only. Unlike Arabic, the Persian language absolutely does not have pharyngealization at all. Although the letter {{#invoke:Lang|lang}} is mainly used in Arabic loanwords, there are some native Persian words with this letter: {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, etc. The pronunciation of these letters in Persian can differ from their pronunciation in Arabic. For example, the letter ث is pronounced as /s/ in Persian, while it is pronounced as /θ/ in Arabic.

Letter Persian Arabic
ث /s/ /θ/
ح /h/ /ħ/
ذ /z/ /ð/
ص /s/ /sˤ/
ض /z/ /dˤ/
ط /t/ /tˤ/
ظ /z/ /ðˤ/
ع /ʔ/ /ʕ/
غ Template:IPAblink or Template:IPAblink /ɣ/

Overview tableEdit

# Name
(in Persian)
Name
(transliterated)
Transliteration IPA Unicode Contextual forms
Final Medial Initial Isolated
0 {{#invoke:Lang|lang}} Template:Transliteration<ref>{{#invoke:citation/CS1|citation CitationClass=web

}}</ref>

Template:Transliteration Glottal stop Template:IPAblink U+0621 Template:N/A Template:N/A Template:N/A {{#invoke:Lang|lang}}
U+0623 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
U+0626 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
U+0624 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
1 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0627 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
2 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0628 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
3 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+067E {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
4 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+062A {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
5 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+062B {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
6 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+062C {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
7 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0686 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
8 lang}} Template:Transliteration (Template:Transliteration) Template:Transliteration Template:IPAblink U+062D {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
9 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+062E {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
10 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+062F {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
11 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0630 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
12 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0631 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
13 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0632 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
14 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0698 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
15 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0633 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
16 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0634 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
17 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0635 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
18 lang}} Template:Transliteration z Template:IPAblink U+0636 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
19 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0637 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
20 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0638 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
21 lang}} Template:Transliteration Template:Transliteration Template:IPAblink, Template:IPAblink/Template:IPAblink U+0639 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
22 lang}} Template:Transliteration Template:Transliteration Template:IPAblink, Template:IPAblink U+063A {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
23 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0641 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
24 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0642 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
25 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+06A9 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
26 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+06AF {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
27 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0644 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
28 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0645 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
29 lang}} Template:Transliteration Template:Transliteration Template:IPAblink U+0646 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
30 {{#invoke:Lang|lang}} Template:Transliteration (in Farsi) Template:Transliteration / Template:Transliteration / Template:Transliteration / Template:Transliteration Template:IPAblink, {{#invoke:IPA|main}}, Template:IPAblink, Template:IPAblink (only word-finally) U+0648 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Template:Transliteration (in Dari) Template:Transliteration / Template:Transliteration / Template:Transliteration / Template:Transliteration Template:IPAblink, Template:IPAblink, {{#invoke:IPA|main}}, Template:IPAblink
31 lang}} Template:Transliteration (Template:Transliteration) Template:Transliteration Template:IPAblink, or Template:IPAblink and Template:IPAblink (word-finally) U+0647 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
32 lang}} Template:Transliteration Template:Transliteration / Template:Transliteration / Template:Transliteration / (Also Template:Transliteration / Template:Transliteration in Dari) Template:IPAblink, Template:IPAblink, Template:IPAblink ({{#invoke:IPA|main}} / Template:IPAblink in Dari) U+06CC {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}

Historically, in Early New Persian, there was a special letter for the sound Template:IPAslink. This letter is no longer used, as the {{#invoke:IPA|main}}-sound changed to Template:IPAslink, e.g. archaic {{#invoke:Lang|lang}} /zaβān/ > {{#invoke:Lang|lang}} {{#invoke:IPA|main}} 'language'.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Sound Isolated form Final form Medial form Initial form Name
Template:IPAslink {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} βe

Another obsolete variant of the twenty-sixth letter {{#invoke:Lang|lang}} Template:IPAslink is {{#invoke:Lang|lang}} which used to appear in old manuscripts.<ref name=":1" />

Sound Isolated form Final form Medial form Initial form Name
Template:IPAslink {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} Template:Transliteration

Another obsolete variant of the twenty-fifth letter {{#invoke:Lang|lang}} Template:IPAslink is {{#invoke:Lang|lang}} which used to appear in old manuscripts.

Sound Isolated form Final form Medial form Initial form Name
Template:IPAslink {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} Template:Transliteration

The archaic letter {{#invoke:Lang|lang}} Template:IPAslink was also used as a substitute for the twenty-sixth letter of the Persian alphabet, {{#invoke:Lang|lang}}, which was used to appear in the older manuscripts of Persian in the late 18th century to the early 19th century.

Sound Isolated form Final form Medial form Initial form Name
Template:IPAslink {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} Template:Transliteration

VariantsEdit

{{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Noto Nastaliq Urdu
Scheherazade
Lateef
Noto Naskh Arabic
Markazi Text
Noto Sans Arabic
Baloo Bhaijaan
El Messiri SemiBold
Lemonada Medium
Changa Medium
Mada
Noto Kufi Arabic
Reem Kufi
Lalezar
Jomhuria
Rakkas
The alphabet in 16 fonts: Noto Nastaliq Urdu, Scheherazade, Lateef, Noto Naskh Arabic, Markazi Text, Noto Sans Arabic, Baloo Bhaijaan, El Messiri SemiBold, Lemonada Medium, Changa Medium, Mada, Noto Kufi Arabic, Reem Kufi, Lalezar, Jomhuria, and Rakkas.

Letter constructionEdit

forms {{#if:(i)|(i)|[1]}} isolated {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
start {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
mid {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
end {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
i'jam {{#if:(i)|(i)|[2]}}
Unicode 0621 .. 0627 .. 0649 .. 06BA .. 066E .. 062D .. 0633 .. 0635 .. 0637 .. 0639 .. 06A1 .. 066F .. 066F .. 0644 .. 0645 .. 062F .. 0631 .. 0648. .. 0647 ..
1 dot below {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode FBB3. 0628 .. 062C ..
1 dot above {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode FBB2. 0646 .. 062E .. 0636 .. 0638 .. 063A .. 0641 .. 0630 .. 0632 ..
2 dots below {{#if:(ii)|(ii)|[3]}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode FBB5. 06CC ..
2 dots above {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode FBB4. 062A .. 0642 .. 0629 ..
3 dots below {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode FBB9. FBB7. 067E .. 0686 ..
3 dots above {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode FBB6. 062B .. 0634 .. 0698 ..
line above {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode 203E. 06AF ..
none {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode 0621 .. 0627 .. 0649 .. 06BA .. 062D .. 0633 .. 0635 .. 0637 .. 0639 .. 066F .. 0644 .. 0645 .. 062F .. 0631 .. 0648. .. 0647 ..
madda above {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode 06E4. 0653. 0622 ..
Hamza below {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode 0655. 0625 ..
Hamza above {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Unicode 0674. 0654. 0623 .. 0626 .. 0624 .. 06C0 ..

<templatestyles src="Citation/styles.css"/>^i.{{#if:| }} The i'jam diacritic characters are illustrative only; in most typesetting the combined characters in the middle of the table are used.

<templatestyles src="Citation/styles.css"/>^ii.{{#if:| }} Persian has 2 dots below in the initial and middle positions only. The standard Arabic version {{#invoke:Lang|lang}} always has 2 dots below.

Letters that do not link to a following letterEdit

Seven letters ({{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}) do not connect to the following letter, unlike the rest of the letters of the alphabet. The seven letters have the same form in isolated and initial position and a second form in medial and final position. For example, when the letter {{#invoke:Lang|lang}} Template:Transliteration is at the beginning of a word such as {{#invoke:Lang|lang}} Template:Transliteration ("here"), the same form is used as in an isolated Template:Transliteration. In the case of {{#invoke:Lang|lang}} Template:Transliteration ("today"), the letter {{#invoke:Lang|lang}} Template:Transliteration takes the final form and the letter {{#invoke:Lang|lang}} Template:Transliteration takes the isolated form, but they are in the middle of the word, and {{#invoke:Lang|lang}} also has its isolated form, but it occurs at the end of the word.

DiacriticsEdit

Persian script has adopted a subset of Arabic diacritics: Template:Transliteration Template:IPAslink (Template:Transliteration in Arabic), Template:Transliteration Template:IPAslink (Template:Transliteration in Arabic), and Template:Transliteration Template:IPAslink or Template:IPAslink (Template:Transliteration in Arabic, pronounced Template:Transliteration in Western Persian), Template:Transliteration {{#invoke:IPA|main}} and Template:Transliteration (gemination). Other Arabic diacritics may be seen in Arabic loanwords in Persian.

Short vowelsEdit

Of the four Arabic diacritics, the Persian language has adopted the following three for short vowels. The last one, sukūn, which indicates the lack of a vowel, has not been adopted.

Short vowels
(fully vocalized text)
Name
(in Persian)
Name
(transliterated)
(a)|[4]}} Value {{#if:(b)|(b)|[5]}}

(Farsi/Dari)

064E
Template:Script/Arabic
lang}} Template:Transliteration/Template:Transliteration Template:Transliteration Template:IPAslink Template:IPAslink
0650
Template:Script/Arabic
lang}} Template:Transliteration/Template:Transliteration Template:Transliteration; Template:Transliteration Template:IPAslink Template:IPAslink; Template:IPAslink
064F
Template:Script/Arabic
lang}} Template:Transliteration/Template:Transliteration Template:Transliteration; Template:Transliteration Template:IPAslink Template:IPAslink

<templatestyles src="Citation/styles.css"/>^a.{{#if:| }} There is no standard transliteration for Persian. The letters 'i' and 'u' are only ever used as short vowels when transliterating Dari or Tajik Persian. See Persian Phonology

<templatestyles src="Citation/styles.css"/>^b.{{#if:| }} Diacritics differ by dialect, due to Dari having 8 distinct vowels compared to the 6 vowels of Farsi. See Persian Phonology

In Farsi, none of these short vowels may be the initial or final grapheme in an isolated word, although they may appear in the final position as an inflection, when the word is part of a noun group. In a word that starts with a vowel, the first grapheme is a silent Template:Transliteration which carries the short vowel, e.g. {{#invoke:Lang|lang}} (Template:Transliteration, meaning "hope"). In a word that ends with a vowel, letters {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}} and {{#invoke:Lang|lang}} respectively become the proxy letters for Template:Transliteration, Template:Transliteration and Template:Transliteration, e.g. {{#invoke:Lang|lang}} (Template:Transliteration, meaning "new") or {{#invoke:Lang|lang}} (Template:Transliteration, meaning "package").

Tanvin (nunation)Edit

{{#invoke:Labelled list hatnote|labelledList|Main article|Main articles|Main page|Main pages}} Nunation (Template:Langx, {{#invoke:Lang|lang}}) is the addition of one of three vowel diacritics to a noun or adjective to indicate that the word ends in an alveolar nasal sound without the addition of the letter nun.

Nunation
(fully vocalized text)
Name
(in Persian)
Name
(transliterated)
Notes
064B
Template:Script/Arabic
lang}} Template:Transliteration
064D
Template:Script/Arabic
lang}} Template:Transliteration Never used in the Persian language.

Taught in Islamic nations to

complement Quran education.

064C
Template:Script/Arabic
lang}} Template:Transliteration

TašdidEdit

{{#invoke:Labelled list hatnote|labelledList|Main article|Main articles|Main page|Main pages}}

Symbol Name
(in Persian)
Name
(transliteration)
0651
Template:Script/Arabic
تشدید Template:Transliteration

Other charactersEdit

The following are not actual letters but different orthographical shapes for letters, a ligature in the case of the Template:Transliteration. As to {{#invoke:Lang|lang}} (hamza), it has only one graphical form since it is never tied to a preceding or following letter. However, it is sometimes 'seated' on a Template:Transliteration, Template:Transliteration or Template:Transliteration, and in that case, the seat behaves like an ordinary Template:Transliteration, Template:Transliteration or Template:Transliteration respectively. Technically, hamza is not a letter but a diacritic.

Name Pronunciation IPA Unicode Final Medial Initial Stand-alone Notes
Template:Transliteration Template:Transliteration Template:IPAblink U+0622 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} The final form is very rare and is freely replaced with ordinary alef.
Template:Transliteration Template:Transliteration or Template:Transliteration main}} U+06C0 {{#invoke:Lang|lang}} {{#invoke:Lang|lang}} lang}} or {{#invoke:Lang|lang}} combinations instead.
Template:Transliteration Template:Transliteration main}} U+0644 (lām) and U+0627 (alef) {{#invoke:Lang|lang}} {{#invoke:Lang|lang}}
Template:Transliteration U+0640 {{#invoke:Lang|lang}} This is the medial character which connects other characters

Although at first glance, they may seem similar, there are many differences in the way the different languages use the alphabets. For example, similar words are written differently in Persian and Arabic, as they are used differently.

Unicode has accepted Template:Unichar in the Miscellaneous Symbols range.<ref>"Miscellaneous Symbols". p. 4. The Unicode Standard, Version 13.0. Unicode.org</ref> In Unicode 1.0 this symbol was known as Template:Sc2.<ref>"3.8 Block-by-block Charts" § Miscellaneous Dingbats p. 325 (155 electronically). The Unicode Standard Version 1.0. Unicode.org</ref> It is a stylization of {{#invoke:Lang|lang}} (Template:Transliteration) used as the emblem of Iran. It is also a part of the flag of Iran.

The Unicode Standard has a compatibility character defined Template:Unichar that can represent {{#invoke:Lang|lang}}, the Persian name of the currency of Iran.<ref>For the proposal, see {{#invoke:citation/CS1|citation |CitationClass=web }} It proposes the character under the name of Template:Sc2, which was changed by the standard committees to Template:Sc2.</ref>

Novel lettersEdit

The Persian alphabet has four extra letters that are not in the Arabic alphabet: Template:IPAslink, Template:IPAslink (ch in chair), Template:IPAslink (s in measure), Template:IPAslink. An additional fifth letter {{#invoke:Lang|lang}} was used for {{#invoke:IPA|main}} (v in Spanish {{#invoke:Lang|lang}}) but it is no longer used.

Sound Shape Name Unicode code point
Template:IPAslink {{#invoke:Lang|lang}} Template:Transliteration U+067E
Template:IPAslink (ch) {{#invoke:Lang|lang}} Template:Transliteration U+0686
Template:IPAslink (zh) {{#invoke:Lang|lang}} Template:Transliteration U+0698
Template:IPAslink {{#invoke:Lang|lang}} Template:Transliteration U+06AF

Deviations from the Arabic scriptEdit

Template:Anchor Persian uses the Eastern Arabic numerals, but the shapes of the digits 'four' ({{#invoke:Lang|lang}}), 'five' ({{#invoke:Lang|lang}}), and 'six' ({{#invoke:Lang|lang}}) are different from the shapes used in Arabic. All the digits also have different codepoints in Unicode:<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Hindu-Arabic Persian Name Unicode Arabic Unicode
0 ۰ صفر

sefr

U+06F0 ٠ U+0660
1 ۱ يک

yek

U+06F1 ١ U+0661
2 ۲ دو

do

U+06F2 ٢ U+0662
3 ۳ سه

se

U+06F3 ٣ U+0663
4 ۴ چهار

čahâr

U+06F4 ٤ U+0664
5 ۵ پنج

panj

U+06F5 ٥ U+0665
6 ۶ شش

šeš

U+06F6 ٦ U+0666
7 ۷ هفت

haft

U+06F7 ٧ U+0667
8 ۸ هشت

hašt

U+06F8 ٨ U+0668
9 ۹ نه

no

U+06F9 ٩ U+0669
- ی ye U+06CC يTemplate:Efn U+064A
- ک kâf U+06A9 ك U+0643

Template:Notelist

Comparison of different numeralsEdit

Template:See also

Western Arabic 0 1 2 3 4 5 6 7 8 9 10
Eastern ArabicTemplate:Efn lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}}
PersianTemplate:Efn lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}}
UrduTemplate:Efn Template:Urd Template:Urd Template:Urd Template:Urd Template:Urd Template:Urd Template:Urd Template:Urd Template:Urd Template:Urd Template:Urd
Abjad numerals Template:N/A lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}} lang}}

Template:Notelist

Word boundariesEdit

Typically, words are separated from each other by a space. Certain morphemes (such as the plural ending '-hâ'), however, are written without a space. On a computer, they are separated from the word using the zero-width non-joiner.

Cyrillic Persian alphabet in TajikistanEdit

As part of the russification of Central Asia, the Cyrillic script was introduced in the late 1930s.<ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref> The alphabet has remained Cyrillic since then. In 1989, with the growth in Tajik nationalism, a law was enacted declaring Tajik the state language. In addition, the law officially equated Tajik with Persian, placing the word Farsi (the endonym for the Persian language) after Tajik. The law also called for a gradual reintroduction of the Perso-Arabic alphabet.<ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite book</ref>Template:Excessive citations inline

The Persian alphabet was introduced into education and public life, although the banning of the Islamic Renaissance Party in 1993 slowed adoption. In 1999, the word Farsi was removed from the state-language law, reverting the name to simply Tajik.{{#if:|{{{2}}}|[6]}} Template:As of the de facto standard in use is the Tajik Cyrillic alphabet,{{#if:|{{{2}}}|[7]}} and Template:As of only a very small part of the population can read the Persian alphabet.{{#if:|{{{2}}}|[8]}}

See alsoEdit

ReferencesEdit

Template:Reflist

External linksEdit

Template:Sister project

Template:Persian language Template:Arabic alphabets Template:Authority control