Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
List of Latin-script alphabets
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
===Notes=== # <span id="n-la">[[#r-la|↑]][[#r-la:1|↑]][[#r-la:2|↑]][[#r-la:3|↑]]</span> In classical '''Latin''', the digraphs {{vr|[[ch (digraph)|ch]]}}, {{vr|[[ph (digraph)|ph]]}}, {{vr|[[rh (digraph)|rh]]}}, {{vr|[[th (digraph)|th]]}} were used in loanwords from [[Greek language|Greek]], but they were not included in the alphabet. The ligatures {{vr|[[æ]]}}, {{vr|[[œ]]}} and {{vr|[[w]]}}, as well as lowercase letters, were added to the alphabet only in [[Middle Ages]]. The letters {{vr|j}} and {{vr|u}} were used as [[typography|typographical variants]] of {{vr|i}} and {{vr|v}}, respectively, roughly until the [[Age of Enlightenment|Enlightenment]]. # <span id="n-af">[[#r-af|↑]][[#r-af:1|↑]][[#r-af:2|↑]][[#r-af:3|↑]]</span> In '''Afrikaans''', {{vr|c}} and {{vr|q}} are only (and {{vr|x}} and {{vr|z}} almost only) used in loanwords. # <span id="n-sq">[[#r-sq|↑]][[#r-sq:1|↑]][[#r-sq:2|↑]][[#r-sq:3|↑]]</span> '''Albanian''' officially has the digraphs {{vr|{{wrap|dh, gj, ll, nj, rr, sh, th, xh, zh}}}}, which is sufficient to represent the [[Tosk Albanian|Tosk]] dialect. The [[Gheg Albanian|Gheg]] dialect supplements the official alphabet with 6 nasal vowels, namely {{vr|{{wrap|â, ê, î, ô, û, ŷ}}}}. # <span id="n-aat">[[#r-aat|↑]][[#r-aat:1|↑]][[#r-aat:2|↑]][[#r-aat:3|↑]]</span> '''Arbëresh''' officially has the digraphs {{vr|{{wrap|dh, gj, hj, ll, nj, rr, sh, th, xh, zh}}}}. Arbëresh has the distinctive {{vr|hj}}, which is considered as a letter in its own right. # <span id="n-lrl">[[#r-lrl|↑]][[#r-lrl:1|↑]]</span> '''Achomi''' also has the digraph {{vr|a{{'}}}}. # <span id="n-az">[[#r-az|↑]][[#r-az:1|↑]][[#r-az:2|↑]][[#r-az:3|↑]]</span> '''Azeri''' only uses the letter {{vr|ä}} as a substitute for {{vr|ə}} if the latter cannot be used (it was replaced by the schwa one year later because it is the most common letter). These cases should be avoided! The letters {{vr|w}}, {{vr|đ}}, {{vr|ŋ}}, {{vr|q̇}}, {{vr|ć}} (or the digraph {{vr|ts}}), and the digraph {{vr|dz}} are only used in certain dialects. # <span id="n-bm">[[#r-bm|↑]]</span> '''Bambara''' also has the digraphs: {{vr|kh}} (only present in loanwords), {{vr|sh}} (also written as {{vr|ʃ}}; only present in some dialects). Historically, {{vr|è}} was used instead of {{vr|ɛ}}, {{vr|ny}} was used instead of {{vr|ɲ}}, and {{vr|ò}} was used instead of {{vr|ɔ}} in Mali. # <span id="n-eu">[[#r-eu|↑]][[#r-eu|↑]][[#r-eu:2|↑]][[#r-eu:3|↑]]</span> '''Basque''' has several digraphs: {{vr|{{wrap|dd, ll, rr, ts, tt, tx, tz}}}}. The {{vr|ü}}, which represents {{IPA|/ø/}}, is required for various words in its Zuberoan dialect. {{vr|c, q, v, w, y}} are used in foreign words, but are officially considered part of the alphabet. # <span id="n-be">[[#r-be|↑]][[#r-be:1|↑]][[#r-be:2|↑]][[#r-be:3|↑]]</span> '''Belarusian''' also has several digraphs: {{vr|ch, dz, dź, dž}}. # <span id="n-bi">[[#r-bi|↑]][[#r-bi:1|↑]][[#r-bi:2|↑]][[#r-bi:3|↑]]</span> '''Bislama''' also has the digraph {{vr|ng}}. # <span id="n-br">[[#r-br|↑]][[#r-br:1|↑]][[#r-br:2|↑]][[#r-br:3|↑]]</span> '''Breton''' also has the digraphs {{vr|ch, c'h, zh}}. {{vr|c, q, x}} are used in foreign words or digraphs only. # <span id="n-ca">[[#r-ca:1|↑]][[#r-ca:2|↑]][[#r-ca:3|↑]]</span> '''Catalan''' also has a large number of digraphs: {{vr|{{wrap|dj, gu, gü, ig, ix, ll, l·l, ny, qu, qü, rr, ss, tg, tj, ts, tx, tz}}}}. The letters {{vr|k, q, w, y}} are only used in loanwords or the digraphs mentioned. # <span id="n-tch">[[#r-tch|↑]][[#r-tch:1|↑]]</span> The '''Alphabet of Chad''' also uses the unique letters {{vr|n̰}} and {{vr|r̰}}. # <span id="n-ch">[[#r-ch|↑]][[#r-ch:1|↑]][[#r-ch:2|↑]]</span> '''Chamorro''' also has the digraphs {{vr|ch, ng}}. {{vr|c}} used only in digraphs. # <span id="n-co">[[#r-co|↑]][[#r-co:1|↑]][[#r-co:2|↑]][[#r-co:3|↑]]</span> '''Corsican''' has the trigraphs: {{vr|chj, ghj}}. # <span id="n-hr">[[#r-hr|↑]][[#r-hr:1|↑]][[#r-hr|↑]][[#r-hr:3|↑]][[#r-sr|↑]][[#r-sr:1|↑]][[#r-sr|↑]][[#r-sr:3|↑]]</span> '''Croatian''' [[Gaj's alphabet]] also has the digraphs: {{vr|dž, lj, nj}}. There are also four tone markers that are sometimes used on vowels to avoid ambiguity in [[homophones]], but this is generally uncommon. [[Gaj's alphabet]] has been adopted by the [[Serbian language|Serbian]] and [[Bosnian language|Bosnian]] standards and that it has complete one-to-one congruence with [[Serbian Cyrillic]], where the three digraphs map to Cyrillic letters {{vr|џ}}, {{vr|љ}} and {{vr|њ}}, respectively. Rarely and non-standardly, digraph {{vr|dj}} is used instead of {{vr|đ}} (like it was previously) (Cyrillic {{vr|ђ}}). [[Montenegrin alphabet|Montenegrin]] variant additionally uses {{vr|ś}} and {{vr|ź}} to indicate dialectal pronunciation. # <span id="n-acy">[[#r-acy|↑]][[#r-acy:1|↑]] </span> '''Cypriot Arabic''' also has the letters {{vr|Θ}} and {{vr|Δ}}. # <span id="n-cs">[[#r-cs|↑]][[#r-cs:1|↑]][[#r-cs:2|↑]][[#r-cs:3|↑]]</span> '''Czech''' also has the digraph {{vr|ch}}, which is considered a separate letter and is sorted between {{vr|h}} and {{vr|i}}. While {{vr|{{wrap|á, ď, é, ě, í, ň, ó, ť, ú, ů, ý}}}} are considered separate letters, in collation they are treated merely as letters with diacritics. However, {{vr|č, ř, š, ž}} are sorted as separate letters. {{vr|q, w, x}} occur only in loanwords. # <span id="n-crx">[[#r-crx|↑]] '''Dakelh''' also contains the letter {{vr|{{'}}}}, which represents the glottal stop. The letters {{vr|f, p, r, v}} are only used in loanwords.</span> # <span id="n-da">[[#r-da:1|↑]][[#r-da:2|↑]][[#r-da|↑]][[#r-no:1|↑]][[#r-no:2|↑]][[#r-no:3|↑]]</span> The '''Norwegian''' alphabet is currently identical with the '''Danish''' alphabet. {{vr|c}} is part of both alphabets and is not used in native Danish or Norwegian words (except some proper names), but occurs quite frequently in well-established loanwords in Danish. Norwegian and Danish use {{vr|é}} in some words such as {{lang|da|én}}, although {{vr|é}} is considered a diacritic mark, while {{vr|å, æ, ø}} are letters. {{vr|q, w, x, z}} are not used except for names and some foreign words. # <span id="n-din">[[#r-din|↑]]</span> '''Dinka''' also has the digraphs: {{vr|dh, nh, ny, th}}. {{vr|h}} is only present in these digraphs. Dinka also used the letters {{vr|{{wrap|ä, ë, ï, ö, ɛ̈, ɔ̈}}}} (the last two which do not exist as precomposed characters in Unicode) # <span id="n-nl">[[#r-nl:1|↑]][[#r-nl:2|↑]][[#r-nl:3|↑]]</span> The status of {{vr|[[IJ (letter)|ij]]}} as a letter, ligature or digraph in '''Dutch''' is disputed. {{vr|c}} (outside the digraph {{vr|ch}}), {{vr|q}}, {{vr|x}}, and {{vr|y}} occur mostly in foreign words. Letters with grave and letters with circumflex occur only in loanwords. # <span id="n-en">[[#r-en:1|↑]][[#r-en:2|↑]][[#r-en:3|↑]]</span> '''English''' generally now uses extended Latin letters only in loan words, such as ''fiancé'', ''fiancée'', and ''résumé''. Rare publication guides may still use the dieresis on words, such as "coöperate", rather than the now-more-common "co-operate" (UK) or "cooperate" (US). For a fuller discussion, see articles branching from [[Lists of English words of international origin]], which was used to determine the diacritics needed for more unambiguous English. However, an {{vr|é}} or {{vr|è}} is sometimes used in poetry to show that a normally silent vowel is to be pronounced, as in "blessèd". # <span id="n-tl">[[#r-tl|↑]][[#r-tl:1|↑]][[#r-tl:2|↑]][[#r-tl:3|↑]]</span> '''Filipino''' [and also applicable in or to '''Tagalog''', which is the topmost influencer and contributor language of '''Filipino''', among the rest of the other influencer and contributor languages of the Philippines and foreign languages for '''Filipino's''' evolution, further development, and further enrichment; it ('''Tagalog''') is also the de facto historical, traditional, and linguistic basis of '''Filipino''' and the de jure or official basis of '''Filipino's''' both predecessor Philippine national and official language/s or language phase/s or stage/s since 1937 (as a national language) and 1946 (as an official language), which is lastly institutionally, officially, and constitutionally named or renamed as or into '''Pilipino''' from 1959 to 1987, before being constitutionally and officially replaced by '''Filipino''' as the national and an official language since 1987] also uses the digraph {{vr|ng}}, even originally with a large tilde that spanned both {{vr|n}} and {{vr|g}} (as in {{vr|n͠g}}) when a vowel follows the digraph. (The use of the tilde over the two letters is now rare). Only {{vr|ñ}} is required for everyday use (only in loanwords). The accented vowels are used in dictionaries to indicate pronunciation, and {{vr|g}} with tilde is only present in older works. {{vr|Ë}} and {{vr|ë}} are new variants of {{vr|E}} and {{vr|e}}, respectively, and we're introduced in 2013 by the Komisyon sa Wikang Filipino (Commission on the Filipino Language)'s "Ortograpiyang Pambansa" (National Orthography) and in 2014 by the Komisyon sa Wikang Filipino (Commission on the Filipino Language)'s KWF Manwal sa Masinop na Pagsulat (KWF Manual on Provident Writing) to represent and preserve the schwa vowel sound /ə/ in non-Tagalog Filipino words of Philippine origin or from the other languages of the Philippines that natively have this vowel sound in their languages. # <span id="n-fr">[[#r-fr:1|↑]][[#r-fr:2|↑]][[#r-fr:3|↑]]</span> Uppercase diacritics in '''French''' are often (incorrectly) thought of as being optional, but the official rules of French orthography designate accents on uppercase letters as obligatory in most cases. Many pairs or triplets are read as digraphs or trigraphs depending on context, but are not treated as such lexicographically: consonants {{vr|{{wrap|ph, (ng), th, gu/gü, qu, ce, ch/(sh/sch), rh}}}}; vocal vowels {{vr|{{wrap|(ee), ai/ay, ei/ey, eu, au/eau, ou}}}}; nasal vowels {{vr|{{wrap|ain/aim, in/im/ein, un/um/eun, an/am, en/em, om/on}}}}; the half-consonant -{{vr|(i)ll}}-; half-consonant and vowel pairs {{vr|{{wrap|oi, oin/ouin, ien, ion}}}}. When rules that govern the French orthography are not observed, they are read as separate letters, or using an approximating phonology of a foreign language for loan words, and there are many exceptions. In addition, most final consonants are mute (including those consonants that are part of feminine, plural, and conjugation endings). {{vr|ÿ}} and {{vr|ü}} are only used in certain geographical names and proper names plus their derivatives, or, in the case of {{vr|ü}} with diaeresis, newly proposed reforms, e.g. {{lang|de|capharnaüm}} 'shambles' is derived from the proper name Capharnaüm. {{vr|æ}} occurs only in Latin or Greek loanwords. # <span id="n-ff">[[#r-ff|↑]]</span> '''Fula''' has {{vr|x}} as part of the alphabet in all countries except Guinea, Guinea-Bissau, Liberia, and Sierra Leone (used only in loanwords in these countries). {{vr|ɠ}}, which is used only in loanwords (but still part of the alphabet), is used in Guinea only. Fula also uses the digraphs {{vr|mb}} (In Guinea spelled {{vr|mb}}), {{vr|nd, ng, nj}}. {{vr|{{wrap|aa, ee, ii, oo, uu}}}} are part of the alphabet in all countries except Guinea, Guinea-Bissau, Liberia, and Sierra Leone. {{vr|ƴ}} is used in all countries except for Nigeria, where it is written {{vr|{{'}}y}}. {{vr| ŋ}} is used in all countries except for Nigeria. {{vr|ɲ}} is used in Guinea, Mali, and Burkina Faso, {{vr|ñ}} is used in Senegal, Gambia, Mauritania, Guinea-Bissau, Liberia, and Sierra Leone, and the digraph {{vr|ny}} is used in Niger, Cameroon, Chad, Central African Republic, and Nigeria. The apostrophe is a letter (representing the glottal stop) in Guinea-Bissau, Liberia, and Sierra Leone. {{vr|q, v, z}} are only used in loanwords, and are not part of the alphabet. # <span id="n-gl">[[#r-gl|↑]][[#r-gl|↑]][[#r-gl|↑]][[#r-gl|↑]]</span> '''Galician'''. The standard of 1982 set also the digraphs ''gu'', ''qu'' (both always before {{vr|e}} and {{vr|i}}), ''ch, ll, nh'' and ''rr''. In addition, the standard of 2003 added the grapheme {{vr|ao}} as an alternative writing of {{vr|ó}}. Although not marked (or forgotten) in the list of digraphs, they are used to represent the same sound, so the sequence {{vr|ao}} should be considered as a digraph. The sequence {{vr|nh}} represents a [[Velar consonant|velar]] nasal (not a [[palatal]] as in [[Portuguese language|Portuguese]]) and is restricted only to three feminine words, being either [[demonstrative]] or [[pronoun]]: {{lang|gl|unha}} ('a' and 'one'), {{lang|gl|algunha}} ('some') and {{lang|gl|ningunha}} ('not one'). The Galician ''[[reintegracionismo]]'' movement uses it as in Portuguese. {{vr|j}} (outside of the Limia Baixa region), {{vr|k}}, {{vr|w}}, and {{vr|y}} are only used in loanwords, and are not part of the alphabet. # <span id="n-de">[[#r-de|↑]][[#r-de:1|↑]][[#r-de:2|↑]][[#r-de:3|↑]]</span> '''German''' also retains most original letters in French loan words. Swiss German does not use {{vr|ß}} any more. The [[long s]] {{vr|ſ}} was in use until the mid-20th century. {{vr|sch}} is usually not considered a separate letter, neither are the digraphs {{vr|{{wrap|ch, ck, st, sp, th, (ph, rh), qu}}}}. {{vr|q}} only appears in the sequence {{vr|qu}} and in loanwords, while {{vr|x}} and {{vr|y}} are found almost only in loan words. The capital {{vr|ß}} ({{vr|ẞ}}) is almost never used. The accented letters (other than the letters {{vr|ä}}, {{vr|ö}}, {{vr|ü}}, and {{vr|ß}}) are used only in loanwords. # <span id="n-gn">[[#r-gn|↑]][[#r-gn:1|↑]][[#r-gn:2|↑]][[#r-gn:3|↑]]</span> '''Guaraní''' also uses digraphs {{vr|{{wrap|ch, mb, nd, ng, nt, rr}}}} and the glottal stop {{vr|{{'}}}}. {{vr|b, c, d}} are only used in these digraphs. # <span id="n-gwi">[[#r-gwi|↑]] '''Gwich'in''' also contains the letter {{vr|{{'}}}}, which represents the glottal stop. Gwich'in also uses the letters {{vr|ą̀, ę̀, į̀, ǫ̀, ų̀}}, which are not available as precomposed characters in Unicode. Gwich'in also uses the digraphs and trigraphs: {{vr|{{wrap|aa, ąą, àà, ą̀ą̀, ch, ch', ddh, dh, dl, dr, dz, ee, ęę, èè, ę̀ę̀, gh, ghw, gw, ii, įį, ìì, į̀į̀, kh, kw, k', nd, nh, nj, oo, ǫǫ, òò, ǫ̀ǫ̀, rh, sh, shr, th, tl, tl', tr, tr', ts, ts', tth, tth', t', uu, ųų, ùù, ų̀ų̀, zh, zhr}}}}. The letter {{vr|c}} is only used the digraphs above. {{vr|b, f, m}} are only used in loanwords.</span> # <span id="n-ha">[[#r-ha|↑]][[#r-ha:1|↑]][[#r-ha:2|↑]][[#r-ha:3|↑]]</span> '''Hausa''' has the digraphs: {{vr|sh, ts}}. Vowel length and tone are usually not marked. Textbooks usually use macron or doubled vowel to mark the length, grave to mark the low tone and circumflex to mark the falling tone. Therefore, in some systems, it is possible that macron is used in combination with grave or circumflex over a, e, i, o or u. The letter {{vr|p}} is only used in loanwords. # <span id="n-hu">[[#r-hu|↑]][[#r-hu:1|↑]][[#r-hu:2|↑]][[#r-hu:3|↑]]</span> '''Hungarian''' also has the digraphs: {{vr|{{wrap|cs, dz, gy, ly, ny, sz, ty, zs}}}}; and the trigraph: {{vr|dzs}}. {{vr|{{wrap|á, é, í, ó, ő, ú, ű}}}} are considered separate letters, but are collated as variants of {{vr|{{wrap|a, e, i, o, ö, u, ü}}}}. # <span id="n-ga">[[#r-ga|↑]][[#r-ga:1|↑]][[#r-ga:2|↑]][[#r-ga:3|↑]]</span> '''Irish''' traditionally used the [[dot (diacritic)|dot diacritic]] ({{langx|ga|ponc séimhithe}}) to mark lenition, forming the dotted letters ({{lang|ga|litreacha buailte}} "struck letters") {{vr|{{wrap|ḃ, ċ, ḋ, ḟ, ġ, ṁ, ṗ, ṡ, ṫ}}}}. These have largely been replaced by the digraphs: {{vr|{{wrap|bh, ch, dh, fh, gh, mh, ph, sh, th}}}} except for in decorative or self-consciously traditional contexts. {{vr|v}} occurs in a small number of (mainly onomatopoeic) native words (e.g. {{lang|ga|vácarnach}} "to quack") and colloquialisms ({{lang|ga|vís}} for {{lang|ga|bís}} "screw"). {{vr|j, k, q, w, x, y, z}} only occur in loanwords and scientific terminology. # <span id="n-ig">[[#r-ig|↑]]</span> '''Igbo''' writes {{vr|ṅ}} alternatively as {{vr|n̄}}. Igbo has the digraphs: {{vr|{{wrap|ch, gb, gh, gw, kp, kw, nw, ny, sh}}}}. {{vr|c}} is only used in the digraph before. Also, vowels take a grave accent, an acute accent, or no accent, depending on tone. # <span id="n-it">[[#r-it|↑]][[#r-it:1|↑]][[#r-it:2|↑]][[#r-it:3|↑]]</span> '''Italian''' also has the digraphs: {{vr|ch, gh, gn, gl, sc}}. {{vr|j, k, w, x, y}} are used in foreign words, and are not part of the alphabet. {{vr|x}} is also used for native words derived from Latin and Greek; {{vr|j}} is also used for just a few native words, mainly names of persons (as in Jacopo) or of places (as in [[Jesolo]] and [[Jesi]]), in which represents {{IPA|/i/}}. While it does not occur in ordinary running texts, geographical names on maps are often written only with acute accents. The circumflex is used on an -i ending that was anciently written -ii (or -ji, -ij, -j, etc.) to distinguish homograph plurals and verb forms: e.g. {{lang|it|principî}} form {{lang|it|principi}}, {{lang|it|genî}} from {{lang|it|geni}}. # <span id="n-kaa">[[#r-kaa|↑]]</span> '''Karakalpak''' also has the digraphs: {{vr|ch, sh}}. {{vr|c, f, v}} are used in foreign words. # <span id="n-kk">[[#r-kk|↑]]</span> '''Kazakh''' also has the digraphs: {{vr|ia, io, iu}}. {{vr|f, h, v}} and the digraph {{vr|io}} are used in foreign words. # <span id="n-lv">[[#r-lv|↑]][[#r-lv:1|↑]][[#r-lv:2|↑]][[#r-lv:3|↑]]</span> '''Latvian''' also has the digraphs: {{vr|dz, dž, ie}}. ''Dz'' and ''dž'' are occasionally considered separate letters of the alphabet in more archaic examples, which have been published as recently as the 1950s; however, modern alphabets and teachings discourage this due to an ongoing effort to set decisive rules for Latvian and eliminate barbaric words accumulated during the Soviet occupation. The digraph "ie" is never considered a separate letter. ''Ō'', ''Ŗ'', and the digraphs ''CH'' (only used in loanwords) and ''UO'' are no longer part of the alphabet, but are still used in certain dialects and newspapers that use the old orthography. ''Y'' is used only in certain dialects and not in the standard language. ''F'' and ''H'' are only used in loanwords. # <span id="n-ls">[[#r-ls|↑]][[#r-ls:1|↑]][[#r-ls:2|↑]][[#r-ls:3|↑]]</span> A nearby language, '''Pite Sami''', uses '''Lule Sami''' orthography but also includes the letters {{vr|đ}} and {{vr|ŧ}}, which are not in Lule Sami. # <span id="n-lt">[[#r-lt|↑]][[#r-lt:1|↑]][[#r-lt:2|↑]][[#r-lt:3|↑]]</span> '''Lithuanian''' also has the digraphs: {{vr|{{wrap|ch, dz, dž, ie, uo}}}}. However, these are not considered separate letters of the alphabet. ''F'', ''H'', and the digraph ''CH'' are only used in loanwords. Demanding publications such as dictionaries, maps, schoolbooks etc. need additional diacritical marks to differentiate homographs. Using grave accent on A, E, I, O, U, acute accent on all vowels, and tilde accent on all vowels and on L, M, N and R. Small E and I (also with ogonek) must retain the dot when additional accent mark is added to the character; the use of ì and í (with missing dot) is considered unacceptable. # <span id="n-liv">[[#r-liv|↑]][[#r-liv:1|↑]][[#r-liv:2|↑]][[#r-liv:3|↑]]</span> In '''Livonian''', the letters ''Ö, Ȫ, Y, Ȳ'' were used by the older generation, but the younger generation merged these sounds; Around the late 1990s, these letters were removed from the alphabet. # <span id="n-mt">[[#r-mt|↑]][[#r-mt:1|↑]][[#r-mt:2|↑]][[#r-mt:3|↑]]</span> '''Maltese''' also has the digraphs: {{vr|ie, għ}}. # <span id="n-mi">[[#r-mi|↑]]</span> '''Māori''' only uses {{vr|g}} in {{vr|ng}} digraph. {{vr|wh}} is also a digraph. # <span id="n-mt">[[#r-mh|↑]][[#r-mh:1|↑]][[#r-mh:2|↑]][[#r-mh:3|↑]]</span> '''Marshallese''' often uses the old orthography (because people did not approve of the new orthography), which writes ''ļ'' as l, ''m̧'' as m, ''ņ'' as n, ''p'' as b, ''o̧'' as o at the ends of words or in the word ''yokwe'' (also spelled iakwe under the old orthography; under the new orthography, spelled io̧kwe), but a{{Clarify|reason=Unclear what is meant|date=November 2019}} at other places, and ''d'' as dr before vowels, or r after vowels. The old orthography writes ''ā'' as e in some words, but ā in others; it also writes ''ū'' as i between consonants. The old orthography writes geminates and long vowels as two letters instead. Allophones of {{IPA|/ɘ/}}, written as only e o ō in the new orthography, are also written as i u and very rarely, ū. The letter ''Y'' only occurs in the words ''yokwe'' or the phrase ''yokwe yuk'' (also spelled iakwe iuk in the old orthography or io̧kwe eok in the new orthography). # <span id="n-mss">[[#r-mss|↑]][[#r-mss:1|↑]][[#r-mss:2|↑]] '''Massachusett''' also uses the digraphs {{vr|ch, ee, sh, ty}} and the letter {{vr|8}} (which was previously written {{vr|oo}}). {{vr|c}} is only used in the digraph {{vr|ch}}.</span> # <span id="n-moh">[[#r-moh|↑]]</span> Some '''Mohawk''' speakers use orthographic {{vr|i}} in place of the consonant {{vr|y}}. The glottal stop is indicated with an apostrophe {{vr|{{'}}}} and long vowels are written with a colon {{vr|:}}. # <span id="n-mis">[[#r-mis|↑]]</span> '''Na'vi''' uses the letter ''ʼ'' and the digraphs ''aw'', ''ay'', ''ew'', ''ey'', ''kx'', ''ll'', ''ng'' (sometimes written {{vr|g}}), ''px'', ''rr'', ''ts'' (sometimes written {{vr|c}}), ''tx''. {{vr|g}} (in standard orthography) and {{vr|x}} are used only in digraphs. # <span id="n-om">[[#r-om|↑]] '''Oromo''' uses the following digraphs: {{vr|ch, dh, ny, ph, sh}}. {{vr|p}} is only used in the digraph {{vr|ph}} and loanwords. {{vr|v}} and {{vr|z}} are only used in loanwords.</span> # <span id="n-pap">[[#r-pap|↑]][[#r-pap:1|↑]][[#r-pap:2|↑]][[#r-pap:3|↑]] '''Papiamento''' also has the digraphs: {{vr|ch, dj, sh, zj}}. {{vr|q, x}} are only used in loanwords and proper names. {{vr|j}} is only used in digraphs, loanwords, and proper names. Papiamentu in Bonaire and Curaçao is different from Papiamento in Aruba in the following ways: Papiamento in Aruba uses a more etymological spelling, so Papiamento uses {{vr|c}} in native words outside of the digraph {{vr|ch}}, but Papiamentu in Bonaire and Curaçao does not. Papiamentu in Bonaire and Curaçao uses {{vr|è}}, {{vr|ò}}, {{vr|ù}}, and {{vr|ü}} for various sounds and {{vr|á, é, í, ó, ú}} for stress, but Papiamento in Aruba does not use these letters.</span> # <span id="n-pms">[[#r-pms:1|↑]]</span> '''Piedmontese''' also uses the letter {{vr|n-}}, which usually precedes a vowel, as in {{lang|pms|lun-a}} "moon". # <span id="n-pinyin">[[#r-pinyin|↑]][[#r-pinyin:1|↑]][[#r-pinyin:2|↑]][[#r-pinyin:3|↑]]</span> '''Pinyin''' has four tone markers that can go on top of any of the six vowels ({{vr|{{wrap|a, e, i, o, u, ü}}}}); e.g.: macron ({{vr|{{wrap|ā, ē, ī, ō, ū, ǖ}}}}), acute accent ({{vr|{{wrap|á, é, í, ó, ú, ǘ}}}}), caron ({{vr|{{wrap|ǎ, ě, ǐ, ǒ, ǔ, ǚ}}}}), grave accent ({{vr|{{wrap|à, è, ì, ò, ù, ǜ}}}}). It also uses the digraphs: {{vr|ch, sh, zh}}. # <span id="n-pl">[[#r-pl|↑]][[#r-pl:1|↑]][[#r-pl:2|↑]][[#r-pl:3|↑]]</span> '''Polish''' also has the digraphs: {{vr|{{wrap|ch, cz, dz, dż, dź, sz, rz}}}}. {{vr|q, v, x}} occur only in loanwords, and are sometimes not considered as part of the alphabet. # <span id="n-pt">[[#r-pt|↑]][[#r-pt:1|↑]][[#r-pt:2|↑]][[#r-pt:3|↑]]</span> '''Portuguese''' uses the digraphs {{vr|ch, lh, nh, rr, ss}}. The [[Two dots (diacritic)|trema]] on {{vr|ü}} was used in [[Brazilian Portuguese]] from 1943 to 2009. European Portuguese in that case used the grave accent ({{vr|ù}}) from 1911 to 1920, then abolished. The grave accent was used on {{vr|e, i, o, u}}, until 1973. {{vr|è, ò}} are used in geographical names outside Europe and not part of the language proper. The now abandoned practice was to indicate underlying stress in words with suffixes that begin with {{lang|pt|-z}} or in words ending in {{lang|pt|-mente}}, e.g. {{lang|pt|cafèzeiro}}, {{lang|pt|açaìzal}}, {{lang|pt|sòmente}}, {{lang|pt|ùltimamente}} etc. The trema on {{vr|ï}} could be used to mark not stressed hiatuses, e.g. {{lang|pt|constituïção}}, although this use was only optional and applied to {{vr|ü}} too. Neither the digraphs nor accented letters are considered part of the alphabet. {{vr|k, w, y}} occur only in loanwords, and were not letters of the alphabet from 1911 (Portugal) or 1943 (Brazil) until 2009, but these letters were in fact used before 1911 in Portugal and before 1943 in Brazil when the word's etymology allowed, e.g. {{lang|pt|kilometro}}, {{lang|pt|sandwiche}}, {{lang|pt|typo}} etc. (although {{vr|w}} was formally not included in the alphabet). # <span id="n-rom">[[#r-rom|↑]][[#r-rom:1|↑]][[#r-rom|↑]][[#r-rom:3|↑]]</span> '''Romani''' has the digraphs: {{vr|čh, dž, kh, ph, th}}. # <span id="n-ro">[[#r-ro|↑]]</span> '''Romanian''' normally uses the letters {{vr|ș, ț}} ({{vr|s, t}} with a [[comma (diacritic)|comma diacritic]] below) but they are frequently replaced by {{vr|ş, ţ}} ({{vr|s, t}} with a [[cedilla]]) due to past lack of standardization. {{vr|k, q, w, x, y}} occur only in loanwords. # <span id="n-sk">[[#r-sk:1|↑]][[#r-sk:2|↑]][[#r-sk:3|↑]]</span> '''Slovak''' also has the digraphs {{vr|dz, dž, ch}} which are considered separate letters. While {{vr|{{wrap|á, ä, ď, é, í, ĺ, ň, ó, ô, ŕ, ť, ú', ý}}}} are considered separate letters, in collation they are treated merely as letters with diacritics. However, {{vr|č, ľ, š, ž}}, as well as the digraphs, are actually sorted as separate letters. {{vr|q, w, x, ö, ü}} occur only in loanwords. # <span id="n-wen">[[#r-wen|↑]][[#r-wen:1|↑]][[#r-wen:2|↑]][[#r-wen:3|↑]][[#r-wen:4|↑]]</span> '''Sorbian''' also uses the digraphs: {{vr|ch}}, {{vr|dź}}. {{vr|ř}} is only used in Upper Sorbian, and {{vr|ŕ}}, {{vr|ś}}, and {{vr|ź}} (outside the digraph {{vr|dź}}) are only used in Lower Sorbian. # <span id="n-es">[[#r-es:1|↑]][[#r-es:2|↑]][[#r-es:3|↑]]</span> '''Spanish''' uses several digraphs to represent single sounds: {{vr|ch}}, {{vr|gu}} (preceding {{vr|e}} or {{vr|i}}), {{vr|ll}}, {{vr|qu}}, {{vr|rr}}; of these, the digraphs {{vr|[[ch (digraph)|ch]]}} and {{vr|[[ll]]}} were traditionally considered individual letters with their own name ({{lang|es|che}}, {{lang|es|elle}}) and place in the alphabet (after {{vr|c}} and {{vr|l}}, respectively), but in order to facilitate international compatibility the [[Real Academia Española|Royal Spanish Academy]] decided to cease this practice in 1994 and all digraphs are now collated as combinations of two separate characters. While cedilla is etymologically Spanish diminutive of ceda ({{vr|z}}) and Sancho Pança is the original form in Cervantes books, C with cedilla {{vr|ç}} is now completely displaced by {{vr|z}} in contemporary language. In poetry, the diaeresis may be used to break a diphthong into separate vowels. Regarding that usage, Ortografía de la lengua española states that "diaeresis is usually placed over the closed vowel [i.e. {{vr|i}} or {{vr|u}}] and, when both are closed, generally over the first"{{Citation needed|date=March 2020}}. In this context, the use of {{vr|ï}} is rare, but part of the normative orthography. # <span id="n-sv">[[#r-sv|↑]]</span> '''Swedish''' uses {{vr|é}} in well integrated loan words like {{lang|sv|idé}} and {{lang|sv|armé}}, although {{vr|é}} is considered a modified {{vr|e}}, while {{vr|å}}, {{vr|ä}}, {{vr|ö}} are letters. {{vr|á}} and {{vr|à}} are rarely used words. {{vr|w}} and {{vr|z}} are used in some integrated words like [[World Wide Web|webb]] and [[wikt:zone|zon]]. {{vr|q}}, {{vr|ü}}, {{vr|è}}, and {{vr|ë}} are used for names only, but exist in Swedish names. For foreign names {{vr|ó}}, {{vr|ç}}, {{vr|ñ}} and more are sometimes used, but usually not. Swedish has many digraphs and some trigraphs. {{vr|{{wrap|ch, dj, lj, rl, rn, rs, sj, sk, si, ti, sch, skj, stj}}}} and others are usually pronounced as one sound. # <span id="n-tn">[[#r-tn|↑]][[#r-tn:1|↑]][[#r-tn:2|↑]][[#r-tn:3|↑]]</span> '''Tswana''' also has the digraphs: {{vr|{{wrap|kg, kh, ng, ph, th, tl, tlh, ts, tsh, tš, tšh}}}}. The letters {{vr|c}}, {{vr|q}}, and {{vr|x}} only appear in onomatopoeic and loanwords. The letters {{vr|v}} and {{vr|z}} only appear in loanwords. # <span id="n-tk">[[#r-tk|↑]][[#r-tk:1|↑]][[#r-tk:2|↑]][[#r-tk:3|↑]]</span> '''Turkmen''' had a slightly different alphabet in 1993–1995 (which used some rare letters) {{vr|ý}} was written as {{vr|ÿ}} (capital {{vr|¥}}), {{vr|ň}} as {{vr|ñ}}, {{vr|ş}} as {{vr|¢}} (capital {{vr|$}}), and {{vr|ž}} as {{vr|⌠}} (capital {{vr|£}}) (so that all characters were available in [[Code page 437]]). In the new alphabet, all characters are available in [[ISO/IEC 8859-2]]. # <span id="n-uli">[[#r-uli|↑]][[#r-uli:1|↑]][[#r-uli:2|↑]]</span> '''Ulithian''' also has the digraphs: {{vr|ch, l', mw, ng}}. {{vr|c}} is used only in digraphs. # <span id="n-uz">[[#r-uz|↑]]</span> '''Uzbek''' also has the digraphs: {{vr|ch, ng, sh}} considered as letters. {{vr|c}} is used only in digraphs. {{vr|g{{'}}}}, {{vr|o{{'}}}} and apostrophe {{vr|{{'}}}} are considered as letters. These letters have preferred typographical variants: {{vr|gʻ}}, {{vr|oʻ}}, and {{vr|ʼ}} respectively. # <span id="n-ve">[[#r-ve|↑]][[#r-ve:1|↑]][[#r-ve:2|↑]][[#r-ve:3|↑]]</span> '''Venda''' also has the digraphs and trigraphs: {{vr|{{wrap|bv, bw, dz, dzh, dzw, fh, hw, kh, khw, ng, ny, nz, ṅw, ph, pf, pfh, sh, sw, th, ts, tsh, tsw, ty, ṱh, vh, zh, zw}}}}. {{vr|c, j, q}} are used in foreign words. # <span id="n-vi">[[#r-vi|↑]][[#r-vi:1|↑]][[#r-vi:2|↑]][[#r-vi:3|↑]]</span> '''Vietnamese''' has seven additional base letters: {{vr|ă â đ ê ô ơ ư}}. It uses five tone markers that can go on top (or below) any of the 12 vowels ({{vr|{{wrap|a, ă, â, e, ê, i, o, ô, ơ, u, ư, y}}}}); e.g.: grave accent ({{vr|{{wrap|à, ằ, ầ, è, ề, ì, ò, ồ, ờ, ù, ừ, ỳ}}}}), hook above ({{vr|{{wrap|ả, ẳ, ẩ, ẻ, ể, ỉ, ỏ, ổ, ở, ủ, ử, ỷ}}}}), tilde ({{vr|{{wrap|ã, ẵ, ẫ, ẽ, ễ, ĩ, õ, ỗ, ỡ, ũ, ữ, ỹ}}}}), acute accent ({{vr|{{wrap|á, ắ, ấ, é, ế, í, ó, ố, ớ, ú, ứ, ý}}}}), and dot below ({{vr|{{wrap|ạ, ặ, ậ, ẹ, ệ, ị, ọ, ộ, ợ, ụ, ự, ỵ}}}}). It also uses several digraphs and trigraphs {{vr|{{wrap|ch, gh, gi, kh, ng, ngh, nh, ph, th, tr}}}} but they are no longer considered letters. # <span id="n-wa">[[#r-wa|↑]][[#r-wa:1|↑]][[#r-wa:2|↑]][[#r-wa:3|↑]]</span> '''Walloon''' has the digraphs and trigraphs: {{vr|{{wrap|ae, ch, dj, ea, jh, oe, oen, oi, sch, sh, tch, xh}}}}. The letter {{vr|x}} outside the digraph {{vr|xh}} is in some orthographies, but not the default two. The letter {{vr|q}} is in some orthographies, but not in the default two. Also in some orthographies are {{vr|à}}, {{vr|ì}}, {{vr|ù}}, and even {{vr|e̊}} and {{vr|o̊}} (which are not available as a precomposed character in Unicode, so {{vr|ë}} and {{vr|ö}} are used as substitutes) # <span id="n-cy">[[#r-cy|↑]][[#r-cy:1|↑]][[#r-cy:2|↑]][[#r-cy:3|↑]]</span> '''Welsh''' has the digraphs {{vr|ch}}, {{vr|dd}}, {{vr|ff}}, {{vr|ng}}, {{vr|ll}}, {{vr|ph}}, {{vr|rh}}, {{vr|th}}. Each of these digraphs is collated as a separate letter, and {{vr|ng}} comes immediately after {{vr|g}} in the alphabet. It also frequently uses [[circumflex]]es, and occasionally uses [[Diaeresis (diacritic)|diaereses]], [[acute accent]]s and [[grave accent]]s, on its seven vowels ({{vr|a, e, i, o, u, w, y}}), but accented characters are not regarded as separate letters of the alphabet. # <span id="n-xh">[[#r-xh|↑]][[#r-xh:1|↑]][[#r-xh:2|↑]][[#r-xh:3|↑]]</span> '''Xhosa''' has a large number of digraphs, trigraphs, and even one tetragraph are used to represent various phonemes: {{vr|{{wrap|bh, ch, dl, dy, dz, gc, gq, gr, gx, hh, hl, kh, kr, lh, mb, mf, mh, nc, ndl, ndz, ng, ng', ngc, ngh, ngq, ngx, nh, nkc, nkq, nkx, nq, nx, ntl, ny, nyh, ph, qh, rh, sh, th, ths, thsh, ts, tsh, ty, tyh, wh, xh, yh, zh}}}}. It also occasionally uses [[acute accent]]s, [[grave accent]]s, [[circumflex]]es, and [[Diaeresis (diacritic)|diaereses]] on its five vowels ({{vr|a, e, i, o, u}}), but accented characters are not regarded as separate letters of the alphabet. # <span id="n-ya">[[#r-ya|↑]][[#r-ya:1|↑]][[#r-ya:2|↑]]</span> '''Yapese''' has the digraphs and trigraphs: {{vr|{{wrap|aa, ae, ch, ea, ee, ii, k', l', m', n', ng, ng', oe, oo, p', t', th, th', uu, w', y{{'}}}}}}. {{vr|q}}, representing the glottal stop, is not always used. Often an apostrophe is used to represent the glottal stop instead. {{vr|c}} is used only in digraphs. {{vr|h}} is used only in digraphs and loanwords. {{vr|q}} is used only in loanwords. # <span id="n-yo">[[#r-yo|↑]][[#r-yo:1|↑]][[#r-yo:2|↑]][[#r-yo:3|↑]]</span> '''Yoruba''' uses the digraph {{vr|gb}}. Also, vowels take a grave accent, an acute accent, or no accent, depending on tone. Although the "dot below" diacritic is widely used, purists prefer a short vertical underbar (Unicode COMBINING VERTICAL LINE BELOW U+0329) - this resembles the IPA notation for a syllabic consonant, attached to the base of the letter ({{vr|e}}, {{vr|o}} or {{vr|s}}). The seven Yoruba vowels ({{vr|a}}, {{vr|e}}, {{vr|ẹ}}, {{vr|i}}, {{vr|o}}, {{vr|ọ}}, {{vr|u}}) can be uttered in three different tones: high (acute accent); middle (no accent) and low (grave accent). The letters {{vr|m}} and {{vr|n}}, when written without diacritics, indicate nasalisation of the preceding vowel. {{vr|m}} and {{vr|n}} also occur as syllabics - in these circumstances, they take acute or grave tonal diacritics, like the vowels. Middle tone is marked with a macron to differentiate it from the unmarked nasalising consonants. A tilde was used in older orthography (still occasionally used) to indicate a double vowel. This is tonally ambiguous, and has now been replaced by showing the paired vowels, each marked with the appropriate tones. However, where a double vowel has the tonal sequence high-low or low-high, it may optionally be replaced by a single vowel with a circumflex (high-low) or caron (low-high), e.g. á + à = {{vr|â}}; à + á = {{vr|ǎ}}. # <span id="n-zun">[[#r-zun|↑]][[#r-zun:1|↑]] '''Zuni''' contains the glottal stop {{vr|{{'}}}} and the digraph: {{vr|ch}}; {{vr|c}} is only used in that digraph. The other digraphs {{vr|kw}}, {{vr|sh}}, and {{vr|ts}} are not part of the alphabet.</span>
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)