Template:Short description Template:Use dmy dates Template:IPA notice Welsh orthography uses 29 letters (including eight digraphs) of the Latin script to write native Welsh words as well as established loanwords.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref><ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Majuscule forms (also called uppercase or capital letters)
A B C CH D DD E F FF G NG H I J L LL M N O P PH R RH S T TH U W Y
Titlecase forms
A B C Ch D Dd E F Ff G Ng H I J L Ll M N O P Ph R Rh S T Th U W Y
Minuscule forms (also called lowercase or small letters)
a b c ch d dd e f ff g ng h i j l ll m n o p ph r rh s t th u w y

Welsh orthography makes use of multiple diacritics, which are primarily used on vowels, namely the acute accent ({{#invoke:Lang|lang}}), the grave accent ({{#invoke:Lang|lang}}), the circumflex ({{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, or {{#invoke:Lang|lang}}) and the diaeresis ({{#invoke:Lang|lang}}). They are considered variants of their base letter, i.e. they are not alphabetised separately. The Welsh alphabet also lacks Template:Vr ({{#invoke:Lang|lang}}, {{#invoke:IPA|main}}), Template:Vr ({{#invoke:Lang|lang}}, {{#invoke:IPA|main}}), Template:Vr ({{#invoke:Lang|lang}}, {{#invoke:IPA|main}}), Template:Vr ({{#invoke:Lang|lang}}, {{#invoke:IPA|main}}), and Template:Vr ({{#invoke:Lang|lang}}, {{#invoke:IPA|main}}/{{#invoke:IPA|main}}).<ref>https://www.chelmsfordwelsh.org.uk/alphabet.html#:~:text=The%20letters%20J%2C%20K%2C%20Q,%2C%20ph%2C%20rh%20and%20th.</ref>

Foreign wordsEdit

Welsh borrows a number of words from English.<ref>https://www.grin.com/document/58172</ref> Those words are spelled according to Welsh spelling conventions, for example: {{#invoke:Lang|lang}} "bus", {{#invoke:Lang|lang}} "buck", {{#invoke:Lang|lang}} "bucket", {{#invoke:Lang|lang}} "car", {{#invoke:Lang|lang}} "noggin", {{#invoke:Lang|lang}} "gob", {{#invoke:Lang|lang}} "slogan", {{#invoke:Lang|lang}} "flannel", {{#invoke:Lang|lang}} "truant", and {{#invoke:Lang|lang}} "gaol".

Non-native letters in WelshEdit

The letter Template:Vr has only recentlyTemplate:When been accepted into Welsh orthography: for use in words borrowed from English which retain the {{#invoke:IPA|main}} sound, even when it originally was not represented by Template:Vr in English orthography, as in {{#invoke:Lang|lang}} ("garage"), {{#invoke:Lang|lang}} ("giraffe"), and {{#invoke:Lang|lang}} ("fridge"). Older borrowings of English words containing {{#invoke:IPA|main}} resulted in the sound being pronounced and spelled in various other ways, resulting in occasional doublets such as {{#invoke:Lang|lang}} and {{#invoke:Lang|lang}} ("Japan").Template:Efn

The letters Template:Vr are not part of the Welsh Alphabet. However, these letters are used in foreign proper names and their derivatives: {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}. They are also sometimes used in technical and other specialized terms, like kilogram, queer, volt and zero, but in all cases can be, and often are, nativised: {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}}, {{#invoke:Lang|lang}} and {{#invoke:Lang|lang}}.<ref>Thomas, Peter Wynn (1996) Gramadeg y Gymraeg. Cardiff: University of Wales Press: 757.</ref>

HistoryEdit

File:Welsh alphabet card italic C19th.jpg
A 19th-century Welsh alphabet printed in Welsh, without Template:Vr or Template:Vr

The earliest samples of written Welsh date from the 6th century and are in the Latin alphabet (see Old Welsh). The orthography differs from that of modern Welsh, particularly in the use of Template:Vr to represent the voiced plosives {{#invoke:IPA|main}} non initially. Similarly, the voiced fricatives {{#invoke:IPA|main}} were written Template:Vr.<ref name="Watkins">Watkins, T. Arwyn (1993) "Welsh" in Ball, Martin J. with Fife, James (Eds) The Celtic Languages. London/New York: Routledge: 289-348.</ref>

By the Middle Welsh period, this had given way to quite a bit of variability: Although Template:Vr were now used to represent {{#invoke:IPA|main}}, these sounds were also often written as in Old Welsh, while {{#invoke:IPA|main}} could be denoted by Template:Vr. In earlier manuscripts, moreover, fricatives were often not distinguished from plosives (e.g. Template:Vr for {{#invoke:IPA|main}}, now written Template:Vr).<ref>Evans, Simon D. (1964) A Grammar of Middle Welsh. Dublin: ColourBooks Ltd.</ref> The grapheme Template:Vr was also used, unlike in the modern alphabet, particularly before front vowels.<ref name="Watkins" /> The disuse of this letter is at least partly due to the publication of William Salesbury's Welsh New Testament and William Morgan's Welsh Bible, whose English printers, with type letter frequencies set for English and Latin, did not have enough Template:Vr letters in their type cases to spell every {{#invoke:IPA|main}} as Template:Vr, so the order went "C for K, because the printers have not so many as the Welsh requireth";<ref>English and Welsh,Template:Dead link an essay by J. R. R. Tolkien</ref> this was not liked at the time, but has become standard usage.

In this period, Template:Vr (capital Template:Vr) was also used interchangeably with Template:Vr, such as the passage in the 1567 New Testament: {{#invoke:Lang|lang}}, which contains both Template:Vr and Template:Vr. Elsewhere, the same word is spelt in different ways, e.g. {{#invoke:Lang|lang}} and {{#invoke:Lang|lang}}.<ref>Template:Citation.</ref>

The printer and publisher Lewis Jones, one of the co-founders of {{#invoke:Lang|lang}}, the Welsh-speaking settlement in Patagonia, favoured a limited spelling reform which replaced Welsh Template:Vr {{#invoke:IPA|main}} and Template:Vr {{#invoke:IPA|main}} with Template:Vr and Template:Vr, and from circa 1866 to 1886 Jones employed this innovation in a number of newspapers and periodicals he published and/or edited in the colony.<ref name="Watkins" /> However, the only real relic of this practice today is the Patagonian placename Trevelin ("mill town"), which in standard Welsh orthography would be {{#invoke:Lang|lang}}.

In 1928, a committee chaired by Sir John Morris-Jones standardised the orthography of modern Welsh.

In 1987, a committee chaired by Professor Stephen J. Williams made further small changes,Template:Which introducing Template:Vr. Not all modern writers adhere to the conventions established by these committees.<ref>Thomas, Peter Wynn (1996) Gramadeg y Gymraeg. Cardiff: University of Wales Press: 749.</ref>

Letter names and sound valuesEdit

"N" and "S" indicate variants specific to the northern and southern dialects of Welsh. Throughout Wales an alternative system is also in use in which all consonant letters are named using the corresponding consonant sound plus a schwa (e.g. {{#invoke:Lang|lang}} {{#invoke:IPA|main}} for {{#invoke:Lang|lang}}). In this system the vowels are named as below.

Letter Name Corresponding sounds English approximation
a lang}} main}} father (long)
b lang}} main}} bat
c lang}} main}} case
ch lang}} main}} No English equivalent; similar to loch in Scottish, but pronounced further back.
main}} when followed by a vowel; similarly, di and ti sometimes indicate {{#invoke:IPA|main}} and {{#invoke:IPA|main}} respectively when followed by a vowel, although these sounds are spelled j and ts in loanwords like {{#invoke:Lang|lang}} "jug" and {{#invoke:Lang|lang}} "watch".</ref> lang}} main}} day
dd lang}} main}} these
e lang}} main}} bed (short) / closest to hey (long)
f lang}} main}} of
ff lang}} main}} four
g lang}} main}} gate
ng lang}} main}} thing
main}}, h indicates voicelessness in the graphemes mh, nh, and ngh.</ref> lang}} main}} hat
i lang}}, {{#invoke:Lang|lang}} (S) main}} bit (short) / machine (long) / yes (as consonant; before vowels)
j lang}} main}} lang}} ('jelly', {{#invoke:IPA|main}}) and {{#invoke:Lang|lang}} ('jeans', {{#invoke:IPA|main}})
l lang}} main}} lad
ll lang}} main}} not present in English; a voiceless alveolar lateral fricative. A bit like what the consonant cluster "hl" would sound like.
m lang}} main}} mat
n lang}} main}} net
o lang}} main}} Short, like "bog" in RP; long like dawn in RP or stove in Scottish English
p lang}} main}} pet
ph<ref group=*>The digraph ph – which indicates the aspirate mutation of p (e.g. {{#invoke:Lang|lang}}) – may also be found very occasionally in words derived from Greek (e.g. {{#invoke:Lang|lang}}), although most words of Greek origin are spelt with ff (e.g. Template:Italics correction).</ref> lang}} main}} phone
r lang}} main}} Rolled R
rh lang}} main}} Voiceless rolled R
s<ref group=* name=si /> lang}} main}} sat
t<ref group=* name=si /> lang}} main}} stick
th lang}} main}} thin
u lang}} (N), {{#invoke:Lang|lang}} (S) main}} (N),<ref group="*" name=":0">In the North, the letters u and y are occasionally pronounced {{#invoke:IPA|main}}, the same as in the South, rather than {{#invoke:IPA|main}}. This is usually the case when the preceding vowel is {{#invoke:IPA|main}} or when y is preceded or followed by g {{#invoke:IPA|main}} or followed by w {{#invoke:IPA|main}}, forming a diphthong.{{#invoke:citation/CS1|citation CitationClass=web

}}</ref>
{{#invoke:IPA|main}} (S)

main}} not found in English. Identical to "î" and "â" in Romanian, and similar to the "e" in English roses.
w lang}} main}} push (short) / pool (long) / wet (as consonant)
main}} in unstressed monosyllabic words (e.g. {{#invoke:Lang|lang}} "the", {{#invoke:Lang|lang}} "my") or non-final syllables (regardless of whether these are stressed or not), but {{#invoke:IPA|main}} (N) or {{#invoke:IPA|main}} (S) in word-final syllables (again, regardless of stress).</ref> lang}} main}} (N),<ref name=":0" group="*" />
{{#invoke:IPA|main}} (S)
main}}, found in certain dialects of English that differentiate "Rosa's" and "roses", for example, General American.
Notes

<references group=* />

DiphthongsEdit

Orthography Northern dialects Southern dialects English (approximation only)
ae main}} <ref group=*>final or only syllable, e.g. chwarae, caeth</ref> main}} eye
main}} <ref group=*>non-final syllable. e.g. chwaraewr, caethwas</ref> main}} may
ai main}} main}} eye
au main}}, {{#invoke:IPA|main}} main}}, {{#invoke:IPA|main}} eye. Realised as bet (south) and cat (north) in plural endings.
aw main}} main}} how
ei main}} main}} As in eight
eu main}} main}} As in height
ew main}} main}} Roughly like Edward with the d removed: E'ward, or Cockney pronunciation of -ell in words like well, hell.
ey main}} main}} Two distinct vowels.
iw main}} main}} similar to Cockney pronunciation of -ill in words like bill, fill
oe main}} main}} boy
oi main}} main}} boy
ou main}} main}} boy
ow main}} main}} goal
uw main}} main}} main}}: see "iw" above. Northern {{#invoke:IPA|main}}: not present in English.
main}}). In these cases, the pronunciation of the "y" can then vary as shown in the previous table. For example: short final in "gwyn" ({{#invoke:IPA|main}} (N), {{#invoke:IPA|main}} (S)), long final in "gwych" ({{#invoke:IPA|main}} (N), {{#invoke:IPA|main}} (S)), non-final in "gwynion" ({{#invoke:IPA|main}}).</ref> main}} main}} not present in English; closest to gooey
main}}, see above) followed by a consonantal w {{#invoke:IPA|main}}</ref> main}} main}} see "uw" above
Notes

<references group=* />

DiacriticsEdit

Welsh makes use of a number of diacritics.

The circumflex (ˆ) is mostly used to mark long vowels, so â, ê, î, ô, û, ŵ, ŷ are always long. However, not all long vowels are marked with a circumflex, so the letters a, e, i, o, u, w, y with no circumflex do not necessarily represent short vowels; see Template:Section link.

The grave accent (`) is sometimes used, usually in words borrowed from another language, to mark vowels that are short when a long vowel would normally be expected, e.g. {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (a cough), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (a pass/permit or a lift in a car); {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (smoke), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (a mug).

The acute accent (´) is sometimes used to mark a stressed final syllable in a polysyllabic word. Thus the words {{#invoke:Lang|lang}} (to empty) and {{#invoke:Lang|lang}} (decline) have final stress. However, not all polysyllabic words with final stress are marked with the acute accent ({{#invoke:Lang|lang}} "Welsh" and {{#invoke:Lang|lang}} "forward/onward", for example, are written with none). The acute may also be used to indicate that a letter w represents a vowel where a glide might otherwise be expected, e.g. {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (two syllables) "manly", as opposed to {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (one syllable) "root".

Similarly, the diaeresis (¨) is used to indicate that two adjoining vowels are to be pronounced separately (not as a diphthong). However, it is also used to show that the letter i is used to represent the cluster {{#invoke:IPA|main}} which is always followed by another vowel, e.g. {{#invoke:Lang|lang}} (to copy) pronounced {{#invoke:IPA|main}}, not {{#invoke:IPA|main}}.

The grave and acute accents in particular are very often omitted in casual writing, and the same is true to a lesser extent of the diaeresis. The circumflex, however, is usually included. Accented vowels are not considered distinct letters for the purpose of collation.

Predicting vowel length from orthographyEdit

As mentioned above, vowels marked with the circumflex are always long, and those marked with the grave accent are always short. If a vowel is not marked with a diacritic, its length must be determined by its environment; the rules vary a bit according to dialect.<ref>Template:Cite book</ref><ref>Template:Cite book</ref>

In all dialects, only stressed vowels may be long; unstressed vowels are always short.

An unmarked (stressed) vowel is long:

  • in the last syllable of a word when no consonant follows: {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (good).
  • before voiced stops b, d, g and before all fricatives (except for ll) ch, dd, f, ff, th, s: {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (son), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (favourite), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (thing), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (night).

An unmarked vowel is short:

  • in an unstressed (proclitic) word: {{#invoke:Lang|lang}} {{#invoke:IPA|main}}.
  • before voiceless stops p, t, c {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (gate), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (sheepfold) and before all consonant clusters (except for those that start with s or ll) {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (saint), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (hedge), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (April).

The vowel y, when it is pronounced {{#invoke:IPA|main}}, is always shortTemplate:Contradictory inlineTemplate:Citation needed even when it appears in an environment where other vowels would be long: {{#invoke:Lang|lang}} (whole) {{#invoke:IPA|main}}. When pronounced as a close or near-close vowel ({{#invoke:IPA|main}} or {{#invoke:IPA|main}} in the North, {{#invoke:IPA|main}} or {{#invoke:IPA|main}} in the South), y follows the same rules as other vowels: {{#invoke:Lang|lang}} (day) {{#invoke:IPA|main}} (North) ~ {{#invoke:IPA|main}} (South), {{#invoke:Lang|lang}} (wind) {{#invoke:IPA|main}} (North) ~ {{#invoke:IPA|main}} (South).

Before l, m, n, and r, unmarked vowels are long in some words and short in others:

vowel long check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }} short check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}
i lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (wine) lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (scarcely)
e lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (old) lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (head)
y lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (man) lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (white)
w lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (bank up a fire) lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (often)
e lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (holly) lang}} main}} {{safesubst:#invoke:Check for unknown parameters|check|unknown=|preview=Page using Template:Center with unknown parameter "_VALUE_"|ignoreblank=y| 1 | style }}  (heart)

(The last four examples are given in South Welsh pronunciation only since vowels in nonfinal syllables are always short in North Welsh.)

Before nn and rr, vowels are always short: {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (ash trees), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (to win), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (stone).

In Northern dialects, long vowels are stressed and appear in the final syllable of the word. Vowels in non-final syllables are always short. In addition to the rules above, a vowel is long in the North before a consonant cluster beginning with s: {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (witness). Before ll, a vowel is short when no consonant follows the ll: {{#invoke:Lang|lang}} (better) {{#invoke:IPA|main}} It is long when another consonant does follow the ll: {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (hair).

In Southern dialects, long vowels may appear in a stressed penultimate syllable as well as in a stressed word-final syllable. Before ll, a stressed vowel in the last syllable can be either long (e.g. {{#invoke:Lang|lang}} "better" {{#invoke:IPA|main}}) or short (e.g. {{#invoke:Lang|lang}} "hole" {{#invoke:IPA|main}}). However, a stressed vowel in the penult before ll is always short: {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (clothes).Template:Citation needed Before s, a stressed vowel in the last syllable is long, as mentioned above, but a stressed vowel in the penult is short: {{#invoke:Lang|lang}} (measure) {{#invoke:IPA|main}}. Vowels are always short before consonant clusters: {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (saint), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (hair), {{#invoke:Lang|lang}} {{#invoke:IPA|main}} (witness).

DigraphsEdit

File:Lldigraph.png
The Middle-Welsh LL ligature.<ref>Template:Cite book</ref>
Unicode: U+1EFA and U+1EFB.

While the digraphs ch, dd, ff, ng, ll, ph, rh, th are each written with two symbols, they are all considered to be single letters. This means, for example that {{#invoke:Lang|lang}} (a town in South Wales) is considered to have only six letters in Welsh, compared to eight letters in English. Consequently, they each take up only a single space in Welsh crosswords. Ll itself had actually been written as the ligature Template:Not a typo in Middle Welsh.

Sorting is done in correspondence with the alphabet. For example, {{#invoke:Lang|lang}} comes before {{#invoke:Lang|lang}}, which comes before {{#invoke:Lang|lang}}, which comes before {{#invoke:Lang|lang}}. Automated sorting may occasionally be complicated by the fact that additional information may be needed to distinguish a genuine digraph from a juxtaposition of letters; for example {{#invoke:Lang|lang}} comes after {{#invoke:Lang|lang}} (in which the {{#invoke:Lang|lang}} stands for {{#invoke:IPA|main}}) but before {{#invoke:Lang|lang}} (in which n and g are pronounced separately as {{#invoke:IPA|main}}).

Although the digraphs above are considered to be single letters, only their first component letter is capitalised when a word in lower case requires an initial capital letter. Thus:

{{#invoke:Lang|lang}}, etc. (place names)
{{#invoke:Lang|lang}}, etc. (personal names)
{{#invoke:Lang|lang}} (other sentences starting with a digraph)

The two letters in a digraph are only both capitalised when the whole word is in uppercase:

{{#invoke:Lang|lang}} (as on a poster or sign)

The status of the digraphs as single letters is reflected in the stylised forms used in the logos of the National Library of Wales (logo) and Cardiff University (logo).

See alsoEdit

ReferencesEdit

Template:Reflist Template:Notelist

External linksEdit

Template:Sister project

Template:Welsh linguisticsTemplate:Language orthographies