===Alphabetical===
{{Main article|Alphabetical order}}
[[Alphabetical order]] is the basis for many systems of collation where items of information are identified by strings consisting principally of [[letter (alphabet)|letters]] from an [[alphabet]]. The ordering of the strings relies on the existence of a standard ordering for the letters of the alphabet in question. (The system is not limited to alphabets in the strict technical sense; languages that use a [[syllabary]] or [[abugida]], for example [[Cherokee language|Cherokee]], can use the same ordering principle provided there is a set ordering for the symbols used.)

To decide which of two strings comes first in alphabetical order, initially their first letters are compared. The string whose first letter appears earlier in the alphabet comes first in alphabetical order. If the first letters are the same, then the second letters are compared, and so on, until the order is decided. (If one string runs out of letters to compare, then it is deemed to come first; for example, "cart" comes before "carthorse".) The result of arranging a set of strings in alphabetical order is that words with the same first letter are grouped together, and within such a group words with the same first two letters are grouped together, and so on.

[[Capital letter]]s are typically treated as equivalent to their corresponding lowercase letters. (For alternative treatments in computerized systems, see [[#Automated collation|Automated collation]], below.)

Certain limitations, complications, and special conventions may apply when alphabetical order is used:
* When strings contain [[space (character)|spaces]] or other word dividers, the decision must be taken whether to ignore these dividers or to treat them as symbols preceding all other letters of the alphabet. For example, if the first approach is taken then "car park" will come after "carbon" and "carp" (as it would if it were written "carpark"), whereas in the second approach "car park" will come before those two words. The first rule is used in many (but not all) [[dictionary|dictionaries]], the second in [[telephone directory|telephone directories]] (so that Wilson, Jim K appears with other people named Wilson, Jim and not after Wilson, Jimbo).
* Abbreviations may be treated as if they were spelt out in full. For example, names containing "St." (short for the English word ''[[Saint]]'') are often ordered as if they were written out as "Saint". There is also a traditional convention in English that surnames beginning ''Mc'' and ''M''' are listed as if those prefixes were written ''Mac''.
* Strings that represent personal names will often be listed by alphabetical order of surname, even if the [[given name]] comes first. For example, Juan Hernandes and Brian O'Leary should be sorted as "Hernandes, Juan" and "O'Leary, Brian" even if they are not written this way.
* Very common initial words, such as ''The'' in English, are often ignored for sorting purposes. So ''[[The Shining (novel)|The Shining]]'' would be sorted as just "Shining" or "Shining, The".
* When some of the strings contain [[numerical digit|numerals]] (or other non-letter characters), various approaches are possible. Sometimes such characters are treated as if they came before or after all the letters of the alphabet. Another method is for numbers to be sorted alphabetically as they would be spelled: for example ''[[1776 (film)|1776]]'' would be sorted as if spelled out "seventeen seventy-six", and {{Lang|fr|[[24 heures du Mans]]}} as if spelled "vingt-quatre..." (French for "twenty-four"). When numerals or other symbols are used as special graphical forms of letters, as in ''1337'' for [[leet]] or ''Se7en'' for the movie title ''[[Seven (1995 film)|Seven]]'', they may be sorted as if they were those letters.
* Languages have different conventions for treating [[modified letter]]s and certain letter combinations. For example, in [[Spanish language|Spanish]] the letter ''ñ'' is treated as a basic letter following ''n'', and the [[digraph (orthography)|digraphs]] ''ch'' and ''ll'' were formerly (until 1994) treated as basic letters following ''c'' and ''l'', although they are now alphabetized as two-letter combinations. A list of such conventions for various languages can be found at {{slink|Alphabetical order|Language-specific conventions}}.

In several languages the rules have changed over time, and so older dictionaries may use a different order than modern ones. Furthermore, collation may depend on use. For example, German [[Dictionary|dictionaries]] and [[telephone directory|telephone directories]] use different approaches.
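
The letter-by-letter comparison and some of the conventions listed in this section can be illustrated with a short Python sketch. It is a simplified illustration only, not a standard collation algorithm: the function names are hypothetical, only the space character is treated as a word divider, only the English article ''The'' is dropped, and the Spanish example handles just the 27 letters of the modern Spanish alphabet.

```python
import unicodedata

def compare_alphabetical(a: str, b: str) -> int:
    """Compare two strings letter by letter, ignoring case.

    Returns -1 if a sorts first, 1 if b sorts first, 0 if they are equal.
    """
    a, b = a.lower(), b.lower()
    for x, y in zip(a, b):
        if x != y:
            return -1 if x < y else 1
    # One string ran out of letters: the shorter one comes first.
    return (len(a) > len(b)) - (len(a) < len(b))

def key_ignore_dividers(s: str) -> str:
    """First approach: ignore spaces, so "car park" sorts like "carpark"."""
    return s.lower().replace(" ", "")

def key_dividers_first(s: str) -> str:
    """Second approach: a space precedes every letter.

    This relies on the space character (U+0020) having a lower code point
    than any letter, which holds for Python's default string ordering.
    """
    return s.lower()

def key_drop_article(title: str) -> str:
    """Ignore a leading "The" (assumption: English titles only)."""
    lowered = title.lower()
    return lowered[4:] if lowered.startswith("the ") else lowered

# Modern Spanish alphabet; "\u00f1" is the letter n with tilde, placed after n.
SPANISH_ALPHABET = "abcdefghijklmn\u00f1opqrstuvwxyz"

def key_spanish(s: str) -> list[int]:
    """Map each letter to its position in the Spanish alphabet.

    Characters outside the alphabet are skipped in this simplified sketch.
    """
    s = unicodedata.normalize("NFC", s.lower())
    return [SPANISH_ALPHABET.index(ch) for ch in s if ch in SPANISH_ALPHABET]

words = ["carp", "car park", "carbon"]
by_ignoring = sorted(words, key=key_ignore_dividers)
by_space_first = sorted(words, key=key_dividers_first)
spanish_sorted = sorted(["oso", "\u00f1u", "nube", "nota"], key=key_spanish)
```

Under these assumptions, <code>by_ignoring</code> places "car park" after "carbon" and "carp", while <code>by_space_first</code> places it before both, matching the two approaches described for word dividers. The Spanish key places the word beginning with ''ñ'' between the words beginning with ''n'' and those beginning with ''o'', whereas a plain code-point sort would place it after "oso".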