=== Anomalies ===
{{Main|Unicode alias names and abbreviations}}
''The Unicode Standard'' imposes rules intended to guarantee stability.<ref>{{Cite web|url=https://www.unicode.org/policies/stability_policy.html|title=Character Encoding Stability|website=Unicode |url-status=live |archive-url=https://web.archive.org/web/20240101053402/https://www.unicode.org/policies/stability_policy.html |archive-date= Jan 1, 2024 }}</ref> Depending on the strictness of a rule, a change can be prohibited or allowed. For example, a "name" given to a code point cannot and will not change, whereas a "script" property is more flexible under Unicode's own rules. In version 2.0, Unicode changed many code point "names" from version 1. At the same time, Unicode stated that, thenceforth, a name assigned to a code point would never change. This implies that when mistakes are published, they cannot be corrected, even if they are trivial (as happened in one instance with the spelling {{sc2|{{typo|BRAKCET}}}} for {{sc2|BRACKET}} in a character name).

In 2006 a list of anomalies in character names was first published, and, as of June 2021, there were 104 characters with identified issues,<ref name="tn27">{{Cite web |date=14 June 2021 |title=Unicode Technical Note #27: Known Anomalies in Unicode Character Names |url=https://unicode.org/notes/tn27/ |website=Unicode}}</ref> for example:
* {{unichar|034F|COMBINING GRAPHEME JOINER|nlink=Combining grapheme joiner}}: Does not join graphemes.<ref name="tn27" />
* {{unichar|2118|SCRIPT CAPITAL P|nlink=Weierstrass p}}: This is a small letter. The capital is {{unichar|1D4AB|MATHEMATICAL SCRIPT CAPITAL P}}.<ref>{{Cite web|url=https://www.unicode.org/charts/PDF/U2100.pdf|title=Unicode chart: "actually this has the form of a lowercase calligraphic p, despite its name"}}</ref>
* {{unichar|A015|YI SYLLABLE WU|nlink=Yi language}}: This is not a Yi syllable, but a Yi iteration mark.
* {{unichar|FE18|PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR {{typo|BRAKCET}}}}: ''bracket'' is spelled incorrectly.<ref>{{Cite web|url=https://www.unicode.org/charts/PDF/UFE10.pdf|title=Misspelling of BRACKET in character name is a known defect}}</ref> (Spelling errors are resolved by using [[Unicode alias names and abbreviations|Unicode alias names]].)

While Unicode defines the script designator (name) to be "{{tt|[[ʼPhags-pa script|Phags_Pa]]}}", that script's character names use a hyphen instead: {{Unichar|A840|PHAGS-PA LETTER KA}}.<ref name=USA24>{{Cite web |year=2021 |title=Unicode Standard Annex #24: Unicode Script Property |url=https://www.unicode.org/reports/tr24/ |access-date=29 April 2022 |publisher=The Unicode Consortium |at=2.2 Relation to ISO 15924 Codes}}</ref><ref>{{Cite web |year=2023 |title=Scripts-15.1.0.txt |url=https://www.unicode.org/Public/UNIDATA/Scripts.txt |access-date=12 September 2023 |publisher=The Unicode Consortium}}</ref> This, however, is not an anomaly, but the rule: hyphens are replaced by underscores in script designators.<ref name=USA24 />
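
The immutable names and the alias mechanism described above can be observed with a short Python sketch. It assumes Python 3.3 or later, where the standard <code>unicodedata</code> module resolves name aliases in <code>lookup()</code>. The helper <code>formal_names</code> and the variable names are illustrative only and do not come from the Unicode Standard.

```python
import unicodedata

# Code points from the list above, mapped to their formal (immutable) names.
ANOMALIES = {
    "\u034F": "COMBINING GRAPHEME JOINER",
    "\u2118": "SCRIPT CAPITAL P",
    "\uA015": "YI SYLLABLE WU",
    # The misspelling is part of the formal name and cannot be changed.
    "\uFE18": "PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRAKCET",
}


def formal_names(chars):
    """Hypothetical helper: map 'U+XXXX' labels to formal character names."""
    return {f"U+{ord(c):04X}": unicodedata.name(c) for c in chars}


names = formal_names(ANOMALIES)

# The corrected spelling is not the formal name; it resolves as an alias.
corrected = "PRESENTATION FORM FOR VERTICAL RIGHT WHITE LENTICULAR BRACKET"
via_alias = unicodedata.lookup(corrected)
via_formal = unicodedata.lookup(ANOMALIES["\uFE18"])

# Character names in the Phags-pa block use a hyphen, unlike the
# script designator "Phags_Pa", which uses an underscore.
phags_pa_name = unicodedata.name("\uA840")

if __name__ == "__main__":
    for label, name in names.items():
        print(label, name)
    print("alias and formal name agree:", via_alias == via_formal)
    print("U+A840", phags_pa_name)
```

Both the misspelled formal name and the corrected alias resolve to the same code point, which is how software can accept the intended spelling without the formal name ever changing.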