=== Web ===
{{Main|Unicode and HTML}}
All [[W3C]] recommendations have used Unicode as their ''document character set'' since HTML 4.0. [[Web browser]]s have supported Unicode, especially UTF-8, for many years. Display problems used to arise, primarily from [[typeface|font]]-related issues; for example, version 6 and older of Microsoft [[Internet Explorer]] did not render many code points unless explicitly told to use a font that contains them.<ref>{{Cite web | last = Wood | first = Alan | title = Setting up Windows Internet Explorer 5, 5.5 and 6 for Multilingual and Unicode Support: ''Options for enabling Unicode in Internet Explorer 5, 5.5 and 6: Fonts (IE 5, 5.5 and 6)'' | url = https://www.alanwood.net/unicode/explorer.html#ie5 | publisher = Alan Wood | date = 2005-09-13 | access-date = 2025-04-12 | archive-url = https://web.archive.org/web/20250120141644/https://www.alanwood.net/unicode/explorer.html#ie5 | archive-date = 2025-01-20 | url-status = live }}</ref>

Although syntax rules may affect the order in which characters are allowed to appear, [[XML]] (including [[XHTML]]) documents, by definition,<ref>{{Cite web | title = Extensible Markup Language (XML) 1.1 (Second Edition) | url = https://www.w3.org/TR/xml11 | publisher = [[World Wide Web Consortium]] | date = 2006-09-29 | access-date = 2025-04-12 | archive-url = https://web.archive.org/web/20250405204806/https://www.w3.org/TR/xml11/ | archive-date = 2025-04-05 | url-status = live }}</ref> comprise characters from most of the Unicode code points, with the exception of:
* most of the [[C0 and C1 control codes|C0 control codes]],
* the permanently unassigned code points D800–DFFF,
* FFFE or FFFF.

HTML characters appear either directly as [[byte]]s according to the document's encoding, if the encoding supports them, or as numeric character references based on the character's Unicode code point.
For example, the references <code>&#916;</code>, <code>&#1049;</code>, <code>&#1511;</code>, <code>&#1605;</code>, <code>&#3671;</code>, <code>&#12354;</code>, <code>&#21494;</code>, <code>&#33865;</code>, and <code>&#47568;</code> (or the same numeric values expressed in hexadecimal, with <code>&#x</code> as the prefix) should display on all browsers as Δ, Й, ק, م, ๗, あ, 叶, 葉, and 말. When specifying [[Uniform Resource Identifier|URIs]], for example as [[URL]]s in [[HTTP]] requests, non-ASCII characters must be [[percent encoding|percent-encoded]].
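
The two mechanisms described above, numeric character references and percent-encoding, can be illustrated with a minimal Python sketch. It relies only on the standard-library functions <code>html.unescape</code> and <code>urllib.parse.quote</code>; the helper names <code>to_decimal_reference</code>, <code>to_hex_reference</code>, <code>decode_references</code>, and <code>percent_encode</code> are hypothetical and chosen here for illustration. The sketch assumes UTF-8 as the byte encoding for percent-encoding, which is the common choice but not the only possible one.

```python
import html
from urllib.parse import quote


def to_decimal_reference(text):
    """Replace each non-ASCII character with a decimal reference, e.g. &#916;."""
    return "".join(ch if ord(ch) < 128 else f"&#{ord(ch)};" for ch in text)


def to_hex_reference(text):
    """Replace each non-ASCII character with a hexadecimal reference, e.g. &#x394;."""
    return "".join(ch if ord(ch) < 128 else f"&#x{ord(ch):X};" for ch in text)


def decode_references(markup):
    """Resolve numeric (and named) character references back to characters."""
    return html.unescape(markup)


def percent_encode(text):
    """Percent-encode the UTF-8 bytes of every character that is not URI-safe."""
    return quote(text, safe="")


if __name__ == "__main__":
    for ch in "ΔЙקم๗あ叶葉말":
        print(ch, to_decimal_reference(ch), to_hex_reference(ch), percent_encode(ch))
```

In this sketch the decimal and hexadecimal forms decode to the same character, so <code>decode_references("&#916;")</code> and <code>decode_references("&#x394;")</code> both give Δ, while <code>percent_encode("葉")</code> gives <code>%E8%91%89</code>, the three UTF-8 bytes of U+8449.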