Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Extended Unix Code
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==={{anchor|MacChineseSimp|x-mac-chinesesimp}}Mac OS Chinese Simplified=== Other EUC-CN variants deviating from the EUC mechanism include the [[classic Mac OS]] Chinese Simplified script (known as Code page 10008 or <code>x-mac-chinesesimp</code>).<ref name="msdnlabels">{{cite web|url=https://msdn.microsoft.com/en-us/library/system.text.encoding.windowscodepage(v=vs.110).aspx |title=Encoding.WindowsCodePage Property – .NET Framework (current version) |work=MSDN |publisher=Microsoft}}</ref> It uses the bytes 0x80, 0x81, 0x82, 0xA0, 0xFD, 0xFE, and 0xFF for the [[ü|U with umlaut]] (ü), two special font metric characters, the [[non-breaking space]], the [[copyright sign]] (©), the [[trademark sign]] (™) and the ellipsis (...) respectively.<ref name="macsimchinese">{{cite web|url=https://unicode.org/Public/MAPPINGS/VENDORS/APPLE/CHINSIMP.TXT|title=Map (external version) from Mac OS Chinese Simplified encoding to Unicode 3.0 and later.|publisher=[[Apple, Inc]]}}<!-- Note: The comment blurb at the start of the file gives 0xFC and 0xFD as the © and ™ one-byte codes, contradicting its (the blurb's) statement that 0xA1–0xFC are the double-byte lead bytes. The actual (more authoritative) mapping data in the file lists 0xFD and 0xFE as the © and ™ one-byte codes. --></ref> This differs in what is regarded as a single-byte character versus the first byte of a two-byte character from both EUC (where, of those, 0xFD and 0xFE are defined as lead bytes) and GBK (where, of those, 0x81, 0x82, 0xFD and 0xFE are defined as lead bytes). This use of 0xA0, 0xFD, 0xFE and 0xFF matches [[MacJapanese|Apple's Shift_JIS variant]]. Besides these changes to the lead byte range, the other distinctive feature of the double-byte portion of Mac OS Chinese Simplified is the inclusion of two extensions to the basic GB 2312-80 set in rows 6 and 8.<ref name="macsimchinese" /> These are considered "standard extensions to GB 2312", neither of which is proprietary to Apple: the row 8 extension was taken from [[GB 6345.1]],<ref name="macsimchinese" /> both extensions are included by [[GB/T 12345]] (the traditional Chinese variant of GB 2312),<ref>{{cite book |title=Appendix F: GB/T 12345 |last=Lunde |first=Ken |author-link=Ken Lunde |chapter=CJKV Information Processing |isbn=9781565922242 |year=1998 |url=https://resources.oreilly.com/examples/9781565922242/blob/master/AppF/gbt12345.pdf |publisher=[[O'Reilly Media]]}}</ref> and both extensions are included by [[GB 18030]] (the successor to GB 2312).<ref>{{Cite book|url=https://archive.org/details/GB18030-2005|title=GB 18030-2005: Information Technology—Chinese coded character set|last=Standardization Administration of China (SAC)|date=2005-11-18}}</ref>
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)