Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Extended Unix Code
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
==EUC-TW== '''EUC-TW''' is a [[variable-width encoding|variable-length encoding]] that supports ASCII and 16 planes of {{nowrap|[[CNS 11643]]}}, each of which is 94Γ94. It is a rarely used encoding for [[traditional Chinese characters]] as used in [[Taiwan]]. Variants of [[Big5]] are much more common than EUC-TW, although Big5 only encodes the first two planes of CNS 11643 [[hanzi]], while [[UTF-8]] is becoming more common. * As an EUC/[[ISO 2022]] encoding, the [[C0 and C1 control codes#C0|C0 control characters]], ASCII space, and DEL are encoded as in ASCII. * A graphical character from ASCII (G0, code set 0) is encoded in GL as its usual single-byte representation (0x21β0x7E). * A character from CNS 11643 plane 1 (code set 1) is encoded as two bytes in GR (0xA1β0xFE). * A character in planes 1 through 16 of CNS 11643 (code set 2) is encoded as four bytes: ** The first byte is always 0x8E (Single Shift 2). ** The second byte (0xA1β0xB0) indicates the plane, the number of which is obtained by subtracting 0xA0 from that byte. ** The third and fourth bytes are in GR (0xA1β0xFE). Note that plane 1 of CNS 11643 is encoded twice as code set 1 and a part of code set 2.
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)