Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Code page
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
{{Short description|Dated classifications of computing character sets}} {{Use dmy dates|date=January 2020|cs1-dates=y}} In [[computing]], a '''code page''' is a [[character encoding]] and as such it is a specific association of a set of printable [[character (computing)|character]]s and [[control character]]s with unique numbers. Typically each number represents the binary value in a single byte. (In some contexts these terms are used more precisely; see {{section link|Character encoding|Terminology}}.) The term "code page" originated from [[IBM]]'s [[EBCDIC]]-based mainframe systems,<ref name="IBM_i_EBCDIC"/> but [[Microsoft]], [[SAP AG|SAP]],<ref name="SAP_CP"/> and [[Oracle Corporation]]<ref name="oracle.com"/> are among the vendors that use this term. The majority of vendors identify their own character sets by a name. In the case when there is a plethora of character sets (like in IBM), identifying character sets through a number is a convenient way to distinguish them. Originally, the code page numbers referred to the [[page number|''page'' number]]s in the IBM standard character set manual,<ref name="DEC_VT510"/><ref name="DEC_1994_Codepage"/><ref name="Paul_2001_CODEPAGE"/> a condition which has not held for a long time. Vendors that use a code page system allocate their own code page number to a character encoding, even if it is better known by another name; for example, [[UTF-8]] has been assigned page numbers 1208 at IBM, 65001 at Microsoft, and 4110 at SAP. [[Hewlett-Packard]] uses a similar concept in its [[HP-UX]] operating system and its [[Printer Command Language]]<ref name="PCL"/> (PCL) protocol for printers (either for HP printers or not). The terminology, however, is different: What others call a ''character set'', HP calls a ''symbol set'', and what IBM or Microsoft call a ''code page'', HP calls a ''symbol set code''. HP developed a series of symbol sets,<ref name="PCL-Symbol"/><ref name="PCL5"/> each with an associated symbol set code, to encode both its own character sets and other vendorsβ character sets. The multitude of character sets leads many vendors to recommend [[Unicode]]. == The code page numbering system == IBM introduced the concept of systematically assigning a small, but globally unique, 16 bit number to each character encoding that a computer system or collection of computer systems might encounter. The IBM origin of the numbering scheme is reflected in the fact that the smallest (first) numbers are assigned to variations of IBM's EBCDIC encoding and slightly larger numbers refer to variations of IBM's [[extended ASCII]] encoding as used in its PC hardware. With the release of [[PC DOS]] version 3.3 (and the near identical [[MS-DOS]] 3.3) IBM introduced the code page numbering system to regular PC users, as the code page numbers (and the phrase "code page") were used in new commands to allow the character encoding used by all parts of the OS to be set in a systematic way.<ref name="Duncan_1988_MS-DOS_Encyclopedia"/> [[File:IBM CJK Code Page Numbers.svg|right|thumb|IBM code page numbers (CPGIDs and CCSIDs) used for CJK encodings. Microsoft's use of code page numbers for CJK encodings differs, and is noted in brackets where applicable.]] After IBM and Microsoft ceased to cooperate in the 1990s, the two companies have maintained the list of assigned code page numbers independently from each other, resulting in some conflicting assignments. At least one third-party vendor ([[Oracle Corporation|Oracle]]) also has its own different list of numeric assignments.<ref name="oracle.com"/> IBM's current assignments are listed in their [[CCSID]] repository, while Microsoft's assignments are documented within the [[MSDN]].<ref name="Microsoft_Codepage-ID"/> Additionally, a list of the names and approximate IANA ([[Internet Assigned Numbers Authority]]) abbreviations for the installed code pages on any given Windows machine can be found in the Registry on that machine (this information is used by Microsoft programs such as [[Internet Explorer]]). Most well-known code pages, excluding those for the [[CJK characters|CJK]] languages and [[Vietnamese language|Vietnamese]],<!-- not using CJKV here because this applies to the modern romanised Vietnamese --> fit all their code-points into eight bits and do not involve anything more than mapping each code-point to a single character; furthermore, techniques such as combining characters, complex scripts, etc., are not involved. The text mode of standard ([[VGA-compatible text mode|VGA-compatible]]) PC graphics hardware is built around using an 8-bit code page, though it is possible to use two at once with some color depth sacrifice, and up to eight may be stored in the [[display adapter]] for easy switching.<ref name="VGA-Programming"/> There was a selection of third-party code page fonts that could be loaded into such hardware. However, it is now commonplace for operating system vendors to provide their own character encoding and rendering systems that run in a graphics mode and bypass this hardware limitation entirely. However the system of referring to character encodings by a code page number remains applicable, as an efficient alternative to string identifiers such as those specified by the IETF and IANA for use in various protocols such as e-mail and web pages. === Relationship to ASCII === The majority of code pages in current use are supersets of [[ASCII]], a 7-bit code representing 128 control codes and printable characters. In the distant past, 8-bit implementations of the ASCII code set the top bit to zero or used it as a [[parity bit]] in network data transmissions. When the top bit was made available for representing character data, a total of 256 characters and control codes could be represented. Most vendors (including IBM) used this extended range to encode characters used by various languages and graphical elements that allowed the imitation of primitive graphics on text-only output devices. No formal standard existed for these "extended ASCII character sets" and vendors referred to the variants as code pages, as IBM had always done for variants of EBCDIC encodings. === Relationship to Unicode === Unicode is an effort to include all characters from all currently and historically used human languages into single character enumeration (effectively one large single code page), removing the need to distinguish between different code pages when handling digitally stored text. Unicode tries to retain backwards compatibility with many legacy code pages, copying some code pages 1:1 in the design process. An explicit design goal of Unicode was to allow round-trip conversion between all common legacy code pages, although this goal has not always been achieved. Some vendors, namely IBM and Microsoft, have anachronistically assigned code page numbers to Unicode encodings. This convention allows code page numbers to be used as metadata to identify the correct decoding algorithm when encountering binary stored data. ==IBM code pages== === EBCDIC-based code pages === These code pages are used by IBM in its EBCDIC character sets for [[IBM mainframe|mainframe computers]].<ref>{{cite web |archive-url=https://web.archive.org/web/20120716195651/http://www-03.ibm.com/systems/i/software/globalization/codepages.html |url=http://www-03.ibm.com/systems/i/software/globalization/codepages.html |archive-date=2012-07-16 |title=IBM i Globalization: Code Pages |publisher=IBM}}</ref> {{Div col|colwidth=30em}} * 1 β USA WP, Original * 2 β USA * 3 β USA Accounting, Version A * 4 β USA * 5 β USA * 6 β Latin America * 7 β Germany F.R. / Austria * 8 β Germany F.R. * 9 β France, Belgium * 10 β Canada (English) * 11 β Canada (French) * 12 β Italy * 13 β Netherlands * 14 β Spain * 15 β Switzerland (French) * 16 β Switzerland (French / German) * 17 β Switzerland (German) * 18 β Sweden / Finland * 19 β Sweden / Finland WP, version 2 * 20 β Denmark/Norway * 21 β Brazil * 22 β Portugal * 23 β United Kingdom * 24 β United Kingdom * 25 β Japan (Latin) * 26 β Japan (Latin) * 27 β Greece (Latin) * 29 β Iceland * 30 β Turkey * 31 β South Africa * 32 β Czechoslovakia (Czech / Slovak) * 33 β Czechoslovakia * 34 β Czechoslovakia * 35 β Romania * 36 β Romania * 37 β USA/Canada - CECP (same with euro: 1140) * 37-2 β The real 3279 APL codepage, as used by C/370. This is very close to 1047, except for caret and not-sign inverted. It is not officially recognized by IBM, even though [[SHARE (computing)|SHARE]] has pointed out its existence.<ref name="IBM_2010"/> * 38 β USA ASCII * 39 β United Kingdom / Israel * 40 β United Kingdom * 251 β China * 252 β Poland * 254 β Hungary * 256 β International #1 (superseded by 500) * 257 β International #2 * 258 β International #3 * 259 β Symbols, Set 7 * 260 β Canadian French - 116 * 264 β Print Train & Text processing extended * 273 β Germany F.R./Austria - CECP (same with euro: 1141) * 274 β Old Belgium Code Page * 275 β Brazil - CECP * 276 β Canada (French) - 94 * 277 β Denmark, Norway - CECP (same with euro: 1142) * 278 β Finland, Sweden - CECP (same with euro: 1143) * 279 β French - 94<ref name="IBM_2010"/> * 280 β Italy - CECP (same with euro: 1144) * 281 β Japan (Latin) - CECP * 282 β Portugal - CECP * 283 β Spain - 190<ref name="IBM_2010"/> * 284 β Spain/Latin America - CECP (same with euro: 1145) * 285 β United Kingdom - CECP (same with euro: 1146) * 286 β Austria / Germany F.R. Alternate * 287 β Denmark / Norway Alternate * 288 β Finland / Sweden Alternate * 289 β Spain Alternate * [[Code page 290|290]] β Japanese (Katakana) Extended * [[Code page 293|293]] β APL * 297 β France (same with euro: 1147)<ref name="IBM_2010"/> * [[Code page 298|298]] β Japan (Katakana) * [[Code page 300|300]] β Japan (Kanji) DBCS (For JIS X 0213) * [[Code page 310|310]] β Graphic Escape APL/TN * 320 β Hungary * 321 β Yugoslavia * 322 β Turkey * 330 β International #4 * 340 β EBCDIC, OCR (same as 893, superseded by 892 and 893) * [[Code page 351|351]] β GDDM default * 352 β Printing and publishing option * [[Code page 353|353]] β BCDIC-A * [[Code page 354|354]] β BCDIC-B * [[Code page 355|355]] β PTTC/BCD standard option * [[Code page 357|357]] β PTTC/BCD H option * [[Code page 358|358]] β PTTC/BCD Correspondence option * [[Code page 359|359]] β PTTC/BCD Monocase option * [[Code page 360|360]] β PTTC/BCD Duocase option * 361 β EBCDIC Publishing International * 363 β Symbols, set 8 * 382 β EBCDIC Publishing Austria, Germany F.R. Alternate * 383 β EBCDIC Publishing Belgium * 384 β EBCDIC Publishing Brazil * 385 β EBCDIC Publishing Canada (French) * 386 β EBCDIC Publishing Denmark, Norway * 387 β EBCDIC Publishing Finland, Sweden * 388 β EBCDIC Publishing France * 389 β EBCDIC Publishing Italy * 390 β EBCDIC Publishing Japan (Latin) * 391 β EBCDIC Publishing Portugal * 392 β EBCDIC Publishing Spain, Philippines * 393 β EBCDIC Publishing Latin America (Spanish Speaking) * 394 β EBCDIC Publishing China (Hong Kong), UK, Ireland * 395 β EBCDIC Publishing Australia, New Zealand, USA, Canada (English) * 396 β BookMaster Specials * 410 β Cyrillic (revisions: 880, 1025, 1154) * 420 β Arabic * 421 β Maghreb/French * 423 β Greek (superseded by 875) * 424 β Hebrew (Bulletin Code) * 425 β Arabic / Latin for OS/390 Open Edition * 435 β Teletext Isomorphic * 500 β International #5 (ECECP; supersedes 256) (same with euro: 1148) * 803 β Hebrew Character Set A (Old Code) * 829 β Host Math Symbols- Publishing * 830 β Math Format * 831 β Portugal (Alternate) (same as 37) * 833 β Korean Extended (SBCS) * 834 β Korean Hangul (KSC5601; DBCS with UDCs) * 835 β Traditional Chinese DBCS * 836 β Simplified Chinese Extended * 837 β Simplified Chinese DBCS * 838 β Thai with Low Marks & Accented Characters (same with euro: 1160) * 839 β Thai DBCS * 870 β Latin 2 (same with euro: 1153) (revision: 1110) * 871 β Iceland (same with euro: 1149)<ref name="IBM_2010"/> * 875 β Greek (supersedes 423) * 880 β Cyrillic (revision of 410) (revisions: 1025, 1154) * 881 β United States - 5080 Graphics System * 882 β United Kingdom - 5080 Graphics System * 883 β Sweden - 5080 Graphics System * 884 β Germany - 5080 Graphics System * 885 β France - 5080 Graphics System * 886 β Italy - 5080 Graphics System * [[Code page 887|887]] β Japan - 5080 Graphics System * 888 β France AZERTY - 5080 Graphics System * 889 β Thailand * 890 β Yugoslavia * 892 β EBCDIC, OCR A * 893 β EBCDIC, OCR B * 905 β Latin 3 * 918 β Urdu Bilingual * 924 β Latin 9 * [[Code page 930|930]] β Japan MIX (290 + 300) (same with euro: 1390) * [[Code page 931|931]] β Japan MIX (37 + 300) * [[Code page 933|933]] β Korea MIX (833 + 834) (same with euro: 1364) * 935 β Simplified Chinese MIX (836 + 837) (same with euro: 1388) * 937 β Traditional Chinese MIX (37 + 835) (same with euro: 1371) * [[Code page 939|939]] β Japan MIX (1027 + 300) (same with euro: 1399) * [[Code page 1001|1001]] β MICR * 1002 β EBCDIC DCF Release 2 Compatibility * 1003 β EBCDIC DCF, US Text subset * 1005 β EBCDIC Isomorphic Text Communication * 1007 β EBCDIC Arabic (XCOM2) * 1024 β EBCDIC T.61 * 1025 β Cyrillic, Multilingual (same with euro: 1154) (Revision of 880) * 1026 β EBCDIC Turkey (Latin 5) (same with euro: 1155) (supersedes 905 in that country) * [[Code page 1027|1027]] β Japanese (Latin) Extended (JIS X 0201 Extended) * 1028 β EBCDIC Publishing Hebrew * [[Code page 1030|1030]] β Japanese (Katakana) Extended * [[Code page 1031|1031]] β Japanese (Latin) Extended * [[Code page 1032|1032]] β MICR, E13-B Combined * [[Code page 1033|1033]] β MICR, CMC-7 Combined * 1037 β Korea - 5080/6090 Graphics System * 1039 β GML Compatibility * 1047 β Latin 1/Open Systems<!-- C/370 variant of codepage 37, which takes into account the encoding of brackets that was inherited from the 3270 display system --><ref name="IBM_2010"/> * 1068 β DCF Compatibility * 1069 β Latin 4 * 1070 β USA / Canada Version 0 (Code page 37 Version 0) * 1071 β Germany F.R. / Austria (Code page 273 Version 0) * 1072 β Belgium (Code page 274 Version 0) * 1073 β Brazil (Code page 275 Version 0) * 1074 β Denmark, Norway (Code page 277 Version 0) * 1075 β Finland, Sweden (Code page 278 Version 0) * 1076 β Italy (Code page 280 Version 0) * 1077 β Japan (Latin) (Code page 281 Version 0) * 1078 β Portugal (Code page 282 Version 0) * 1079 β Spain / Latin America Version 0 (Code page 284 Version 0) * 1080 β United Kingdom (Code page 285 Version 0) * 1081 β France Version 0 (Code page 297 Version 0) * 1082 β Israel (Hebrew) * 1083 β Israel (Hebrew) * 1084 β International#5 Version 0 ([[Code page 500]] Version 0) * 1085 β Iceland (Code page 871 Version 0) * 1087 β Symbol Set * 1091 β Modified Symbols, Set 7 * {{anchor|CP1093}} 1093 β IBM Logo<ref>{{cite web|title=Code Page CPGID 01093 (pdf)|archive-url=https://web.archive.org/web/20150708031831/http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp01093z.pdf|archive-date=2015-07-08|url=http://www-03.ibm.com/systems/resources/systems_i_software_globalization_pdf_cp01093z.pdf}}</ref> * 1097 β Farsi Bilingual * 1110 β Latin 2 (Revision of 870) * 1112 β Baltic Multilingual (same with euro: 1156) * 1113 β Latin 6 * 1122 β Estonia (same with euro: 1157) * 1123 β Cyrillic, Ukraine (same with euro: 1158) * 1130 β Vietnamese (same with euro: 1164) * 1132 β Lao EBCDIC * [[Code page 1136|1136]] β Hitachi Katakana * 1137 β Devanagari EBCDIC * 1140 β USA, Canada, etc. ECECP (same without euro: 37) (Traditional Chinese version: 1159) * 1141 β Austria, Germany ECECP (same without euro: 273) * 1142 β Denmark, Norway ECECP (same without euro: 277) * 1143 β Finland, Sweden ECECP (same without euro: 278) * 1144 β Italy ECECP (same without euro: 280) * 1145 β Spain, Latin America (Spanish) ECECP (same without euro: 284) * 1146 β UK ECECP (same without euro: 285) * 1147 β France ECECP with euro (same without euro: 297) * 1148 β International ECECP with euro (same without euro: 500) * 1149 β Icelandic ECECP with euro (same without euro: 871) * 1150 β Korean Extended with box characters * 1151 β Simplified Chinese Extended with box characters * 1152 β Traditional Chinese Extended with box characters * 1153 β Latin 2 Multilingual with euro (same without euro: 870) * 1154 β Cyrillic, Multilingual with euro (same without euro: 1025; an older version is * 1166) * 1155 β Turkey with euro (same without euro: 1026) (same with lira: 1175) * 1156 β Baltic Multi with euro (same without euro: 1112) * 1157 β Estonia with euro (same without euro: 1122) * 1158 β Cyrillic, Ukraine with euro (same without euro: 1123) * 1159 β T-Chinese EBCDIC (Traditional Chinese euro update of * 1140) * 1160 β Thai with Low Marks & Accented Characters with euro (same without euro: 838) * 1164 β Vietnamese with euro (same without euro: 1130) * 1165 β Latin 2/Open Systems * 1166 β Cyrillic Kazakh * 1175 β Turkey with euro and lira (same without lira: 1155) * 1278 β EBCDIC Adobe (PostScript) Standard Encoding * 1279 β Hitachi Japanese Katakana Host<ref name="Paul_2001_CODEPAGE"/> * 1300 β Generic Bar Code/OCR-B * 1301 β Zip + 4 POSTNET Bar Code * 1302 β Facing Identification Marks * 1303 β EBCDIC Bar Code * [[Code page 1364|1364]] β Korea MIX (833 + 834 + euro) (same without euro: 933) * 1371 β Traditional Chinese MIX (1159 + 835) (same without euro: 937) * 1376 β Traditional Chinese DBCS Host extension for HKSCS * 1377 β Mixed Host HKSCS Growing (37 + 1376) * 1378 β Traditional Chinese DBCS Host extension for HKSCS and Simplified Chinese (superset of 1376) * 1379 β Mixed Host HKSCS and Simplified Chinese Growingβ(37 + 1378) (superset of 1377) * 1388 β Simplified Chinese MIX (same without euro: 935) (836 + 837 + euro) * [[Code page 1390|1390]] β Simplified Chinese MIX Japan MIX (same without euro: 930) (290 + 300 + euro) * [[Code page 1399|1399]] β Japan MIX (1027 + 300 + euro) (same without euro: 939) {{div col end}} === DOS code pages === These code pages are used by IBM in its [[PC DOS]] operating system. These code pages were originally embedded directly in the [[text mode]] hardware of the graphic adapters used with the [[IBM PC]] and its clones, including the original MDA and CGA adapters whose character sets could only be changed by physically replacing a ROM chip that contained the font. The interface of those adapters (emulated by all later adapters such as VGA) was typically limited to single byte character sets with only 256 characters in each font/encoding (although VGA added partial support for slightly larger character sets). {{Div col|colwidth=30em}} * [[Code page 301|301]] β IBM-PC Japan (Kanji) DBCS * [[Code page 437|437]] β Original IBM PC hardware code page * 720 β Arabic (Transparent ASMO) * [[Code page 737|737]] β [[Greek language|Greek]] * 775 β Latin-7 * [[Code page 808|808]] β Russian with euro (same without euro: [[Code page 866|866]]) * [[Code page 848|848]] β Ukrainian with euro (same without euro: [[Code page 1125|1125]]) * [[Code page 849|849]] β Belarusian with euro (same without euro: [[Code page 1131|1131]]) * [[Code page 850|850]] β Latin-1 * 851 β Greek * 852 β Latin-2 * 853 β Latin-3 * 855 β [[Cyrillic script|Cyrillic]] (same with euro: 872) * 856 β [[Hebrew alphabet|Hebrew]] * 857 β Latin-5 * [[Code page 858|858]] β Latin-1 with [[euro]] symbol * 859 β Latin-9 * 860 β [[Portuguese language|Portuguese]] * [[Code page 861|861]] β [[Icelandic language|Icelandic]] * [[Code page 862|862]] β [[Hebrew language|Hebrew]] * [[Code page 863|863]] β [[French language|Canadian French]] * [[Code page 864|864]] β [[Arabic language|Arabic]] * [[Code page 865|865]] β [[Danish language|Danish]]/[[Norwegian language|Norwegian]] * [[Code page 866|866]] β Belarusian, Russian, Ukrainian (same with euro: [[Code page 808|808]]) * [[Code page 867|867]] β [[Hebrew language|Hebrew]] + euro (based on CP862) (conflictive ID: NEC [[KamenickΓ½ encoding|Czech (KamenickΓ½)]], which was created before this codepage) * [[Code page 868|868]] β [[Urdu language|Urdu]] * [[Code page 869|869]] β [[Greek alphabet|Greek]] * 872 β Cyrillic with euro (same without euro: 855) * [[Code page 874|874]] β Thai with Low Tone Marks & Ancient Chars (conflictive ID with Windows 874; version with euro: [[Code page 1161|1161]] Windows version: is IBM [[Code page 1162|1162]])<!-- Attention! Neither IBM 874 nor Windows 874 are rigorously the same as ISO 8859-11 / TIS 620-2533 ISO 8859-11 is probably IBM 873--> * 876 β OCR A * 877 β OCR B * [[Code page 878|878]] β [[KOI8-R]] * [[Code page 891|891]] β Korean PC SBCS * 898 β IBM-PC WP Multilingual * [[Code page 899|899]] β IBM-PC Symbol * [[Code page 903|903]] β Simplified Chinese PC SBCS * [[Code page 904|904]] β Traditional Chinese PC SBCS * 906 β International Set #5 3812/3820 * [[Code page 907|907]] β ASCII APL (3812) * [[Code page 909|909]] β IBM-PC APL2 Extended * [[Code page 910|910]] β IBM-PC APL2 * [[Code page 911|911]] β IBM-PC Japan #1 * [[Code page 926|926]] β Korean PC DBCS * [[Code page 927|927]] β Traditional Chinese PC DBCS * [[Code page 928|928]] β Simplified Chinese PC DBCS * [[Code page 929|929]] β Thai PC DBCS * [[Code page 932 (IBM)|932]] β IBM-PC Japan MIX (DOS/V) (DBCS) ([[Code page 897|897]] + [[Code page 301|301]]) (conflictive ID with Windows 932; Windows version is IBM 943) * [[Code page 934|934]] β IBM-PC Korea MIX (DOS/V) (DBCS) ([[Code page 891|891]] + [[Code page 926|926]]) * [[Code page 936 (IBM)|936]] β IBM-PC Simplified Chinese MIX (gb2312) (DOS/V) (DBCS) ([[Code page 903|903]] + [[Code page 928|928]]) (conflictive ID with Windows 936; Windows version is IBM 1386) * [[Code page 938|938]] β IBM-PC Traditional Chinese MIX (DOS/V, OS/2) ([[Code page 904|904]] + [[Code page 927|927]]) * [[Code page 942|942]] β IBM-PC Japan MIX (Japanese SAA (OS/2)) ([[Code page 1041|1041]] + [[Code page 301|301]]) * [[Code page 943|943]] β IBM-PC Japan OPEN ([[Code page 897|897]] + [[Code page 941|941]]) (Windows CP 932) * [[Code page 944|944]] β IBM-PC Korea MIX (Korean SAA (OS/2)) ([[Code page 1040|1040]] + [[Code page 926|926]]) * [[Code page 946|946]] β IBM-PC Simplified Chinese (Simplified Chinese SAA (OS/2)) ([[Code page 1042|1042]] + [[Code page 928|928]]) * [[Code page 948|948]] β IBM-PC Traditional Chinese (Traditional Chinese SAA (OS/2)) ([[Code page 1043|1043]] + [[Code page 927|927]]) * [[Code page 949 (IBM)|949]] β Korean (Extended Wansung (ks_c_5601-1987)) ([[Code page 1088|1088]] + [[Code page 951|951]]) (conflictive ID with Windows 949 (Unified Hangul Code); Windows version is IBM 1363) * [[Code page 951|951]] β Korean DBCS (IBM KS Code) (conflictive ID with Windows 951, a hack of Windows 950 with Unicode mappings for some PUA Unicode characters found in HKSCS, based on the file name) * 1034 β Printer Application - Shipping Label, Set #2 * [[Code page 1040|1040]] β Korean Extended * [[Code page 1041|1041]] β Japanese Extended (JIS X 0201 Extended) * [[Code page 1042|1042]] β Simplified Chinese Extended * [[Code page 1043|1043]] β Traditional Chinese Extended * 1044 β Printer Application - Shipping Label, Set #1 * [[Code page 1086|1086]] β IBM-PC Japan #1 * [[Code page 1088|1088]] β Revised Korean (SBCS) * [[Code page 1092|1092]] β IBM-PC Modified Symbols * [[Code page 1098|1098]] β [[Persian language|Farsi]] * 1108 β DITROFF Base Compatibility * 1109 β DITROFF Specials Compatibility * [[Code page 1115|1115]] β IBM-PC People's Republic of China * [[Code page 1116|1116]] β Estonian * [[Code page 1117|1117]] β Latvian * [[Code page 1118|1118]] β Lithuanian (IBM's implementation of Lika's [[code page 774]]) * [[Code page 1119|1119]] β Lithuanian and Russian (IBM's implementation of Lika's [[code page 772]]) * [[Code page 1125|1125]] β Cyrillic, Ukrainian (same with euro: [[Code page 848|848]]) (IBM modification of [[RUSCII character set|RUSCII]]) * [[Code page 1127|1127]] β IBM-PC Arabic / French * [[Code page 1131|1131]] β IBM-PC Data, Cyrillic, Belarusian (same with euro: [[Code page 849|849]]) * [[Code page 1139|1139]] β Japan Alphanumeric Katakana * [[Code page 1161|1161]] β Thai with Low Tone Marks & Ancient Chars with euro (same without euro: [[Code page 874|874]]) * [[Code page 1167|1167]] β [[KOI8-RU]] * [[Code page 1168|1168]] β [[KOI8-U]] * [[Code page 1370|1370]] β Traditional Chinese MIX ([[Big5|Big5 encoding]]) ([[Code page 1114|1114]] + [[Code page 947|947]] + euro) (same without euro: [[Code page 950|950]]) * [[Code page 1380|1380]] β IBM-PC Simplified Chinese GB PC-DATA (DBCS PC IBM GB 2312-80) * [[Code page 1381|1381]] β IBM-PC Simplified Chinese ([[Code page 1115|1115]] + [[Code page 1380|1380]]) * [[Code page 1393|1393]] β Japanese JIS X 0213 DBCS * [[Code page 1394|1394]] β IBM-PC Japan (JIS X 0213) ([[Code page 897|897]] + [[Code page 1393|1393]]) {{div col end}} When dealing with older hardware, protocols and file formats, it is often necessary to support these code pages, but newer encoding systems, in particular Unicode, are encouraged for new designs. DOS code pages are typically stored in .CPI files.<ref name="Paul_2001_CPI"/><ref name="Elliott_2006_CPI"/><ref name="AEB_2001_CPI"/><ref name="Haralambous_2007_CPI"/><ref name="MS_1991_PR"/> === IBM AIX code pages === These code pages are used by IBM in its [[AIX]] operating system. They emulate several character sets, namely those ones designed to be used accordingly to ISO, such as UNIX-like operating systems. {{Div col|colwidth=30em}} * [[Code page 367|367]] β 7-bit US-ASCII * [[Code page 371|371]] β 7-bit US-ASCII APL * [[Code page 806|806]] β ISCII * [[Code page 813|813]] β [[ISO 8859-7]] * [[Code page 819|819]] β [[ISO 8859-1]] * [[Code page 895|895]] β 7-bit Japan Latin * [[Code page 896|896]] β 7-bit Japan Katakana Extended * [[Code page 901|901]] β [[ISO 8859-13]] with euro (later extended) (same without euro: [[Code page 921|921]]) * [[Code page 902|902]] β ISO Estonian with euro (same without euro: [[Code page 922|922]]) * [[Code page 912|912]] β [[ISO 8859-2]] (extended in 1999) * [[Code page 913|913]] β [[ISO 8859-3]] * [[Code page 914|914]] β [[ISO 8859-4]] * [[Code page 915|915]] β [[ISO 8859-5]] (extended after 1995) * [[Code page 916|916]] β [[ISO 8859-8]] * [[Code page 919|919]] β [[ISO 8859-10]] * [[Code page 920|920]] β [[ISO 8859-9]] * [[Code page 921|921]] β [[ISO 8859-13]] (extended after 1995) (same with euro: [[Code page 901|901]]) * [[Code page 922|922]] β ISO Estonian (same with euro: [[Code page 902|902]]) * [[Code page 923|923]] β [[ISO 8859-15]] * [[Code page 952|952]] β EUC Japanese for JIS X 0208 * [[Code page 953|953]] β EUC Japanese for JIS X 0212 * [[Code page 954|954]] β EUC Japanese ([[Code page 895|895]] + [[Code page 952|952]] + [[Code page 896|896]] + [[Code page 953|953]]) * [[Code page 955|955]] β TCP Japanese, JIS X 0208-1978 * [[Code page 956|956]] β TCP Japanese ([[Code page 895|895]] + [[Code page 952|952]] + [[Code page 896|896]] + [[Code page 953|953]]) * [[Code page 957|957]] β TCP Japanese ([[Code page 895|895]] + [[Code page 955|955]] + [[Code page 896|896]] + [[Code page 953|953]]) * [[Code page 958|958]] β TCP Japanese ([[Code page 367|367]] + [[Code page 952|952]] + [[Code page 896|896]] + [[Code page 953|953]]) * [[Code page 959|959]] β TCP Japanese ([[Code page 367|367]] + [[Code page 955|955]] + [[Code page 896|896]] + [[Code page 953|953]]) * [[Code page 960|960]] β Traditional Chinese DBCS-EUC SICGCC Primary Set (1st plane) * [[Code page 961|961]] β Traditional Chinese DBCS-EUC SICGCC Full Set + IBM Select + UDC * [[Code page 963|963]] β Traditional Chinese TCP, CNS 11643 plane 2 only * [[Code page 964|964]] β EUC Traditional Chinese ([[Code page 367|367]] + [[Code page 960|960]] + [[Code page 961|961]]) * [[Code page 965|965]] β TCP Traditional Chinese ([[Code page 367|367]] + [[Code page 960|960]] + [[Code page 963|963]]) * [[Code page 970|970]] β EUC Korean ([[Code page 367|367]] + [[Code page 971|971]]) * [[Code page 971|971]] β EUC Korean DBCS (G1, KSC 5601 1989 (including 188 UDC)) * [[Code page 1006|1006]] β ISO 8-bit Urdu * [[Code page 1008|1008]] β ISO 8-bit Arabic * [[Code page 1009|1009]] β 7-bit ISO IRV * [[Code page 1010|1010]] β 7-bit France * [[Code page 1011|1011]] β 7-bit Germany F.R. * [[Code page 1012|1012]] β 7-bit Italy * [[Code page 1013|1013]] β 7-bit United Kingdom * [[Code page 1014|1014]] β 7-bit Spain * [[Code page 1015|1015]] β 7-bit Portugal * [[Code page 1016|1016]] β 7-bit Norway * [[Code page 1017|1017]] β 7-bit Denmark * [[Code page 1018|1018]] β 7-bit Finland/Sweden * [[Code page 1019|1019]] β 7-bit Netherlands * [[Code page 1029|1029]] β Arabic Extended * [[Code page 1036|1036]] β CCITT T.61 * [[Code page 1046|1046]] β Arabic Extended (Euro) * [[Code page 1089|1089]] β [[ISO 8859-6]] * [[Code page 1111|1111]] β Variant of [[ISO 8859-2]] * [[Code page 1124|1124]] β ISO Ukrainian, similar to [[ISO 8859-5]] * [[Code page 1129|1129]] β ISO Vietnamese (same with euro: [[Code page 1163|1163]]) * [[Code page 1133|1133]] β ISO Lao * [[Code page 1163|1163]] β ISO Vietnamese with euro (same without euro: [[Code page 1129|1129]]) * [[Code page 1350|1350]] β EUC Japanese (JISeucJP) ([[Code page 367|367]] + [[Code page 952|952]] + [[Code page 896|896]] + [[Code page 953|953]]) * [[Code page 1382|1382]] β EUC Simplified Chinese (DBCS PC GB 2312-80) * [[Code page 1383|1383]] β EUC Simplified Chinese ([[Code page 367|367]] + [[Code page 1382|1382]]) {{div col end}} Code page 819 is identical to Latin-1, [[ISO/IEC 8859-1]], and with slightly-modified commands, permits MS-DOS machines to use that encoding. It was used with IBM AS/400 minicomputers. === IBM OS/2 code pages === These code pages are used by IBM in its [[DOS#DOS under OS/2 and Windows|OS/2]] operating system. * [[Code page 1004|1004]] β Latin-1 Extended, Desk Top Publishing/Windows<ref name="IBM_CP1004"/> === Windows emulation code pages === These code pages are used by IBM when emulating the [[Microsoft Windows]] character sets. Most of these code pages have the same number as Microsoft code pages, although they are not '''exactly''' identical. Some code pages, though, are new from IBM, not devised by Microsoft. {{Div col|colwidth=30em}} * [[Code page 897|897]] β IBM-PC SBCS Japanese (JIS X 0201-1976) * [[Code page 941|941]] β IBM-PC Japanese DBCS for Open environment * [[Code page 947|947]] β IBM-PC DBCS for ([[Big5|Big5 encoding]]) * [[Code page 950|950]] β Traditional Chinese MIX ([[Big5|Big5 encoding]]) ([[Code page 1114|1114]] + [[Code page 947|947]]) (same with euro: [[Code page 1370|1370]]) * [[Code page 1114|1114]] β IBM-PC SBCS (Simplified Chinese; [[GBK (character encoding)|GBK]]; Traditional Chinese; [[Big5|Big5 encoding]]) * [[Code page 1126|1126]] β IBM-PC Korean SBCS * [[Code page 1162|1162]] β Windows Thai (Extension of [[Code page 874|874]]; but still called that in Windows) * 1169 β Windows Cyrillic Asian * [[Code page 1174|1174]] β Windows Kazakh<ref name="Kazakh_1174"/> * [[Code page 1250|1250]] β Windows [[Central Europe]] * [[Code page 1251|1251]] β Windows [[Cyrillic script|Cyrillic]] * [[Code page 1252|1252]] β Windows [[Western Europe|Western]] * [[Code page 1253|1253]] β Windows [[Greek alphabet|Greek]] * [[Code page 1254|1254]] β Windows [[Turkish alphabet|Turkish]] * [[Code page 1255|1255]] β Windows [[Hebrew alphabet|Hebrew]] * [[Code page 1256|1256]] β Windows [[Arabic alphabet|Arabic]] * [[Code page 1257|1257]] β Windows [[Baltic languages|Baltic]] * [[Code page 1258|1258]] β Windows [[Vietnamese alphabet|Vietnamese]] * 1360 β Korean JOHAB DBCS * [[Code page 1361|1361]] β Korean ([[JOHAB]]) * [[Code page 1362|1362]] β Korean Hangul DBCS * [[Code page 1363|1363]] β Windows Korean ([[Code page 1126|1126]] + [[Code page 1362|1362]]) (Windows CP 949) * [[Code page 1372|1372]] β IBM-PC MS T Chinese [[Big5|Big5 encoding]] (Special for DB2) * [[Code page 1373|1373]] β Windows Traditional Chinese (extension of [[Code page 950|950]]) * [[Code page 1374|1374]] β IBM-PC DB [[Big5|Big5 encoding]] extension for HKSCS * [[Code page 1375|1375]] β Mixed [[Big5|Big5 encoding]] extension for HKSCS (intended to match [[Code page 950|950]]) * [[Code page 1385|1385]] β IBM-PC Simplified Chinese DBCS (Growing CS for GB18030, also used for GBK PC-DATA.) * [[Code page 1386|1386]] β IBM-PC Simplified Chinese GBK ([[Code page 1114|1114]] + [[Code page 1385|1385]]) (Windows CP 936) * [[Code page 1391|1391]] β Simplified Chinese 4 Byte (Growing CS for GB18030, also used for GBK PC-DATA.) * [[Code page 1392|1392]] β IBM-PC Simplified Chinese MIX ([[Code page 1252|1252]] + [[Code page 1385|1385]] + [[Code page 1391|1391]]) {{div col end}} === Macintosh emulation code pages === These code pages are used by IBM when emulating the Apple [[Mac (computer)|Macintosh]] character sets. {{Div col|colwidth=30em}} * [[Code page 1275|1275]] β Apple Roman * [[Code page 1280|1280]] β Apple Greek * [[Code page 1281|1281]] β Apple Turkish * [[Code page 1282|1282]] β Apple Central European * [[Code page 1283|1283]] β Apple Cyrillic * [[Code page 1284|1284]] β Apple Croatian * [[Code page 1285|1285]] β Apple Romanian * [[Code page 1286|1286]] β Apple Icelandic {{div col end}} === Adobe emulation code pages === These code pages are used by IBM when emulating the [[Adobe Systems|Adobe]] character sets. {{Div col|colwidth=30em}} * [[Code page 1038|1038]] β Adobe Symbol Encoding * [[Code page 1276|1276]] β Adobe (PostScript) Standard Encoding * [[Code page 1277|1277]] β Adobe (PostScript) Latin 1 {{div col end}} === HP emulation code pages === These code pages are used by IBM when emulating the [[Hewlett-Packard|HP]] character sets. {{Div col|colwidth=30em}} * [[Code page 1050|1050]] β HP Roman Extension * [[Code page 1051|1051]] β HP Roman-8 * [[Code page 1052|1052]] β HP Gothic Legal * [[Code page 1053|1053]] β HP Gothic-1 (almost the same as [[ISO 8859-1]]) * [[Code page 1054|1054]] β HP ASCII * [[Code page 1055|1055]] β HP PC-Line * [[Code page 1056|1056]] β HP Line Draw * 1057 β HP PC-8 (almost the same as [[code page 437]]) * [[Code page 1058|1058]] β HP PC-8DN ('''not''' the same as [[code page 865]]) * [[Code page 1351|1351]] β Japanese DBCS HP character set * [[Code page 5039|5039]] β Japanese MIX ([[Code page 1041|1041]] + [[Code page 1351|1351]]) {{div col end}} === DEC emulation code pages === These code pages are used by IBM when emulating the [[Digital Equipment Corporation|DEC]] character sets. {{Div col|colwidth=30em}} * [[Code page 1020|1020]] β 7-bit Canadian (French) NRC Set * [[Code page 1021|1021]] β 7-bit Switzerland NRC Set * [[Code page 1023|1023]] β 7-bit Spanish NRC Set * [[Code page 1090|1090]] β Special Characters and Line Drawing Set * [[Multinational Character Set|1100]] β DEC Multinational * [[Code page 1101|1101]] β 7-bit British NRC Set * [[Code page 1102|1102]] β 7-bit Dutch NRC Set * [[Code page 1103|1103]] β 7-bit Finnish NRC Set * [[Code page 1104|1104]] β 7-bit French NRC Set * [[Code page 1105|1105]] β 7-bit Norwegian/Danish NRC Set * [[Code page 1106|1106]] β 7-bit Swedish NRC Set * [[Code page 1107|1107]] β 7-bit Norwegian/Danish NRC Alternate * [[Code Page 1287|1287]] β DEC Greek * [[Code Page 1288|1288]] β DEC Turkish {{div col end}} === IBM Unicode code pages === {{Div col|colwidth=30em}} * [[Code page 1200|1200]] β [[UTF-16BE]] Unicode ([[big-endian]]) with IBM [[Private Use Area]] (PUA)<ref name="ism-registered-ccsids">{{cite web|url=http://www-01.ibm.com/software/globalization/ccsid/ccsid_registered.jsp|title=IBM Coded Character Set Identifier (CCSID)|website=IBM|archive-url=https://web.archive.org/web/20091126030417/http://www-01.ibm.com/software/globalization/ccsid/ccsid_registered.jsp|archive-date=2009-11-26}}</ref> * [[Code page 1201|1201]] β [[UTF-16BE]] Unicode ([[big-endian]])<ref name="ism-registered-ccsids"/> * [[Code page 1202|1202]] β [[UTF-16LE]] Unicode ([[little-endian]]) with IBM PUA<ref name="ism-registered-ccsids"/> * [[Code page 1203|1203]] β [[UTF-16LE]] Unicode ([[little-endian]])<ref name="ism-registered-ccsids"/> * [[Code page 1208|1208]] β [[UTF-8]] Unicode with IBM PUA<ref name="ism-registered-ccsids"/> * [[Code page 1209|1209]] β [[UTF-8]] Unicode<ref name="ism-registered-ccsids"/> * [[Code page 1400|1400]] β ISO 10646 UCS-BMP (Based on Unicode 6.0)<ref name="ism-registered-ccsids"/> * [[Code page 1401|1401]] β ISO 10646 UCS-SMP (Based on Unicode 6.0)<ref name="ism-registered-ccsids"/> * [[Code page 1402|1402]] β ISO 10646 UCS-SIP (Based on Unicode 6.0)<ref name="ism-registered-ccsids"/> * [[Code page 1414|1414]] β ISO 10646 UCS-SSP (Based on Unicode 4.0)<ref name="ism-registered-ccsids"/> * [[Code page 1445|1445]] β IBM AFP PUA No. 1 * [[Code page 1446|1446]] β ISO 10646 [[UCS-PUP15]] (Based on Unicode 4.0)<ref name="ism-registered-ccsids"/> * [[Code page 1447|1447]] β ISO 10646 [[UCS-PUP16]] (Based on Unicode 4.0)<ref name="ism-registered-ccsids"/> * [[Code page 1448|1448]] β UCS-BMP (Generic UDC) * [[Code page 1449|1449]] β IBM default PUA {{div col end}} == Microsoft code pages == {{main|Windows code page}} === Windows code pages === These code pages are used by Microsoft in its own Windows operating system. Microsoft defined [[windows code pages|a number of code pages]] known as the ANSI code pages (as the first one, 1252 was based on an [[apocrypha]]l ANSI draft of what became [[ISO 8859-1]]). Code page 1252 is built on ISO 8859-1 but uses the range 0x80-0x9F for extra printable characters rather than the C1 control codes from [[ISO 6429]] mentioned by ISO 8859-1.<ref>{{cite book|title=ISO/IEC 8859-1:1998(E)|url=https://www.iso.org/standard/28245.html|publisher=[[ISO]]|date=1998-04-15|page=1|quote=The coded characters in this set may be used in conjunction with coded control functions selected from ISO/IEC 6429.|access-date=2020-10-30 |archive-date=2020-10-30 |archive-url=https://web.archive.org/web/20201030180126/https://www.iso.org/standard/28245.html|url-status=live}}</ref> Some of the others are based in part on other parts of [[ISO 8859]] but often rearranged to make them closer to 1252. {{Div col|colwidth=30em}} * 42 β Windows Symbol * [[Code page 1162|874]] β Windows [[Thai script|Thai]] * [[Code page 1250|1250]] β Windows [[Central Europe]] * [[Code page 1251|1251]] β Windows [[Cyrillic script|Cyrillic]] * [[Code page 1252|1252]] β Windows [[Western Europe|Western]] * [[Code page 1253|1253]] β Windows [[Greek alphabet|Greek]] * [[Code page 1254|1254]] β Windows [[Turkish alphabet|Turkish]] * [[Code page 1255|1255]] β Windows [[Hebrew alphabet|Hebrew]] * [[Code page 1256|1256]] β Windows [[Arabic alphabet|Arabic]] * [[Code page 1257|1257]] β Windows [[Baltic languages|Baltic]] * [[Code page 1258|1258]] β Windows [[Vietnamese alphabet|Vietnamese]] {{div col end}} Microsoft recommends new applications use UTF-8 or UCS-2/UTF-16 instead of these code pages.<ref name="Microsoft_Codepages"/> === DBCS code pages === These code pages represent [[double-byte character set|DBCS]] character encodings for various CJK languages. In Microsoft operating systems, these are used as both the "OEM" and "Windows" code page for the applicable locale. {{Div col|colwidth=30em}} * [[Code page 932 (Microsoft Windows)|932]] β Supports [[Japanese writing system|Japanese]] [[Shift-JIS]] * [[Code page 936 (Microsoft Windows)|936]] β Supports [[Simplified Chinese characters|Simplified Chinese]] [[GB2312]] or [[GBK (character encoding)|GBK]] * [[Unified Hangul Code|949]] β Supports [[Hangul|Korean]] Unified Hangul Code * [[Code page 950|950]] β Supports [[Traditional Chinese characters|Traditional Chinese]] [[Big5]] ** [[Code page 950|951]] β Supports [[Traditional Chinese characters|Traditional Chinese]] [[Big5]] with [[HKSCS]] {{div col end}} === MS-DOS code pages === These code pages are used by Microsoft in its MS-DOS operating system. Microsoft refers to these as the OEM code pages because they were defined by the [[original equipment manufacturer]]s who licensed MS-DOS for distribution with their hardware, not by Microsoft or a standards organization. Most of these code pages have the same number as the equivalent IBM code pages, although some are not ''exactly'' identical.<ref name="IBM"/> {{Div col|colwidth=30em}} * 708 β Arabic (ASMO 708) * 720 β Arabic (Transparent ASMO) * [[Code page 737|737]] β [[Greek language|Greek]] * [[Code page 850|850]] β Latin-1 * 851 β Greek * 852 β Latin-2 * 855 β [[Cyrillic script|Cyrillic]] * 857 β Latin-5 * [[Code page 858|858]] β Latin-1 with [[euro]] symbol * 859 β Latin-9 * 860 β [[Portuguese language|Portuguese]] * [[Code page 861|861]] β [[Icelandic language|Icelandic]] * [[Code page 862|862]] β [[Hebrew language|Hebrew]] * [[Code page 863|863]] β [[French language|Canadian French]] * [[Code page 864|864]] β [[Arabic language|Arabic]] * [[Code page 865|865]] β [[Danish language|Danish]]/[[Norwegian language|Norwegian]] * [[Code page 866|866]] β Belarusian, Russian, Ukrainian * [[Code page 869|869]] β [[Greek alphabet|Greek]] {{div col end}} === Macintosh emulation code pages === These code pages are used by Microsoft when emulating the Apple [[Mac (computer)|Macintosh]] character sets. {{Div col|colwidth=30em}} * [[Code page 10000|10000]] - Apple [[Mac-Roman encoding|Macintosh Roman]] * [[Code page 10001|10001]] - Apple Japanese * [[Code page 10002|10002]] - Apple Traditional Chinese (Big5) * [[Code page 10003|10003]] - Apple Korean * [[Code page 10004|10004]] - Apple [[MacArabic encoding|Arabic]] * [[Code page 10005|10005]] - Apple Hebrew * [[Code page 10006|10006]] - Apple [[MacGreek encoding|Greek]] * [[Code page 10007|10007]] - Apple [[MacCyrillic encoding|Macintosh Cyrillic]] * [[Code page 10008|10008]] - Apple Simplified Chinese (GB 2312) * [[Code page 10010|10010]] - Apple [[MacRomanian encoding|Romanian]] * [[Code page 10017|10017]] - Apple [[Macintosh Ukrainian encoding|Ukrainian]] * [[Code page 10021|10021]] - Apple Thai * [[Code page 10029|10029]] - Apple [[Macintosh Central European encoding|Macintosh Central Europe]] * [[Code page 10079|10079]] - Apple [[Mac Icelandic encoding|Icelandic]] * [[Code page 10081|10081]] - Apple [[MacTurkish encoding|Turkish]] * [[Code page 10082|10082]] - Apple [[MacCroatian encoding|Croatian]] {{div col end}} === Various other Microsoft code pages === The following code page numbers are specific to Microsoft Windows. IBM may use different numbers for these code pages. They emulate several character sets, namely those ones designed to be used accordingly to ISO,{{clarify|date=July 2020}} such as UNIX-like operating systems. {{Div col|colwidth=30em}} * 20000 β Traditional Chinese CNS * 20001 β Traditional Chinese TCA * 20002 β Traditional Chinese ETEN * 20003 β Traditional Chinese IBM5500 * 20004 β Traditional Chinese TeleText * 20005 β Traditional Chinese Wang * 20105 β 7-bit [[ITU T.50|IA5]] [[ISO-646-IRV|IRV]]<ref name="MS_2014_CP201xx"/><ref name="WG_CP201xx"/><ref name="WU_CP20105"/><!-- Windows 7 Ultimate, x-IA5 --> ([[CP 1009]]) * 20106 β 7-bit [[ITU T.50|IA5]] German ([[DIN 66003]])<ref name="MS_2014_CP201xx"/><ref name="WG_CP201xx"/><ref name="WU_CP20106"/><!-- Windows 7 Ultimate, x-IA5-German --> * 20107 β 7-bit [[ITU T.50|IA5]] Swedish (SEN 850200 C)<ref name="MS_2014_CP201xx"/><ref name="WG_CP201xx"/><ref name="WU_CP20107"/><!-- Windows 7 Ultimate, x-IA5-Swedish --> * 20108 - 7-bit [[ITU T.50|IA5]] Norwegian (NS 4551-2)<ref name="MS_2014_CP201xx"/><ref name="WG_CP201xx"/><ref name="WU_CP20108"/><!-- Windows 7 Ultimate, x-IA5-Norwegian --> * 20127 β 7-bit [[US-ASCII]]<ref name="MS_2014_CP201xx"/><ref name="WG_CP201xx"/><ref name="WU_CP20127"/><!-- Windows 7 Ultimate --> * 20261 β [[ITU T.61|CCITT T.61]] * 20269 β [[ISO 6937]] * 20273 * 20277 * 20278 * 20284 * 20285 * 20290 - [[Japanese language in EBCDIC]] * 20297 * 20420 * 20423 * 20424 * 20833 * 20838 * 20866 β [[KOI8-R]] * 20871 * 20880 β EBCDIC Cyrillic (880) * 20905 * 20924 * 20932 - [[EUC-JP]] * 20936<!-- IBM Code Page 1381? --> * 20949<!-- IBM Code Page 949? --> * 21025 β EBCDIC Cyrillic (1025) * 21027 * 21866 β [[KOI8-U]] * 28591 β [[ISO-8859-1]] * 28592 β [[ISO-8859-2]] * 28593 β [[ISO-8859-3]] * 28594 β [[ISO-8859-4]] * 28595 β [[ISO-8859-5]] * 28596 β [[ISO-8859-6]] * 28597 β [[ISO-8859-7]] * 28598 β [[ISO-8859-8]] * 28599 β [[ISO-8859-9]] * 28600 β [[ISO-8859-10]] * 28601 β [[ISO-8859-11]] * 28602 β not used (reserved for [[ISO-8859-12]]) * 28603 β [[ISO-8859-13]] * 28604 β [[ISO-8859-14]] * 28605 β [[ISO-8859-15]] * 28606 β [[ISO-8859-16]] * 38596 β [[ISO-8859-6]]<!-- similar to CP28596 --> * 38598 β [[ISO-8859-8]]<!-- similar to CP28598 --> {{div col end}} === Microsoft Unicode code pages === {{Div col|colwidth=30em}} * 1200 β [[UTF-16LE]] Unicode ([[little-endian]]) * 1201 β [[UTF-16BE]] Unicode ([[big-endian]]) * 12000 β [[UTF-32LE]] Unicode ([[little-endian]]) * 12001 β [[UTF-32BE]] Unicode ([[big-endian]]) * 65000 β [[UTF-7]] Unicode * 65001 β [[UTF-8]] Unicode * 65520 β Empty Unicode Plane {{div col end}} == HP Symbol Sets == HP developed a series of Symbol Sets (each with its associated Symbol Set Code) to encode either its own character sets or other vendorsβ character sets. They are normally 7-bit character sets which, when moved to the higher part and associated with the ASCII character set, make up 8-bit character sets. <!-- This list is not exhaustive --> === HP own Symbol Sets === {{Div col|colwidth=30em}} * Symbol Set 0E β HP Roman Extension β 7-bit character set with accented letters (coded by IBM as [[code page 1050]]) * Symbol Set 0G β HP 7-bit German * Symbol Set 0L β HP 7-bit PC Line (coded by IBM as [[code page 1055]]) * Symbol Set 0M β HP Math-7 * Symbol Set 0T β HP Thai-8 * Symbol Set 1S β HP 7-bit Spanish * Symbol Set 1U β HP 7-bit Gothic Legal (coded by IBM as [[code page 1052]]) * Symbol Set 4Q β HP Line Draw (coded by IBM as [[code page 1056]]) * Symbol Set 4U β [[HP Roman#HP Roman-9|HP Roman-9]] β Roman-8 + β¬ * Symbol Set 7J β HP Desktop * Symbol Set 7S β HP 7-bit European Spanish * Symbol Set 8E β HP East-8<!-- Check if my source (http://www.cestina.cz/kodovani/) is not mistaken and that this symbol set really exists --> * Symbol Set 8G β HP Greek-8 (based on IR 088; not on ELOT 927) * Symbol Set 8H β HP Hebrew-8 * Symbol Set 8I β MS LineDraw (ASCII + HP PC Line) * Symbol Set 8K β HP Kana-8 (ASCII + Japanese Katakana) * Symbol Set 8L β HP LineDraw (ASCII + HP Line Draw) * Symbol Set 8M β HP Math-8 (ASCII + HP Math-8) * Symbol Set 8R β HP Cyrillic-8<!-- (ASCII + GOST 13052? KOI-8?) --> * Symbol Set 8S β HP 7-bit Latin American Spanish * Symbol Set 8T β HP Turkish-8 * Symbol Set 8U β [[HP Roman#HP Roman-8|HP Roman-8]] (ASCII + HP Roman Extension; coded by IBM as [[code page 1051]]) * Symbol Set 8V β HP Arabic-8<!-- Contradictory sources about "Arabic-8"; http://h30434.www3.hp.com/t5/Printer-Software-and-Drivers/Arabic-fonts-on-Network-Printers/td-p/2231625 and http://printronix.com/emea/wp-content/uploads/manuals/PTX_PRM_ACA_P8_258187a.pdf --> * Symbol Set 9K β HP Korean-8<!-- (ASCII + Jamo Code Table?) --> * Symbol Set 9T β PC 8T (also known as Code Page 437-T; this is '''not''' code page 857) * Symbol Set 9V β Latin / Arabic for Windows (this is '''not''' [[code page 1256]]) * Symbol Set 11U β PC 8D/N (also known as Code Page 437-N; coded by IBM as [[code page 1058]]; this is '''not''' [[code page 865]]) * Symbol set 14G β PC-8 Greek Alternate (also known as Code Page 437-G; almost the same as [[code page 737]]) * Symbol Set 18K β<!-- HP Korea-15? Korean; based on KS X 1001 --> * Symbol Set 18T β<!-- HP ROC-15? Traditional Chinese; based on Big5 --> * Symbol Set 19C β<!-- HP PRC-15? Simplified Chinese; based on GB 2312 --> * Symbol Set 19K β<!-- HP Japan-15? Japanese; based on Shift-JIS; coded by IBM as [[code page 5039]] --> {{div col end}} === Symbol Sets from other vendors === {{Div col|colwidth=30em}} * Symbol Set 0D β ISO 60: 7-bit Norwegian * Symbol Set 0F β ISO 25: 7-bit French * Symbol Set 0H β HP 7-bit Hebrew β Practically the same as Israeli Standard [[SI 960]] * Symbol Set 0I β ISO 15: 7-bit Italian * Symbol Set 0K β ISO 14: 7-bit Japanese Katakana * Symbol Set 0N β ISO 8859-1 Latin 1 (Initially called "Gothic-1"; coded by IBM as code page 1053) * Symbol Set 0R β ISO 8859-5 Latin/Cyrillic (1986 version β IR 111) * Symbol Set 0S β ISO 11: 7-bit Swedish * Symbol Set 0U β ISO 6: 7-bit U.S. * Symbol Set 0V β Arabic<!-- 7-bit or 8-bit? ASMO 449? --> * Symbol Set 1D β ISO 61: 7-bit Norwegian * Symbol Set 1E β ISO 4: 7-bit U. K. * Symbol Set 1F β ISO 69: 7-bit French * Symbol Set 1G β ISO 21: 7-bit German * Symbol Set 1K β ISO 13: 7-bit Japanese Latin * Symbol Set 1T β Windows Thai (Practically the same as [[code page 1162|874]]) * Symbol Set 2K β ISO 57: 7-bit Simplified Chinese Latin * Symbol Set 2N β ISO 8859-2 Latin 2 * Symbol Set 2S β ISO 17: 7-bit Spanish * Symbol Set 2U β ISO 2: 7-bit International Reference Version * Symbol Set 3N β ISO 8859-3 Latin 3 * Symbol Set 3R β PC-866 Russia (Practically the same as [[code page 866]]) * Symbol Set 3S β ISO 10: 7-bit Swedish * Symbol Set 4N β ISO 8859-4 Latin 4 * Symbol Set 4S β ISO 16: 7-bit Portuguese * Symbol Set 5M β PS Math Symbol (Practically the same as [[Adobe Symbols]]) * Symbol Set 5N β ISO 8859-9 Latin 5 * Symbol Set 5S β ISO 84: 7-bit Portuguese * Symbol Set 5T β Windows 3.1 Latin-5 (Practically the same as [[code page 1254]]) * Symbol Set 6J β Microsoft Publishing * Symbol Set 6M β Ventura Math * Symbol Set 6N β ISO 8859-10 Latin 6 * Symbol Set 6S β ISO 85: 7-bit Spanish * Symbol Set 7H β ISO 8859-8 Latin/Hebrew * Symbol Set 9E β Windows 3.1 Latin 2 (Practically the same as [[code page 1250]]) * Symbol Set 9G β Windows 98 Greek (Practically the same as [[code page 1253]]) * Symbol Set 9J β PC 1004 * Symbol Set 9L β Ventura ITC Zapf Dingbats * Symbol Set 9N β ISO 8859-15 Latin 9 * Symbol Set 9R β Windows 98 Cyrillic (Practically the same as [[code page 1251]]) * Symbol Set 9U β Windows 3.0 * Symbol Set 10G β PC-851 Latin/Greek (Practically the same as code page 851) * Symbol Set 10J β PS Text (Practically the same as [[PostScript Standard Encoding|Adobe Standard]]) * Symbol Set 10L β PS ITC Zapf Dingbats (Practically the same as Adobe Dingbats) * Symbol Set 10N β ISO 8859-5 Latin/Cyrillic (1988 version β IR 144) * Symbol Set 10R β PC-855 Cyrillic (Practically the same as code page 855) * Symbol Set 10T β Teletex<!-- (CCITT T.61?) --> * Symbol Set 10U β PC-8 (Practically the same as [[code page 437]]; coded by IBM as code page 1057) * Symbol Set 10V β CP-864 (Practically the same as [[code page 864]]) * Symbol Set 11G β CP-869 (Practically the same as [[code page 869]]) * Symbol Set 11J β PS ISO Latin-1 (Practically the same as Adobe Latin-1) * Symbol Set 11N β ISO 8859-6 Latin/Arabic * Symbol Set 12G β PC Latin/Greek (Practically the same as [[code page 737]]) * Symbol Set 12J β MC Text (Practically the same as [[Mac OS Roman|Macintosh Roman]]) * Symbol Set 12N β ISO 8859-7 Latin/Greek * Symbol Set 12R β PC Gost (Practically the same as PC GOST Main) * Symbol Set 12U β PC-850 Latin 1 (Practically the same as [[code page 850]]) * Symbol Set 13J β Ventura International * Symbol Set 13R β PC Bulgarian (Practically the same as [[MIK (character set)|MIK]]) * Symbol Set 13U β PC-858 Latin 1 + β¬ (Practically the same as [[code page 858]]) * Symbol Set 14J β Ventura U. S. * Symbol Set 14L β Windows Dingbats * Symbol Set 14P β ABICOMP International (Practically the same as [[ABICOMP character set|ABICOMP]]) * Symbol Set 14R β PC Ukrainian (Practically the same as [[RUSCII]]) * Symbol Set 15H β PC-862 Israel (Practically the same as [[code page 862]]) * Symbol Set 16U β PC-857 Latin 5 (Practically the same as code page 857) * Symbol Set 17U β PC-852 Latin 2 (Practically the same as code page 852) * Symbol Set 18N β [[UTF-8]] * Symbol Set 18U β PC-853 Latin 3 (Practically the same as code page 853) * Symbol Set 19L β Windows 98 Baltic (Practically the same as [[code page 1257]]) * Symbol Set 19M β Windows Symbol * Symbol Set 19U β Windows 3.1 Latin 1 (Practically the same as [[code page 1252]]) * Symbol Set 20U β PC-860 Portugal (Practically the same as code page 860) * Symbol Set 21U β PC-861 Iceland (Practically the same as [[code page 861]]) * Symbol Set 23U β PC-863 Canada - French (Practically the same as [[code page 863]]) * Symbol Set 24Q β PC-Polish Mazowia (Practically the same as [[Mazovia encoding]]) * Symbol Set 25U β PC-865 Denmark/Norway (Practically the same as [[code page 865]]) * Symbol Set 26U β PC-775 Latin 7 (Practically the same as code page 775) * Symbol Set 27Q β PC-8 PC Nova (Practically the same as [PC Nova) * Symbol Set 27U β PC Latvian Russian (also known as 866-Latvian) * Symbol Set 28U β PC Lithuanian/Russian (Practically the same as [[code page 774]]) * Symbol Set 29U β PC-772 Lithuanian/Russian (Practically the same as [[code page 772]]) {{div col end}} == Code pages from other vendors == These code pages are independent assignments by third party vendors. Since the original IBM PC code page ([[code page 437|number 437]]) was not really designed for international use, several partially compatible country or region specific variants emerged. These code pages number assignments are '''not official''' neither by IBM, neither by Microsoft and almost none of them is referred as a usable character set by IANA. The numbers assigned to these code pages are arbitrary and may clash to registered numbers in use by IBM or Microsoft. Some of them may predate codepage switching being added in DOS 3.3. {{Div col|colwidth=30em}} * [[Code page 100|100]] β DOS Hebrew hardware fontpage (Not from IBM; [[Hebrew MS-DOS|HDOS]])<ref name="Paul_2002"/> * 111 β DOS Greek (Not from IBM; [[AST Premium Exec DOS 5.0]]<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) * 112 β DOS Turkish (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) * 113 β DOS Yugoslavian (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) * 151 β DOS Nafitha Arabic (Not from IBM; [[Arabic MS-DOS|ADOS]])<!-- EPROM fontpage --> * 152 β DOS Nafitha Arabic (Not from IBM; [[Arabic MS-DOS|ADOS]])<!-- EPROM fontpage --> * 161 β DOS [[Arabic language|Arabic]] (Not from IBM; [[Arabic MS-DOS|ADOS]])<ref name="Paul_2002"/> * 162 β DOS Arabic with vowel diacritics (Not from IBM; ADOS) * 163 β DOS Arabic and French (Not from IBM; ADOS)<ref name="Paul_2002"/> * 164 β DOS Arabic and French with vowel diacritics (Not from IBM; ADOS) * [[Code page 165|165]] β DOS Arabic (864 Extended) (Not from IBM; ADOS)<ref name="Paul_2002"/> * 166 β IBM Arabic PC (ADOS)<!-- hardware fontpage --><ref name="Paul_2002"/> * [[Code page 437|190]] β DEC DOS German (appears to be identical to Code page 437) * [[Code page 210|210]] β DEC DOS Greek (NEC Jetmate printers) * 220 β DEC DOS Spanish (Not from IBM) * 489 β Czechoslovakian [OCR software 1993] * [[Code page 620|620]] β DOS [[Mazovia encoding|Polish (Mazovia)]] (Not from IBM)<!-- Fido Mazowia? Variant with characters "Δ" and "Δ" in positions 80 and 87? --> * [[Code page 667|667]] β DOS [[Mazovia encoding|Polish (Mazovia)]] (Not from IBM) * 668 β DOS Polish<!--Different than Mazovia! --> (Not from IBM) * 706 β MS-DOS Server Arabic Sakhr (Not from IBM; [[Sakhr Computers|Sakhr Software]] from [[MSX]] Computers)<!--Not to be confused with Arabic Sakr, below. --> * 707 β MS-DOS Arabic Sakhr (Not from IBM; [[Sakhr Computers|Sakhr Software]] from [[MSX]] Computers)<!--Not to be confused with Arabic Sakr, below. --> * 709 β MS-DOS Arabic ([[Code page ASMO449+|ASMO 449+]]/BCON V4)<!-- not sure if available in any DOS --> * 710 β MS-DOS Arabic (Transparent Arabic)<!-- not sure if available in any DOS --> * 711 β MS-DOS Arabic Nafitha Enhanced (Not from IBM) * 714 β MS-DOS Arabic Sakr (Not from IBM)<!--Not to be confused with Arabic Sakhr, above. --> * 715 β MS-DOS Arabic APTEC (Not from IBM) * 721 β MS-DOS Arabic Nafitha International (Not from IBM) * 768 β Arabic Al-Arabi (Not from IBM) * 770 β DOS Estonian, Latvian, Lithuanian<ref name="CP770"/> (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian RST 1095-89 National Standard) * [[Code page 771|771]] β DOS Lithuanian/Cyrillic β KBL<ref name="CP771"/> (From Lithuanian Lika Software<ref name="lika"/>) * [[Code page 772|772]] β DOS Lithuanian/Cyrillic<ref name="CP772"/> (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian LST 1284:1993 National Standard; adopted by IBM as [[code page 1119]]) * 773 β DOS Latin-7 β KBL (From Lithuanian Lika Software) * [[Code page 774|774]] β DOS Lithuanian<ref name="CP774"/> (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian LST 1283:1993 National Standard; adopted by IBM as [[code page 1118]]) * 775 β DOS Latin-7 Baltic Rim (From Lithuanian Lika Software;<ref name="lika"/> Lithuanian LST 1590-1 National Standard; adopted by IBM and Microsoft as code page 775) * 776 β DOS Lithuanian (extended CP770)<ref name="lithuanian-charsets"/> (From Lithuanian Lika Software<ref name="lika"/>) * 777 β DOS Accented Lithuanian (old) (extended CP773) β KBL<ref name="lithuanian-charsets"/> (From Lithuanian Lika Software<ref name="lika"/>) * 778 β DOS Accented Lithuanian (extended CP775)<ref name="lithuanian-charsets"/> (From Lithuanian Lika Software<ref name="lika"/>) * [[Code page 790|790]] β DOS [[Mazovia encoding|Polish (Mazovia)]] with curly quotation marks * 854 β Spanish<ref name="Hogan_1992_REF-DE"/><ref name="Paul_2001_CODEPAGE"/><!-- Cites the Hogan book for CP854 as well since 1995. No other source found so far. May actually have been the Latin 4 code page! --> * 881 β Latin 1 (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM EBCDIC 881) * 882 β Latin 2 (ISO 8859-2) (Not from IBM; same as Code page 912; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM EBCDIC 882) * 883 β Latin 3 (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM EBCDIC 883) * 884 β Latin 4 (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM EBCDIC 884) * 885 β Latin 5 (Not from IBM; AST Premium Exec DOS 5.0<ref name="RBIL"/><ref name="Paul_1997_NWDOSTIP"/><ref name="Paul_2001_NWDOSTIP"/>) (conflictive ID with IBM EBCDIC 885) * [[Code page 895|895]] β [[KamenickΓ½ encoding|Czech (KamenickΓ½)]], (Not from IBM; conflictive ID with IBM CP895 β 7-bit EUC Japanese Roman) * [[Mazovia encoding|896]] β DOS [[Mazovia encoding|Polish (Mazovia)]] (Not from IBM; conflictive ID with IBM CP896 β 7-bit EUC Japanese Katakana)<!-- Variant with the character "zΕ" in position 9B? --> * [[Code page 900|900]] β DOS Russian (Russian MS-DOS 5.0 LCD.CPI) * [[ISO 8859-7|928]] β Greek (on Star<ref name="star"/> printers); same as Greek National Standard [[ISO 8859-7|ELOT 928]] (Not from IBM; conflictive ID with IBM CP928 β Simplified Chinese PC DBCS) * [[Code page 966|966]] β Saudi Arabian (Not from IBM) * 972 β Hebrew (VT100) (Not from IBM) * [[Code page 991|991]] β DOS [[Mazovia encoding|Polish (Mazovia)]] (Not from IBM) * [[Code page 999|999]] β DOS Serbo-Croatian I (Not from IBM); also known as PC Nova and CroSCII; lower part is JUSI.B1.002, upper part is code page 437; supports [[Slovenian language|Slovenian]] and [[Serbo-Croatian language|Serbo-Croatian]] (Latin script) * [[Code page 1001|1001]] β Arabic (on Star<ref name="star"/> printers) (Not from IBM; conflictive ID with IBM CP1001 β MICR) * [[Code page 1261|1261]] β Windows Korean IBM-1261 LMBCS-17, similar to [[Code page 1363|1363]]<!--https://web.archive.org/web/20161220082724/https://fossies.org/dox/w32tex-src/ucnv__lmb_8c_source.html Isn't it, by any chance, a misprint of code page 1361 (Johab)? Then code page 1261 is Windows Latin-3.--> * [[Code page 1270|1270]] β Windows SΓ‘mi * [[Code page 1300|1300]] β ANSI [PTS-DOS 6.70, not 6.51] (Not from IBM; conflictive ID with IBM EBCDIC 1300 β Generic Bar Code/OCR-B) * [[Code page 771|2001]] β Lithuanian KBL (on Star<ref name="star"/> printers); same as code page 771 * [[Code page 1116|3001]] β Estonian 1 (on Star<ref name="star"/> printers); same as code page 1116 * [[Code page 922|3002]] β Estonian 2 (on Star<ref name="star"/> printers); same as code page 922 * 3011 β Latvian 1 (on Star<ref name="star"/> printers); same as code page 437-Latvian * [[Code page 866-Latvian|3012]] β Latvian-2 (on Star<ref name="star"/> printers); same as code page 866-Latvian (Latvian RST 1040-90 National Standard) * [[MIK (character set)|3021]] β Bulgarian (on Star<ref name="star"/> printers); same as MIK * [[Code page 862|3031]] β Hebrew (on Star<ref name="star"/> printers); same as code page 862 * 3041 β Maltese (on Star<ref name="star"/> printers); same as [[ISO 646]] Maltese * 3840 β IBM-Russian (on Star<ref name="star"/> printers); nearly the same as [[CP 866]] * 3841 β Gost-Russian (on Star<ref name="star"/> printers); [[KOI-8|GOST 13052]] plus characters for Central Asian languages * [[Mazovia encoding|3843]] β Polish (on Star<ref name="star"/> printers); same as Mazovia * [[KamenickΓ½ encoding|3844]] β CS2 (on Star<ref name="star"/> printers); same as KamenickΓ½ * [[CWI-2|3845]] β Hungarian (on Star<ref name="star"/> printers); same as CWI * 3846 β Turkish (on Star<ref name="star"/> printers); same as PC-8 Turkish + old Turkish Lira sign (TΚ) at code point A8 * [[BraSCII|3847]] β Brazil-ABNT (on Star<ref name="star"/> printers); same as the Brazilian National Standard NBR-9614:1986<!-- I am still investigating if BraSCII is the same as ISO 8859-1 or not --> * [[ABICOMP character set|3848]] β Brazil-ABICOMP (on Star<ref name="star"/> printers); same as ABICOMP * 3850 β Standard KU (on Star<ref name="star"/> printers); variation of the Kasetsart University encoding for Thai * 3860 β Rajvitee KU (on Star<ref name="star"/> printers); variation of the Kasetsart University encoding for Thai * 3861 β Microwiz KU (on Star<ref name="star"/> printers); variation of the Kasetsart University encoding for Thai * 3863 β STD988 TIS (on Star<ref name="star"/> printers); variation of the TIS 620 encoding for Thai * 3864 β Popular TIS (on Star<ref name="star"/> printers); variation of the TIS 620 encoding for Thai * 3865 β Newsic TIS (on Star<ref name="star"/> printers); variation of the TIS 620 encoding for Thai * [[FOCAL character set|28799]] β [[FOCAL character set|FOCAL]] (on Star<ref name="star"/> printers); same as FOCAL character set * [[RPL character set|28800]] β [[RPL character set|HP RPL]] (on Star<ref name="star"/> printers); same as [[RPL character set|RPL]] * (number missing) β [[CWI-2]] (for DOS) supports [[Hungarian language|Hungarian]] * (number missing) β [[MIK (character set)|MIK]] (for DOS) supports [[Bulgarian language|Bulgarian]] * (number missing) β DOS Serbo-Croatian II; supports [[Slovenian language|Slovenian]] and [[Serbo-Croatian language|Serbo-Croatian]] (Latin script) * (number missing) β Russian [[Alternative code page (Russian)|Alternative code page]] (for DOS); this is the origin for IBM [[CP 866]] {{div col end}} == {{anchor|0|999|57344|61439|65280|65533|65534|65535}}List of code page assignments == {{Incomplete list|date=July 2014}} List of known code page assignments (incomplete): {| class="wikitable" style="float:center;" ! ID !! Names !! Description !! Origin !! Platform !! DOS !! OS/2 !! Windows !! Mac !! Else !! Encoding !! Comment |- | 0 || {{N/A}} || Reserved || IBM, Microsoft || {{N/A}} || 3.3+ || 1.0+ || {{dunno}} || {{dunno}} || {{dunno}} || || Internal OS use<ref name="Paul_2002"/> |- | 437 || CP437, IBM437 || PC US || IBM<ref name="CP437"/> || IBM PC || 3.3+ || 1.0+ || {{Yes}} || {{dunno}} || {{Yes}} || 8-bit [[SBCS]] || |- | 57344 - 61439 || {{N/A}} || Private use derivations || IBM || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{varies|various}} || Private use code page derivations (E000h-EFFFh) |- | 65280 - 65533 || {{N/A}} || Private use definitions || IBM || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{N/A}} || {{varies|various}} || Private use code page definitions (FF00h-FFFDh) |- | 65534 || {{N/A}} || Reserved || IBM, Microsoft || {{N/A}} || {{dunno}} || {{dunno}} || {{dunno}} || {{dunno}} || {{dunno}} || {{varies|various}} || Internal OS use (FFFEh) |- | 65535 || {{N/A}} || Reserved || IBM, Microsoft || {{N/A}} || 3.3+ || 1.0+ || {{dunno}} || {{dunno}} || {{dunno}} || {{varies|various}} || Internal OS use (FFFFh)<ref name="Paul_2002"/> |} == Criticism == Many older character encodings (unlike Unicode) suffer from several problems. Some vendors insufficiently document the meaning of all code point values in their code pages, which decreases the reliability of handling textual data consistently through various computer systems. Some vendors add proprietary extensions to established code pages, to add or change certain code point values: for example, byte 0x5C in [[Shift JIS]] can represent either a [[back slash]] or a [[yen sign]] depending on the platform. Finally, in order to support several languages in a program that does not use Unicode, the code page used for each string/document needs to be stored. Applications may also mislabel text in [[Windows-1252]] as [[ISO-8859-1]]. The only difference between these code pages is that the code point values in the range 0x80{{ndash}}0x9F, used by ISO-8859-1 for control characters, are instead used as additional printable characters in Windows-1252{{snd}} notably for [[quotation marks]], the [[euro sign]] and the [[trademark symbol]] among others. Browsers on non-Windows platforms would tend to show empty boxes or question marks for these characters, making the text hard to read. Most browsers fixed this by ignoring the character set and interpreting as Windows-1252 to look acceptable. In HTML5, treating ISO-8859-1 as Windows-1252 is even codified as a [[W3C]] standard.<ref>{{cite web |url=https://encoding.spec.whatwg.org/#names-and-labels |title=Encoding |at=sec. 4.2 Names and labels |publisher=[[WHATWG]] |date=27 January 2015 |access-date=4 February 2015 |archive-url=https://web.archive.org/web/20150204174315/https://encoding.spec.whatwg.org/#names-and-labels |archive-date=4 February 2015 |url-status=live}}</ref> Although browsers were typically programmed to deal with this behaviour, this was not always true of other software. Consequently, when receiving a file transfer from a Windows system, non-Windows platforms would either ignore these characters or treat them as a standard control characters and attempt to take the specified control action accordingly. Due to Unicode's extensive documentation, vast repertoire of characters and stability policy of characters, the problems listed above are rarely a concern for Unicode. [[UTF-8]] (which can encode over one million codepoints) has replaced the code-page method in terms of popularity on the Internet.<ref name="Statistics"/><ref name="Statistics_UTF-8"/> == {{anchor|Private use|User-definable}}Private code pages == When, early in the history of personal computers, users did not find their character encoding requirements met, private or local code pages were created using [[terminate-and-stay-resident]] utilities or by re-programming [[BIOS]] [[EPROM]]s. In some cases, unofficial code page numbers were invented (e.g. CP895). When more diverse character set support became available most of those code pages fell into disuse, with some exceptions such as the [[KamenickΓ½ encoding|KamenickΓ½]] or KEYBCS2 encoding for the [[Czech alphabet|Czech]] and [[Slovak alphabet|Slovak]] alphabets. Another character set is [[Iran System encoding standard]] that was created by Iran System corporation for [[Persian language]] support. This standard was in use in Iran in DOS-based programs and after introduction of Microsoft code page 1256 this standard became obsolete. However some Windows and DOS programs using this encoding are still in use and some Windows fonts with this encoding exist. In order to overcome such problems, the [[IBM Character Data Representation Architecture]] level 2 specifically reserves ranges of code page IDs for user-definable and private-use assignments. Whenever such code page IDs are used, the user must not assume that the same functionality and appearance can be reproduced in another system configuration or on another device or system unless the user takes care of this specifically. The code page range 57344-61439 ({{Mono|E000h}}-{{Mono|EFFFh}}) is officially reserved for user-definable code pages (or actually CCSIDs in the context of [[IBM CDRA]]), whereas the range 65280-65533 ({{Mono|FF00h}}-{{Mono|FFFDh}}) is reserved for any user-definable "private use" assignments. For example, a non-registered custom variant of code page 437 ({{Mono|1B5h}}) or 28591 ({{Mono|6FAF}}) could become 57781 ({{Mono|E1B5h}}) or 61359 ({{Mono|EFAFh}}), respectively, in order to avoid potential conflicts with other assignments and maintain the sometimes existing internal numerical logic in the assignments of the original code pages. An unregistered private code page not based on an existing code page, a device specific code page like a printer font, which just needs a logical handle to become addressable for the system, a frequently changing download font, or a code page number with a symbolic meaning in the local environment could have an assignment in the private range like 65280 ({{Mono|FF00h}}). The code page IDs 0, 65534 ({{Mono|FFFEh}}) and 65535 ({{Mono|FFFFh}}) are reserved for internal use by operating systems such as DOS and must not be assigned to any specific code pages. == See also == * [[Windows code page]] * [[Character encoding]] * [[CCSID]] IBM's official "code page" definitions and assignments * [[Charset detection]] * [[Unicode]] ==References== {{Reflist|refs= <ref name="PCL">{{Cite web|url=https://www.pclviewer.com/resources/pcl_symbolset.html|title=Printer Command Language Symbol Sets|website=www.pclviewer.com|access-date=2021-05-25 |archive-date=2020-07-31 |archive-url=https://web.archive.org/web/20200731225417/https://www.pclviewer.com/resources/pcl_symbolset.html|url-status=live}}</ref> <ref name="PCL-Symbol">{{Cite web|url=http://pclhelp.com/pcl-symbol-sets/|title=HP Symbol Sets|website=pclhelp.com|access-date=2017-02-20 |archive-date=2015-02-19 |archive-url=https://web.archive.org/web/20150219212843/http://pclhelp.com/pcl-symbol-sets/|url-status=dead}}</ref> <ref name="PCL5">{{Cite web |url=http://h10032.www1.hp.com/ctg/Manual/bpl13206.pdf |title=PCL5 Camparison Guide |access-date=2017-02-20 |archive-date=2017-02-21 |archive-url=https://web.archive.org/web/20170221105403/http://h10032.www1.hp.com/ctg/Manual/bpl13206.pdf |url-status=live }}</ref> <ref name="Microsoft_Codepage-ID">{{cite web |url=http://msdn.microsoft.com/en-us/library/windows/desktop/dd317756%28v=vs.85%29.aspx#Code |title=Code Page Identifiers |publisher=[[Microsoft]] |website=microsoft.com |access-date=2014-10-27 |archive-date=2014-10-27 |archive-url=https://web.archive.org/web/20141027012724/http://msdn.microsoft.com/en-us/library/windows/desktop/dd317756(v=vs.85).aspx#Code |url-status=live }}</ref> <ref name="Duncan_1988_MS-DOS_Encyclopedia">{{Cite book |title=The MS-DOS Encyclopedia: versions 1.0 through 3.2 |author-first1=Ray |author-last1=Duncan |author-first2=Steve |author-last2=Bostwick |author-first3=Keith |author-last3=Burgoyne |author-first4=Robert A. |author-last4=Byers |author-first5=Thom |author-last5=Hogan |author-first6=Jim |author-last6=Kyle |author-first7=Gordon |author-last7=Letwin |author-link7=Gordon Letwin |author-first8=Charles |author-last8=Petzold |author-link8=Charles Petzold |author-first9=Chip |author-last9=Rabinowitz |author-first10=Jim |author-last10=Tomlin |author-first11=Richard |author-last11=Wilton |author-first12=Van |author-last12=Wolverton |author-first13=William |author-last13=Wong |author-first14=JoAnne |author-last14=Woodcock |contribution=Technical advisors |contributor-first1=Mark |contributor-last1=Zbikowski |contributor-link1=Mark Zbikowski |contributor-first2=Paul |contributor-last2=Allen |contributor-link2=Paul Allen |contributor-first3=Steve |contributor-last3=Ballmer |contributor-link3=Steve Ballmer |contributor-first4=Reuben |contributor-last4=Borman |contributor-first5=Rob |contributor-last5=Borman |contributor-first6=John |contributor-last6=Butler |contributor-first7=Chuck |contributor-last7=Carroll |contributor-first8=Mark |contributor-last8=Chamberlain |contributor-first9=David |contributor-last9=Chell |contributor-first10=Mike |contributor-last10=Colee |contributor-first11=Mike |contributor-last11=Courtney |contributor-first12=Mike |contributor-last12=Dryfoos |contributor-first13=Rachel |contributor-last13=Duncan |contributor-first14=Kurt |contributor-last14=Eckhardt |contributor-first15=Eric |contributor-last15=Evans |contributor-first16=Rick |contributor-last16=Farmer |contributor-first17=Bill |contributor-last17=Gates |contributor-link17=Bill Gates |contributor-first18=Michael |contributor-last18=Geary |contributor-first19=Bob |contributor-last19=Griffin |contributor-first20=Doug |contributor-last20=Hogarth |contributor-first21=James W. |contributor-last21=Johnson |contributor-first22=Kaamel |contributor-last22=Kermaani |contributor-first23=Adrian |contributor-last23=King |contributor-first24=Reed |contributor-last24=Koch |contributor-first25=James |contributor-last25=Landowski |contributor-first26=Chris |contributor-last26=Larson |contributor-first27=Thomas |contributor-last27=Lennon |contributor-first28=Dan |contributor-last28=Lipkie |contributor-first29=Marc |contributor-last29=McDonald |contributor-link29=Marc McDonald |contributor-first30=Bruce |contributor-last30=McKinney |contributor-first31=Pascal |contributor-last31=Martin |contributor-first32=Estelle |contributor-last32=Mathers |contributor-first33=Bob |contributor-last33=Matthews <!-- |contributor-link33=Robert Matthews (scientist)??? --> |contributor-first34=David |contributor-last34=Melin |contributor-first35=Charles |contributor-last35=Mergentime |contributor-first36=Randy |contributor-last36=Nevin |contributor-first37=Dan |contributor-last37=Newell |contributor-first38=Tani |contributor-last38=Newell |contributor-first39=David |contributor-last39=Norris |contributor-first40=Mike |contributor-last40=O'Leary |contributor-first41=Bob |contributor-last41=O'Rear |contributor-link41=Bob O'Rear |contributor-first42=Mike |contributor-last42=Olsson |contributor-first43=Larry |contributor-last43=Osterman |contributor-first44=Ridge |contributor-last44=Ostling |contributor-first45=Sunil |contributor-last45=Pai |contributor-first46=Tim |contributor-last46=Paterson |contributor-link46=Tim Paterson |contributor-first47=Gary |contributor-last47=Perez |contributor-first48=Chris |contributor-last48=Peters |contributor-first49=Charles |contributor-last49=Petzold |contributor-link49=Charles Petzold |contributor-first50=John |contributor-last50=Pollock |contributor-first51=Aaron |contributor-last51=Reynolds |contributor-link51=Aaron R. Reynolds |contributor-first52=Darryl |contributor-last52=Rubin |contributor-first53=Ralph |contributor-last53=Ryan |contributor-first54=Karl |contributor-last54=Schulmeisters |contributor-first55=Rajen |contributor-last55=Shah |contributor-first56=Barry |contributor-last56=Shaw |contributor-first57=Anthony |contributor-last57=Short |contributor-first58=Ben |contributor-last58=Slivka |contributor-first59=Jon |contributor-last59=Smirl |contributor-first60=Betty |contributor-last60=Stillmaker |contributor-first61=John |contributor-last61=Stoddard |contributor-first62=Dennis |contributor-last62=Tillman |contributor-first63=Greg |contributor-last63=Whitten |contributor-first64=Natalie |contributor-last64=Yount |contributor-first65=Steve |contributor-last65=Zeck |date=1988 |edition=Completely reworked |publisher=[[Microsoft Press]] |location=Redmond, Washington, USA |isbn=1-55615-049-0 |lccn=87-21452 |oclc=16581341}} [https://www.pcjs.org/pubs/pc/reference/microsoft/mspl13/msdos/encyclopedia/] {{Webarchive|url=https://web.archive.org/web/20181014053041/https://www.pcjs.org/pubs/pc/reference/microsoft/mspl13/msdos/encyclopedia/ |date=2018-10-14 }} (xix+1570 pages; 26 cm) (NB. This edition was published in 1988 after extensive rework of the withdrawn 1986 first edition by a different team of authors.)</ref> <ref name="Paul_2001_CPI">{{cite web |title=Format description of DOS, OS/2, and Windows NT .CPI, and Linux .CP files |type=CPI.LST file |author-first=Matthias R. |author-last=Paul |date=2001-06-10 |edition=1.30 |orig-year=1995 |url=http://www.columbia.edu/~em36/wpdos/MatthiasPaulCPI.zip<!-- not an official distribution archive, but contains an older version of CPI.LST --> |access-date=2016-08-20 |url-status=live |archive-url=https://web.archive.org/web/20160420065252/http://www.columbia.edu/~em36/wpdos/MatthiasPaulCPI.zip |archive-date=2016-04-20}}</ref> <ref name="Elliott_2006_CPI">{{cite web |author-first=John C. |author-last=Elliott |title=CPI file format |date=2006-10-14 |work=Seasip.info |url=http://www.seasip.info/DOS/CPI/cpi.html |access-date=2016-09-22 |url-status=live |archive-url=https://web.archive.org/web/20160922165307/http://www.seasip.info/DOS/CPI/cpi.html |archive-date=2016-09-22}}</ref> <ref name="AEB_2001_CPI">{{cite web |title=CPI fonts |author-first=Andries Evert |author-last=Brouwer |author-link=Andries Evert Brouwer |date=2001-02-10 |version=0.2 |url=https://www.win.tue.nl/~aeb/linux/kbd/font-formats-3.html |access-date=2016-09-22 |url-status=live |archive-url=https://web.archive.org/web/20160922165133/https://www.win.tue.nl/~aeb/linux/kbd/font-formats-3.html |archive-date=2016-09-22}}</ref> <ref name="Haralambous_2007_CPI">{{cite book |title=Fonts & Encodings |author-first=Yannis |author-last=Haralambous |translator-first=P. Scott |translator-last=Horne |location=Sebastopol, California, USA |publisher=[[O'Reilly Media, Inc.]] |date=September 2007 |edition=1 |isbn=978-0-596-10242-5 |pages=601β602, 611}}</ref> <ref name="MS_1991_PR">{{cite book |title=MS-DOS Programmer's Reference |publisher=[[Microsoft Press]] |date=1991 |isbn=1-55615-329-5 |url-access=registration |url=https://archive.org/details/isbn_9781556153297}}</ref> <ref name="IBM_i_EBCDIC">{{Cite web|url=https://www.ibm.com/support/knowledgecenter/SSEQ5Y_14.0.0/com.ibm.pcomm.doc/reference/html/hcp_reference02.htm|title=Contents|website=www.ibm.com}}</ref> <ref name="SAP_CP">{{cite web |url=http://help.sap.com/abapdocu_70/en/ABENCODEPAGE_GLOSRY.htm |title=Code Page |work=sap.com |access-date=2009-08-08 |archive-date=2009-11-14 |archive-url=https://web.archive.org/web/20091114153529/http://help.sap.com/abapdocu_70/en/ABENCODEPAGE_GLOSRY.htm |url-status=live }}</ref> <ref name="oracle.com">{{cite web |url=http://download.oracle.com/docs/cd/E12032_01/doc/epm.921/html_esb_dbag/glossary.htm#1009960 |title=Glossary |work=oracle.com |access-date=2009-08-08 |archive-date=2011-09-30 |archive-url=https://web.archive.org/web/20110930123137/http://download.oracle.com/docs/cd/E12032_01/doc/epm.921/html_esb_dbag/glossary.htm#1009960 |url-status=live }}</ref> <ref name="DEC_VT510">{{cite web |title=VT510 Video Terminal Programmer Information |at=7.1. Character Sets - Overview |publisher=[[Digital Equipment Corporation]] (DEC) |url=http://www.vt100.net/docs/vt510-rm/chapter7.html#S7.1 |access-date=2017-02-15 |quote=In addition to traditional [[Digital Equipment Corporation|DEC]] and [[ISO]] character sets, which conform to the structure and rules of [[ISO 2022]], the [[VT510]] supports a number of IBM PC code pages ([[page number]]s in IBM's standard character set manual) in [[PCTerm]] mode to emulate the [[console terminal]] of industry-standard PCs. |archive-date=2016-01-26 |archive-url=https://web.archive.org/web/20160126192029/http://www.vt100.net/docs/vt510-rm/chapter7.html#S7.1 |url-status=live }}</ref> <ref name="DEC_1994_Codepage">{{cite book |title=VT520/VT525 Video Terminal Programmer Information |chapter=7.1. Character Sets - Overview |date=July 1994 |page=7-1 |id=EK-VT520-RM. A01 |publisher=[[Digital Equipment Corporation]] (DEC) |url=http://web.mit.edu/dosathena/doc/www/ek-vt520-rm.pdf |access-date=2017-02-15 |url-status=live |archive-url=https://web.archive.org/web/20170215184432/http://web.mit.edu/dosathena/doc/www/ek-vt520-rm.pdf |archive-date=2017-02-15 |quote=In addition to traditional [[Digital Equipment Corporation|DEC]] and [[ISO]] character sets the [[VT520]] supports a number of IBM PC code pages (which refer to [[page number]]s in IBM's standard character set manual) in [[PCTerm]] mode to emulate the [[console terminal]] of industry-standard PCs.}}</ref> <ref name="Paul_2001_CODEPAGE">{{cite web |title=Overview on DOS, OS/2, and Windows codepages |type=CODEPAGE.LST file |author-first=Matthias R. |author-last=Paul |date=2001-06-10 |edition=1.59 preliminary |orig-year=1995 |url=http://www.columbia.edu/~em36/wpdos/MatthiasPaulCPI.zip<!-- not an official distribution archive, but contains an older preliminary version of CODEPAGE.LST --> |access-date=2016-08-20 |url-status=live |archive-url=https://web.archive.org/web/20160420065252/http://www.columbia.edu/~em36/wpdos/MatthiasPaulCPI.zip |archive-date=2016-04-20}}</ref> <ref name="Hogan_1992_REF-DE">{{cite book |title=Die PC-Referenz fΓΌr Programmierer |language=German |author-first=Thom |author-last=Hogan |edition=2 |date=1992 |publisher=Systhema Verlag GmbH |isbn=3-89390-272-4}} (NB. This book is the German translation of "The Programmer's PC Sourcebook" by Microsoft Press. It mentions the code page ID 854 for Spain.<!-- To be checked: Do English version and other editions of this book contain 854 as well? -->)</ref> <ref name="IBM_CP1004">{{cite web |title=Codepage 1004 - Windows Extended |publisher=[[IBM]] |date=2001 |url=http://www.borgendale.com/codepage/cp1004.gif |access-date=2018-05-13 |url-status=live |archive-url=https://web.archive.org/web/20180513184106/http://www.borgendale.com/codepage/cp1004.gif |archive-date=2018-05-13}}</ref> <ref name="VGA-Programming">{{cite web |url=http://www.osdever.net/FreeVGA/vga/vgatext.htm |title=VGA/SVGA Video Programming--VGA Text Mode Operation |website=osdever.net |access-date=2006-09-23 |archive-date=2010-09-01 |archive-url=https://web.archive.org/web/20100901012805/http://www.osdever.net/FreeVGA/vga/vgatext.htm |url-status=live }}</ref> <ref name="IBM_2010">{{citation |title=xlate - Transliterate Contents of Records |publisher=[[IBM Corporation]] |orig-year=1986 |date=2010 |url=http://vm.marist.edu/%7Epipeline/bhelp/ACH0275.html |access-date=2016-10-18 |archive-date=2019-06-16 |archive-url=https://web.archive.org/web/20190616013546/http://vm.marist.edu/~pipeline/bhelp/ACH0275.html |url-status=live }}</ref> <ref name="Microsoft_Codepages">{{cite web |url=http://msdn.microsoft.com/en-us/library/dd317752.aspx |title=Code Pages |publisher=[[Microsoft]] |website=microsoft.com |access-date=2010-12-21 |archive-date=2011-02-27 |archive-url=https://web.archive.org/web/20110227024708/http://msdn.microsoft.com/en-us/library/dd317752.aspx |url-status=live }}</ref> <ref name="IBM">{{Cite web|url=https://github.com/pentaho/pentaho-reporting/tree/master/libraries/libfonts/encodings/VENDORS/IBM|title=pentaho/pentaho-reporting|website=GitHub|access-date=2017-02-20 |archive-date=2019-06-16 |archive-url=https://web.archive.org/web/20190616013543/https://github.com/pentaho/pentaho-reporting/tree/master/libraries/libfonts/encodings/VENDORS/IBM|url-status=live}}</ref> <ref name="MS_2014_CP201xx">{{cite web |title=Code Page Identifiers |publisher=Microsoft|work=[[Microsoft Developer Network]] |date=2014 |url=https://msdn.microsoft.com/de-de/en-en/library/windows/desktop/dd317756(v=vs.85).aspx |access-date=2016-06-19 |url-status=live |archive-url=https://web.archive.org/web/20160619132819/https://msdn.microsoft.com/de-de/en-en/library/windows/desktop/dd317756%28v%3Dvs.85%29.aspx |archive-date=2016-06-19}}</ref> <ref name="WG_CP201xx">{{cite web |title=Web Encodings - Internet Explorer - Encodings |date=2012-10-23 |work=WHATWG Wiki |url=https://wiki.whatwg.org/wiki/Web_Encodings#Encodings_3 |access-date=2016-06-20 |url-status=live |archive-url=https://web.archive.org/web/20160620184648/https://wiki.whatwg.org/wiki/Web_Encodings#Encodings_3 |archive-date=2016-06-20}}</ref> <ref name="WU_CP20105">{{cite web |title=Western European (IA5) encoding - Windows charsets |author-first=Antonin |author-last=Foller |date=2014 |orig-year=2011 |work=WUtils.com - Online web utility and help |publisher=Motobit Software |url=http://wutils.com/encodings/x-ia5 |access-date=2016-06-20 |url-status=live |archive-url=https://web.archive.org/web/20160620183843/http://wutils.com/encodings/x-ia5 |archive-date=2016-06-20}}</ref> <ref name="WU_CP20106">{{cite web |title=German (IA5) encoding - Windows charsets |author-first=Antonin |author-last=Foller |date=2014 |orig-year=2011 |work=WUtils.com - Online web utility and help |publisher=Motobit Software |url=http://wutils.com/encodings/x-ia5-german |access-date=2016-06-20 |url-status=live |archive-url=https://web.archive.org/web/20160620183739/http://wutils.com/encodings/x-ia5-german |archive-date=2016-06-20}}</ref> <ref name="WU_CP20107">{{cite web |title=Swedish (IA5) encoding - Windows charsets |author-first=Antonin |author-last=Foller |date=2014 |orig-year=2011 |work=WUtils.com - Online web utility and help |publisher=Motobit Software |url=http://wutils.com/encodings/x-ia5-swedish |access-date=2016-06-20 |url-status=live |archive-url=https://web.archive.org/web/20160620183231/http://wutils.com/encodings/x-ia5-swedish |archive-date=2016-06-20}}</ref> <ref name="WU_CP20108">{{cite web |title=Norwegian (IA5) encoding - Windows charsets |author-first=Antonin |author-last=Foller |date=2014 |orig-year=2011 |work=WUtils.com - Online web utility and help |publisher=Motobit Software |url=http://wutils.com/encodings/x-ia5-norwegian |access-date=2016-06-20 |url-status=live |archive-url=https://web.archive.org/web/20160620183449/http://wutils.com/encodings/x-ia5-norwegian |archive-date=2016-06-20}}</ref> <ref name="WU_CP20127">{{cite web |title=US-ASCII encoding - Windows charsets |author-first=Antonin |author-last=Foller |date=2014 |orig-year=2011 |work=WUtils.com - Online web utility and help |publisher=Motobit Software |url=http://wutils.com/encodings/us-ascii |access-date=2016-06-20 |url-status=live |archive-url=https://web.archive.org/web/20160620192058/http://wutils.com/encodings/us-ascii |archive-date=2016-06-20}}</ref> <ref name="Paul_2002">{{citation |title=Technical info on undocumented DOS country info for LCASE, ARAMODE and CCTORC records |author-first=Matthias R. |author-last=Paul |date=2002-09-05 |publisher=FreeDOS development list fd-dev at Topica |url=http://permalink.gmane.org/gmane.os.freedos.devel/364 |access-date=2016-05-26 |url-status=dead |archive-url=https://web.archive.org/web/20160527142512/http://permalink.gmane.org/gmane.os.freedos.devel/364 |archive-date=2016-05-27}}</ref> <ref name="RBIL">{{cite book |author-first=Ralf D. |author-last=Brown |author-link=Ralf D. Brown |title=The x86 Interrupt List |title-link=Ralf Brown's Interrupt List |version=61 |date=2002-12-29}}</ref> <ref name="Paul_1997_NWDOSTIP">{{cite book |title=NWDOS-TIPs β Tips & Tricks rund um Novell DOS 7, mit Blick auf undokumentierte Details, Bugs und Workarounds |series=MPDOSTIP |author-first=Matthias R. |author-last=Paul |date=1997-07-30 |edition=3 |language=de |url=http://www.antonis.de/dos/dos-tuts/mpdostip/html/nwdostip.htm |access-date=2012-01-11 |url-status=dead |archive-url=https://web.archive.org/web/20160522014302/http://www.antonis.de/dos/dos-tuts/mpdostip/html/nwdostip.htm |archive-date=22 May 2016 }} (NB. NWDOSTIP.TXT is a comprehensive work on [[Novell DOS 7]] and [[OpenDOS 7.01]], including the description of many undocumented features and internals. It is part of the author's yet larger MPDOSTIP.ZIP collection maintained up to 2001 and distributed on many sites at the time. The provided link points to a HTML-converted older version of the NWDOSTIP.TXT file.)</ref> <ref name="Paul_2001_NWDOSTIP">{{cite book |title=NWDOS-TIPs β Tips & Tricks rund um Novell DOS 7, mit Blick auf undokumentierte Details, Bugs und Workarounds |series=MPDOSTIP |author-first=Matthias R. |author-last=Paul |date=2001-04-09 |edition=3 |language=de}}</ref> <ref name="lika">{{Cite web|url=http://www.likit.lt/indexw.php|title=LIKIT|website=www.likit.lt|access-date=2017-02-25 |archive-date=2017-04-19 |archive-url=https://web.archive.org/web/20170419171720/http://www.likit.lt/indexw.php|url-status=live}}</ref> <ref name="CP770">{{cite web |url=http://www.likit.lt/nostyle/770.htm |title=770 |access-date=2017-02-25 |archive-date=2017-02-26 |archive-url=https://web.archive.org/web/20170226132541/http://www.likit.lt/nostyle/770.htm |url-status=live }} From Lithuanian Lika Software</ref> <ref name="CP771">{{cite web |url=http://www.likit.lt/nostyle/771.htm |title=771 |access-date=2017-02-25 |archive-date=2017-02-26 |archive-url=https://web.archive.org/web/20170226132539/http://www.likit.lt/nostyle/771.htm |url-status=live }} From Lithuanian Lika Software</ref> <ref name="CP772">{{cite web |url=http://www.likit.lt/nostyle/772.htm |title=772 |access-date=2017-02-25 |archive-date=2017-02-26 |archive-url=https://web.archive.org/web/20170226132547/http://www.likit.lt/nostyle/772.htm |url-status=live }} From Lithuanian Lika Software</ref> <ref name="CP774">{{cite web |url=http://www.likit.lt/nostyle/774.htm |title=774 |access-date=2017-02-25 |archive-date=2017-02-26 |archive-url=https://web.archive.org/web/20170226132542/http://www.likit.lt/nostyle/774.htm |url-status=live }} From Lithuanian Lika Software</ref> <ref name="star">{{cite web |url=https://www.star-m.jp/eng/service/usermanual/lc8021um.pdf |title=Star LC 8021 User's Manual |access-date=2017-02-20 |archive-date=2020-09-29 |archive-url=https://web.archive.org/web/20200929022733/https://www.star-m.jp/eng/service/usermanual/lc8021um.pdf |url-status=live }}</ref> <ref name="CP437">{{cite web |author=IBM |title=SBCS code page information document - CPGID 00437 |url=http://www-01.ibm.com/software/globalization/cp/cp00437.html |access-date=2014-07-04 |archive-date=2016-06-09 |archive-url=https://web.archive.org/web/20160609084933/https://www-01.ibm.com/software/globalization/cp/cp00437.html |url-status=live }}</ref> <ref name="Statistics">{{cite web |url=http://w3techs.com/technologies/overview/character_encoding/all |title=Usage Statistics of Character Encodings for Websites, (updated daily) |website=w3techs.com |access-date=2015-08-06}}</ref> <ref name="Statistics_UTF-8">{{cite web |url=http://trends.builtwith.com/encoding/UTF-8 |title=UTF-8 Usage Statistics |website=trends.builtwith.com |access-date=2011-03-28 |archive-date=2011-03-24 |archive-url=https://web.archive.org/web/20110324131209/http://trends.builtwith.com/encoding/UTF-8 |url-status=dead }}</ref> <ref name="lithuanian-charsets">{{cite web |url=http://xn--lietuvyb-ceb.lt/standartai/ra%C5%A1men%C5%B3-koduo%C4%8Di%C5%B3-standartai/ |title=lietuvybΔ.lt - RaΕ‘menΕ³ koduotΔs |language=lt |trans-title=lietuvybΔ.lt - Character encodings |access-date=2019-08-28 |archive-date=2019-08-28 |archive-url=https://web.archive.org/web/20190828013454/http://xn--lietuvyb-ceb.lt/standartai/ra%25C5%25A1men%25C5%25B3-koduo%25C4%258Di%25C5%25B3-standartai/ |url-status=live }}</ref> <ref name="Kazakh_1174">{{cite web|url=https://www.ibm.com/downloads/cas/G01BQVRV|title=Character Data Representation Architecture|website=[[IBM]] |access-date=2019-10-12 |archive-date=2019-06-23 |archive-url=https://web.archive.org/web/20190623065058/https://www.ibm.com/downloads/cas/G01BQVRV|url-status=live}}<!-- Is it one of these (https://web.archive.org/web/20190419163948/http://www.sci.kz/~sairan/kazcode/ β check the tables 5.2.1)? --></ref> }} == External links == {{Wikibooks|Character Encodings/Code Tables}} * [http://www.ibm.com/software/globalization/cdra/glossary.jsp#SPTGLCDPG IBM CDRA glossary] * {{webarchive|url=https://web.archive.org/web/20160205110331/http://www-01.ibm.com/software/globalization/g11n-res.html|date=2016-02-05|title=IBM code pages}} * {{webarchive|url=https://web.archive.org/web/20090906204346/http://www-01.ibm.com/software/globalization/cp/cp_es.jsp|date=2009-09-06|title=IBM code pages by encoding scheme}} * [http://site.icu-project.org/charts/charset IBM/ICU Charset Information] * [https://docs.microsoft.com/en-us/windows/win32/intl/code-page-identifiers Microsoft Code Page Identifiers] (Microsoft's list contains only code pages actively used by normal apps on Windows. See also [https://web.archive.org/web/20040225223721/http://www.sharmahd.com/tm/codepages.html Torsten Mohrin's list] for the full list of supported code pages) * {{webarchive|url=https://web.archive.org/web/20121023050917/http://msdn.microsoft.com/en-us/goglobal/bb964654|date=2012-10-23|title=Shorter Microsoft list containing only the ANSI and OEM code pages but with links to more detail on each}} * [http://www.i18nguy.com/unicode/codepages.html Character Sets And Code Pages At The Push Of A Button] * [https://docs.microsoft.com/en-us/windows-server/administration/windows-commands/chcp Microsoft Chcp command: Display and set the console active code page] {{character encoding}} [[Category:Character encoding]]
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)
Pages transcluded onto the current version of this page
(
help
)
:
Template:Ambox
(
edit
)
Template:Anchor
(
edit
)
Template:Character encoding
(
edit
)
Template:Cite book
(
edit
)
Template:Cite web
(
edit
)
Template:Clarify
(
edit
)
Template:Div col
(
edit
)
Template:Div col end
(
edit
)
Template:Dunno
(
edit
)
Template:Incomplete list
(
edit
)
Template:Main
(
edit
)
Template:Main other
(
edit
)
Template:Mono
(
edit
)
Template:N/A
(
edit
)
Template:Ndash
(
edit
)
Template:Reflist
(
edit
)
Template:Section link
(
edit
)
Template:Short description
(
edit
)
Template:Sister project
(
edit
)
Template:Snd
(
edit
)
Template:Use dmy dates
(
edit
)
Template:Varies
(
edit
)
Template:Webarchive
(
edit
)
Template:Wikibooks
(
edit
)
Template:Yes
(
edit
)