ANSEL
Template:Short description Template:Infobox character encoding ANSEL, the American National Standard for Extended Latin Alphabet Coded Character Set for Bibliographic Use, was a character set used in text encoding. It provided a table of coded values for the representation of characters of the extended Latin alphabet in machine-readable form for thirty-five languages written in the Latin alphabet and for fifty-one romanized languages. ANSEL adds 63 graphic characters to ASCII,<ref name=ANSEL>Template:Cite tech report</ref> including 29 combining diacritic characters.
The initial revision of ANSEL was released in 1985, and before 1993 it was registered as Registration #231 in the ISO International Register of Coded Character Sets to be Used with Escape Sequences.<ref name="ISO-IR #231">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> The standard was reaffirmed in 2003 although it has been administratively withdrawn by ANSI effective 14 February 2013.<ref name="Z39.47-1993">{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>
The requirement of hardware capable of overprinting accents doomed this from ever becoming a popular extended ASCII.Template:Citation needed
Code page layoutEdit
The following table shows ANSI/NISO Z39.47-1993 (R2003).<ref name="Z39.47-1993"/> Non-ASCII characters are shown with their Unicode code point. A combining diacritic precedes the spacing character on which it should be superimposed<ref name=ANSEL/> (in Unicode the combining diacritic is after the base character).
UseEdit
GEDCOMEdit
The GEDCOM specification for exchanging genealogical data refers to ANSEL (ANSI/NISO Z39.47-1985) as a valid text encoding for GEDCOM files and extends it with additional characters which are shown in the following table.<ref name="GEDCOM 5.5">Template:Cite tech report</ref><ref name="GEDCOM 5.3">Template:Cite tech report</ref>
Hex | Unicode | Glyph | Description |
---|---|---|---|
0xBE | 25A1 | □ | empty box |
0xBF | 25A0 | ■ | black box |
0xCD | 0065 | e | midline e |
0xCE | 006F | o | midline o |
0xCF | 00DF | ß | es zet |
0xFC | 0338 | ̸ | diacritic slash through char |
MARC21Edit
The Extended Latin character set from MARC 21 is synchronized with ANSEL<ref name="ISO-IR #231" /> but additionally supports the eszett (ß) character at C7 and the euro sign (€) at C8.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>