Code page 858

Code page 858 (CCSID 858)[2] (also known as CP 858, IBM 00858, OEM 858[3]) is a code page used under DOS to write Western European languages.[4]

Code page 858
MIME / IANAIBM00858
Alias(es)CCSID00858, CP00858, PC-Multilingual-850+euro[1]
Transforms / EncodesISO 8859-1
Preceded byCode page 850

Similarly to code page 850, code page 858 supports the entire repertoire of ISO 8859-1, but in a different arrangement. Code page 858 was created from code page 850 in 1998 by changing code point 213 (D5hex) from dotless i (ı) (a character not included in ISO-8859-1) to the euro sign (€).[4] Unlike most code pages modified to support the euro sign, the generic currency sign at CFhex (an ISO-8859-1 character, changed to the euro sign in ISO-8859-15) was not chosen as the character to replace.[nb 1]

Still, instead of adding support for the new code page 858, IBM's PC DOS 2000, also released in 1998, changed the definition of the existing code page 850 to what IBM called modified code page 850 to include the euro sign at code point 213.[nb 2][5][6][7][8][9] More recent IBM/MS products implemented codepage 858 under its own ID.

Character set

The following table shows code page 858. Each character appears with its equivalent Unicode code-point. The decimal value of the location is the Alt code. Only the second half of the table (code points 128–255) is shown, the first half (code points 0–127) being the same as code page 850.

Code page 858[10][11][3]
0 1 2 3 4 5 6 7 8 9 A B C D E F
8x title="Alt+128 U+00C7 LATIN CAPITAL LETTER C WITH CEDILLA" style="padding:1px;"|Ç title="Alt+129 U+00FC LATIN SMALL LETTER U WITH DIAERESIS" style="padding:1px;"|ü title="Alt+130 U+00E9 LATIN SMALL LETTER E WITH ACUTE" style="padding:1px;"|é title="Alt+131 U+00E2 LATIN SMALL LETTER A WITH CIRCUMFLEX" style="padding:1px;"|â title="Alt+132 U+00E4 LATIN SMALL LETTER A WITH DIAERESIS" style="padding:1px;"|ä title="Alt+133 U+00E0 LATIN SMALL LETTER A WITH GRAVE" style="padding:1px;"|à title="Alt+134 U+00E5 LATIN SMALL LETTER A WITH RING ABOVE" style="padding:1px;"|å title="Alt+135 U+00E7 LATIN SMALL LETTER C WITH CEDILLA" style="padding:1px;"|ç title="Alt+136 U+00EA LATIN SMALL LETTER E WITH CIRCUMFLEX" style="padding:1px;"|ê title="Alt+137 U+00EB LATIN SMALL LETTER E WITH DIAERESIS" style="padding:1px;"|ë title="Alt+138 U+00E8 LATIN SMALL LETTER E WITH GRAVE" style="padding:1px;"|è title="Alt+139 U+00EF LATIN SMALL LETTER I WITH DIAERESIS" style="padding:1px;"|ï title="Alt+140 U+00EE LATIN SMALL LETTER I WITH CIRCUMFLEX" style="padding:1px;"|î title="Alt+141 U+00EC LATIN SMALL LETTER I WITH GRAVE" style="padding:1px;"|ì title="Alt+142 U+00C4 LATIN CAPITAL LETTER A WITH DIAERESIS" style="padding:1px;"|Ä title="Alt+143 U+00C5 LATIN CAPITAL LETTER A WITH RING ABOVE" style="padding:1px;"|Å
9x title="Alt+144 U+00C9 LATIN CAPITAL LETTER E WITH ACUTE" style="padding:1px;"|É title="Alt+145 U+00E6 LATIN SMALL LETTER AE" style="padding:1px;"|æ title="Alt+146 U+00C6 LATIN CAPITAL LETTER AE" style="padding:1px;"|Æ title="Alt+147 U+00F4 LATIN SMALL LETTER O WITH CIRCUMFLEX" style="padding:1px;"|ô title="Alt+148 U+00F6 LATIN SMALL LETTER O WITH DIAERESIS" style="padding:1px;"|ö title="Alt+149 U+00F2 LATIN SMALL LETTER O WITH GRAVE" style="padding:1px;"|ò title="Alt+150 U+00FB LATIN SMALL LETTER U WITH CIRCUMFLEX" style="padding:1px;"|û title="Alt+151 U+00F9 LATIN SMALL LETTER U WITH GRAVE" style="padding:1px;"|ù title="Alt+152 U+00FF LATIN SMALL LETTER Y WITH DIAERESIS" style="padding:1px;"|ÿ title="Alt+153 U+00D6 LATIN CAPITAL LETTER O WITH DIAERESIS" style="padding:1px;"|Ö title="Alt+154 U+00DC LATIN CAPITAL LETTER U WITH DIAERESIS" style="padding:1px;"|Ü title="Alt+155 U+00F8 LATIN SMALL LETTER O WITH STROKE" style="padding:1px;"|ø title="Alt+156 U+00A3 POUND SIGN" style="padding:1px;"|£ title="Alt+157 U+00D8 LATIN CAPITAL LETTER O WITH STROKE" style="padding:1px;"|Ø title="Alt+158 U+00D7 MULTIPLICATION SIGN" style="padding:1px;"|× title="Alt+159 U+0192 LATIN SMALL LETTER F WITH HOOK" style="padding:1px;"|ƒ
Ax title="Alt+160 U+00E1 LATIN SMALL LETTER A WITH ACUTE" style="padding:1px;"|á title="Alt+161 U+00ED LATIN SMALL LETTER I WITH ACUTE" style="padding:1px;"|í title="Alt+162 U+00F3 LATIN SMALL LETTER O WITH ACUTE" style="padding:1px;"|ó title="Alt+163 U+00FA LATIN SMALL LETTER U WITH ACUTE" style="padding:1px;"|ú title="Alt+164 U+00F1 LATIN SMALL LETTER N WITH TILDE" style="padding:1px;"|ñ title="Alt+165 U+00D1 LATIN CAPITAL LETTER N WITH TILDE" style="padding:1px;"|Ñ title="Alt+166 U+00AA FEMININE ORDINAL INDICATOR" style="padding:1px;"|ª title="Alt+167 U+00BA MASCULINE ORDINAL INDICATOR" style="padding:1px;"|º title="Alt+168 U+00BF INVERTED QUESTION MARK" style="padding:1px;"|¿ title="Alt+169 U+00AE REGISTERED SIGN" style="padding:1px;"|® title="Alt+170 U+00AC NOT SIGN" style="padding:1px;"|¬ title="Alt+171 U+00BD VULGAR FRACTION ONE HALF" style="padding:1px;"|½ title="Alt+172 U+00BC VULGAR FRACTION ONE QUARTER" style="padding:1px;"|¼ title="Alt+173 U+00A1 INVERTED EXCLAMATION MARK" style="padding:1px;"|¡ title="Alt+174 U+00AB LEFT-POINTING DOUBLE ANGLE QUOTATION MARK" style="padding:1px;"|« title="Alt+175 U+00BB RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK" style="padding:1px;"|»
Bx title="Alt+176 U+2591 LIGHT SHADE" style="padding:1px;"| title="Alt+177 U+2592 MEDIUM SHADE" style="padding:1px;"| title="Alt+178 U+2593 DARK SHADE" style="padding:1px;"| title="Alt+179 U+2502 BOX DRAWINGS LIGHT VERTICAL" style="padding:1px;"| title="Alt+180 U+2524 BOX DRAWINGS LIGHT VERTICAL AND LEFT" style="padding:1px;"| title="Alt+181 U+00C1 LATIN CAPITAL LETTER A WITH ACUTE" style="padding:1px;"|Á title="Alt+182 U+00C2 LATIN CAPITAL LETTER A WITH CIRCUMFLEX" style="padding:1px;"|Â title="Alt+183 U+00C0 LATIN CAPITAL LETTER A WITH GRAVE" style="padding:1px;"|À title="Alt+184 U+00A9 COPYRIGHT SIGN" style="padding:1px;"|© title="Alt+185 U+2563 BOX DRAWINGS DOUBLE VERTICAL AND LEFT" style="padding:1px;"| title="Alt+186 U+2551 BOX DRAWINGS DOUBLE VERTICAL" style="padding:1px;"| title="Alt+187 U+2557 BOX DRAWINGS DOUBLE DOWN AND LEFT" style="padding:1px;"| title="Alt+188 U+255D BOX DRAWINGS DOUBLE UP AND LEFT" style="padding:1px;"| title="Alt+189 U+00A2 CENT SIGN" style="padding:1px;"|¢ title="Alt+190 U+00A5 YEN SIGN" style="padding:1px;"|¥ title="Alt+191 U+2510 BOX DRAWINGS LIGHT DOWN AND LEFT" style="padding:1px;"|
Cx title="Alt+192 U+2514 BOX DRAWINGS LIGHT UP AND RIGHT" style="padding:1px;"| title="Alt+193 U+2534 BOX DRAWINGS LIGHT UP AND HORIZONTAL" style="padding:1px;"| title="Alt+194 U+252C BOX DRAWINGS LIGHT DOWN AND HORIZONTAL" style="padding:1px;"| title="Alt+195 U+251C BOX DRAWINGS LIGHT VERTICAL AND RIGHT" style="padding:1px;"| title="Alt+196 U+2500 BOX DRAWINGS LIGHT HORIZONTAL" style="padding:1px;"| title="Alt+197 U+253C BOX DRAWINGS LIGHT VERTICAL AND HORIZONTAL" style="padding:1px;"| title="Alt+198 U+00E3 LATIN SMALL LETTER A WITH TILDE" style="padding:1px;"|ã title="Alt+199 U+00C3 LATIN CAPITAL LETTER A WITH TILDE" style="padding:1px;"|Ã title="Alt+200 U+255A BOX DRAWINGS DOUBLE UP AND RIGHT" style="padding:1px;"| title="Alt+201 U+2554 BOX DRAWINGS DOUBLE DOWN AND RIGHT" style="padding:1px;"| title="Alt+202 U+2569 BOX DRAWINGS DOUBLE UP AND HORIZONTAL" style="padding:1px;"| title="Alt+203 U+2566 BOX DRAWINGS DOUBLE DOWN AND HORIZONTAL" style="padding:1px;"| title="Alt+204 U+2560 BOX DRAWINGS DOUBLE VERTICAL AND RIGHT" style="padding:1px;"| title="Alt+205 U+2550 BOX DRAWINGS DOUBLE HORIZONTAL" style="padding:1px;"| title="Alt+206 U+256C BOX DRAWINGS DOUBLE VERTICAL AND HORIZONTAL" style="padding:1px;"| title="Alt+207 U+00A4 CURRENCY SIGN" style="padding:1px;"|¤
Dx title="Alt+208 U+00F0 LATIN SMALL LETTER ETH" style="padding:1px;"|ð title="Alt+209 U+00D0 LATIN CAPITAL LETTER ETH" style="padding:1px;"|Ð title="Alt+210 U+00CA LATIN CAPITAL LETTER E WITH CIRCUMFLEX" style="padding:1px;"|Ê title="Alt+211 U+00CB LATIN CAPITAL LETTER E WITH DIAERESIS" style="padding:1px;"|Ë title="Alt+212 U+00C8 LATIN CAPITAL LETTER E WITH GRAVE" style="padding:1px;"|È title="Alt+213 U+20AC EURO SIGN" style="padding:1px;background:#FFD"| title="Alt+214 U+00CD LATIN CAPITAL LETTER I WITH ACUTE" style="padding:1px;"|Í title="Alt+215 U+00CE LATIN CAPITAL LETTER I WITH CIRCUMFLEX" style="padding:1px;"|Î title="Alt+216 U+00CF LATIN CAPITAL LETTER I WITH DIAERESIS" style="padding:1px;"|Ï title="Alt+217 U+2518 BOX DRAWINGS LIGHT UP AND LEFT" style="padding:1px;"| title="Alt+218 U+250C BOX DRAWINGS LIGHT DOWN AND RIGHT" style="padding:1px;"| title="Alt+219 U+2588 FULL BLOCK" style="padding:1px;"| title="Alt+220 U+2584 LOWER HALF BLOCK" style="padding:1px;"| title="Alt+221 U+00A6 BROKEN BAR" style="padding:1px;"|¦ title="Alt+222 U+00CC LATIN CAPITAL LETTER I WITH GRAVE" style="padding:1px;"|Ì title="Alt+223 U+2580 UPPER HALF BLOCK" style="padding:1px;"|
Ex title="Alt+224 U+00D3 LATIN CAPITAL LETTER O WITH ACUTE" style="padding:1px;"|Ó title="Alt+225 U+00DF LATIN SMALL LETTER SHARP S" style="padding:1px;"|ß title="Alt+226 U+00D4 LATIN CAPITAL LETTER O WITH CIRCUMFLEX" style="padding:1px;"|Ô title="Alt+227 U+00D2 LATIN CAPITAL LETTER O WITH GRAVE" style="padding:1px;"|Ò title="Alt+228 U+00F5 LATIN SMALL LETTER O WITH TILDE" style="padding:1px;"|õ title="Alt+229 U+00D5 LATIN CAPITAL LETTER O WITH TILDE" style="padding:1px;"|Õ title="Alt+230 U+00B5 MICRO SIGN" style="padding:1px;"|µ title="Alt+231 U+00FE LATIN SMALL LETTER THORN" style="padding:1px;"|þ title="Alt+232 U+00DE LATIN CAPITAL LETTER THORN" style="padding:1px;"|Þ title="Alt+233 U+00DA LATIN CAPITAL LETTER U WITH ACUTE" style="padding:1px;"|Ú title="Alt+234 U+00DB LATIN CAPITAL LETTER U WITH CIRCUMFLEX" style="padding:1px;"|Û title="Alt+235 U+00D9 LATIN CAPITAL LETTER U WITH GRAVE" style="padding:1px;"|Ù title="Alt+236 U+00FD LATIN SMALL LETTER Y WITH ACUTE" style="padding:1px;"|ý title="Alt+237 U+00DD LATIN CAPITAL LETTER Y WITH ACUTE" style="padding:1px;"|Ý title="Alt+238 U+00AF MACRON" style="padding:1px;"|¯ title="Alt+239 U+00B4 ACUTE ACCENT" style="padding:1px;"|´
Fx title="Alt+240 U+00AD SOFT HYPHEN" style="font-size:75%;padding:1px;"|SHY title="Alt+241 U+00B1 PLUS-MINUS SIGN" style="padding:1px;"|± title="Alt+242 U+2017 DOUBLE LOW LINE" style="padding:1px;"| title="Alt+243 U+00BE VULGAR FRACTION THREE QUARTERS" style="padding:1px;"|¾ title="Alt+244 U+00B6 PILCROW SIGN" style="padding:1px;"| title="Alt+245 U+00A7 SECTION SIGN" style="padding:1px;"|§ title="Alt+246 U+00F7 DIVISION SIGN" style="padding:1px;"|÷ title="Alt+247 U+00B8 CEDILLA" style="padding:1px;"|¸ title="Alt+248 U+00B0 DEGREE SIGN" style="padding:1px;"|° title="Alt+249 U+00A8 DIAERESIS" style="padding:1px;"|¨ title="Alt+250 U+00B7 MIDDLE DOT" style="padding:1px;"|· title="Alt+251 U+00B9 SUPERSCRIPT ONE" style="padding:1px;"|¹ title="Alt+252 U+00B3 SUPERSCRIPT THREE" style="padding:1px;"|³ title="Alt+253 U+00B2 SUPERSCRIPT TWO" style="padding:1px;"|² title="Alt+254 U+25A0 BLACK SQUARE" style="padding:1px;"| title="Alt+255 U+00A0 NO-BREAK SPACE" style="font-size:75%;padding:1px;"|NBSP
  Differences from code page 850

Notes

  1. As well as ISO-8859-1 versus -15, compare code pages 808 (from 866), 848 (from 1125), 849 (from 1131) and 872 (from 855), ISO-IR-205 (from ISO-8859-4), ISO-IR-206 (from ISO-8859-13), and the changes to MacRoman and MacCyrillic.
  2. The reason for this might have been down to existing restrictions in the implementation of the codepage switching logic under MS-DOS/PC DOS, which limited .CPI files to 64 KB in size or about six codepages maximum, a limitation, which was circumvented in some OEM versions of MS-DOS, in Windows NT, and also does not exist in DR-DOS. Further, the parser in MS-DOS/PC DOS limits the number of possible country / codepage entries in COUNTRY.SYS files to a maximum of 146 or 438, a limitation non-existent in DR-DOS. So, adding support for codepage 858 might have meant to drop another (e.g. codepage 850) at the same time, which might not have been a viable solution at that time, given that some applications were hard-wired to use codepage 850.

References

  1. "Character Sets". Internet Assigned Numbers Authority (IANA). 2018-12-12. Archived from the original on 2022-01-28. Retrieved 2022-01-28.
  2. "CCSID 858 information document". IBM. Archived from the original on 2016-03-27.
  3. "OEM 858". Go Global Developer Center. Microsoft. Archived from the original on 2016-03-08. Retrieved 2011-11-23.
  4. "Code page 858 information document". IBM. Archived from the original on 2016-08-20.
  5. Paul, Matthias R. (2001-08-15). "Changing codepages in FreeDOS" (Technical design specification). Archived from the original on 2016-08-20. Retrieved 2013-05-08. The new official ID for the Multilingual "codepage 850 with EURO SIGN" is 858, not 850. IBM will switch to use 858 instead of their 850 variant with future issues of their products. […] I can only guess why they didn't add 858 to their EGAx.CPI, COUNTRY.SYS, and KEYBOARD.SYS files in PC DOS 2000. Many third-party applications are designed to work with 850 and didn't know about 858 at the time PC DOS 2000 was released, so it's easier for everyone, but unfortunately it's not compatible. […] As explained above, COUNTRY.SYS and KEYBOARD.SYS contain only two codepage entries for a given country in Western issues of DOS. (In Arabic and Hebrew issues there can be up to 8 codepages for one country, in theory there is no limit below the range of allowed codepages 1..65534). […] The problem is that removing support for 850 might have caused compatibility problems with applications which are hard-wired to use 850. Adding 858 as a third choice to all the files would have increased the file and table sizes significantly. The COUNTRY.SYS file parser in MS-DOS/PC DOS IO.SYS/IBMBIO.COM sets aside a 6 Kb (for DOS 6) scratchpad to load all the info. This allows a maximum of 438 entries in a COUNTRY.SYS file to be accepted, otherwise you will get the message "COUNTRY.SYS too large.". The NLSFUNC parser does not have this limitation, and the file parsers in DR-DOS (kernel and NLSFUNC) also do not know of such a restriction. Older issues of MS-DOS/PC DOS even had a 2 Kb buffer for a maximum of 146 entries.
  6. Paul, Matthias R. (2001-06-10) [1995]. "DOS COUNTRY.SYS file format" (COUNTRY.LST file) (1.44 ed.). Archived from the original on 2016-04-20. Retrieved 2016-08-20.
  7. Starikov, Yuri (2005-04-11). "15-летию Russian MS-DOS 4.01 посвящается" [15 Years of Russian MS-DOS 4.01] (in Russian). Archived from the original on 2016-08-20. Retrieved 2014-05-07.
  8. Paul, Matthias R. (2001-08-27). "Changing codepages in FreeDOS (follow-up)". Archived from the original on 2014-10-01. Retrieved 2013-05-08. […] one could also create custom .CPI files in the traditional FONT style without difficulties, but you could only store up to […] six codepages in such a file if it should be usable by MS-DOS/PC DOS (some OEM issues and NT can handle files larger than 64 Kb, but MS-DOS/PC DOS can not). (NB. Based on fd-dev post .)
  9. Paul, Matthias R. (2001-06-10) [1995]. "Format description of DOS, OS/2, and Windows NT .CPI, and Linux .CP files" (CPI.LST file) (1.30 ed.). Archived from the original on 2016-04-20. Retrieved 2016-08-20.
  10. "Code Page CPGID 00858" (PDF). IBM. Retrieved 2022-01-28.
  11. "Code Page CPGID 00858" (text). IBM. Retrieved 2022-01-28.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.