ISO 8859
From Just Solve the File Format Problem
(Difference between revisions)
Dan Tobias (Talk | contribs) |
(Code table links) |
||
Line 11: | Line 11: | ||
ISO/IEC 8859 is a family of 8-bit character encodings that are extensions of ASCII. All of them reserve code values 80 through 9F (128 through 159 decimal) for control characters, though a lot of websites and e-mail programs bogusly indicate one of these encodings for content that actually uses printable characters in those positions via some proprietary system-specific encoding (usually Windows). The ISO 8859 encodings are the following: | ISO/IEC 8859 is a family of 8-bit character encodings that are extensions of ASCII. All of them reserve code values 80 through 9F (128 through 159 decimal) for control characters, though a lot of websites and e-mail programs bogusly indicate one of these encodings for content that actually uses printable characters in those positions via some proprietary system-specific encoding (usually Windows). The ISO 8859 encodings are the following: | ||
− | * ISO 8859-1: Latin-1, used for western European languages including English | + | * ISO 8859-1: Latin-1, used for western European languages including English [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-1.kte Code table]] |
− | * ISO 8859-2: Latin-2, used for central and eastern European languages | + | * ISO 8859-2: Latin-2, used for central and eastern European languages [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-2.kte Code table]] |
− | * ISO 8859-3: Latin-3, for Esperanto, Galician, Maltese, and Turkish | + | * ISO 8859-3: Latin-3, for Esperanto, Galician, Maltese, and Turkish [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-3.kte Code table]] |
− | * ISO 8859-4: Latin-4, for Scandinavian and Baltic languages | + | * ISO 8859-4: Latin-4, for Scandinavian and Baltic languages [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-4.kte Code table]] |
− | * ISO 8859-5: Cyrillic characters | + | * ISO 8859-5: Cyrillic characters [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-5.kte Code table]] |
− | * ISO 8859-6: Arabic characters | + | * ISO 8859-6: Arabic characters [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-6.kte Code table]] |
− | * ISO 8859-7: Modern Greek | + | * ISO 8859-7: Modern Greek [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-7.kte Code table]] |
− | * ISO 8859-8: Hebrew | + | * ISO 8859-8: Hebrew [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-8.kte Code table]] |
− | * ISO 8859-9: Latin-5, Turkish variant of Latin-1 | + | * ISO 8859-9: Latin-5, Turkish variant of Latin-1 [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-9.kte Code table]] |
− | * ISO 8859-10: Latin-6, for Lappish, Nordic, and Inuit languages | + | * ISO 8859-10: Latin-6, for Lappish, Nordic, and Inuit languages [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-10.kte Code table]] |
− | * ISO 8859-11: Thai | + | * ISO 8859-11: Thai [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-11.kte Code table]] |
* (ISO 8859-12 was abandoned) | * (ISO 8859-12 was abandoned) | ||
− | * ISO 8859-13: Latin-7, Baltic Rim languages | + | * ISO 8859-13: Latin-7, Baltic Rim languages [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-13.kte Code table]] |
− | * ISO 8859-14: Latin-8, Celtic | + | * ISO 8859-14: Latin-8, Celtic [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-14.kte Code table]] |
− | * ISO 8859-15: Latin-9, Revision of Latin-1, includes Euro sign | + | * ISO 8859-15: Latin-9, Revision of Latin-1, includes Euro sign [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-15.kte Code table]] |
− | * ISO 8859-16: Latin-10, Romanian | + | * ISO 8859-16: Latin-10, Romanian [[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-16.kte Code table]] |
[[ftp://ftp.unicode.org/Public/MAPPINGS/ISO8859/ ISO 8859 to Unicode mapping tables]] | [[ftp://ftp.unicode.org/Public/MAPPINGS/ISO8859/ ISO 8859 to Unicode mapping tables]] |
Revision as of 00:40, 10 November 2012
ISO/IEC 8859 is a family of 8-bit character encodings that extend ASCII. All of them reserve code values 80 through 9F hexadecimal (128 through 159 decimal) for control characters. Many websites and e-mail programs nevertheless declare one of these encodings for content that actually uses printable characters in those positions, because the content is really in a proprietary, system-specific encoding (usually a Windows code page such as Windows-1252). The ISO 8859 encodings are the following:
- ISO 8859-1: Latin-1, used for western European languages including English [Code table]
- ISO 8859-2: Latin-2, used for central and eastern European languages [Code table]
- ISO 8859-3: Latin-3, for Esperanto, Galician, Maltese, and Turkish [Code table]
- ISO 8859-4: Latin-4, for Scandinavian and Baltic languages [Code table]
- ISO 8859-5: Cyrillic characters [Code table]
- ISO 8859-6: Arabic characters [Code table]
- ISO 8859-7: Modern Greek [Code table]
- ISO 8859-8: Hebrew [Code table]
- ISO 8859-9: Latin-5, Turkish variant of Latin-1 [Code table]
- ISO 8859-10: Latin-6, for Lappish, Nordic, and Inuit languages [Code table]
- ISO 8859-11: Thai [Code table]
- (ISO 8859-12 was abandoned)
- ISO 8859-13: Latin-7, Baltic Rim languages [Code table]
- ISO 8859-14: Latin-8, Celtic [Code table]
- ISO 8859-15: Latin-9, a revision of Latin-1 that includes the Euro sign [Code table]
- ISO 8859-16: Latin-10, Romanian [Code table]
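
The mislabeling problem described above, and the difference between Latin-1 and Latin-9, can be illustrated with a short Python sketch. It relies only on the codecs bundled with Python's standard library; the helper names `decode_byte`, `is_c1_control`, and `describe` are made up for this illustration, and Windows-1252 (`cp1252`) is assumed as the "proprietary" encoding because it is the usual culprit.

```python
import unicodedata


def decode_byte(value: int, encoding: str) -> str:
    """Decode a single byte value with the named codec."""
    return bytes([value]).decode(encoding)


def is_c1_control(ch: str) -> bool:
    """Return True if ch lies in the C1 control range U+0080..U+009F."""
    return 0x80 <= ord(ch) <= 0x9F


def describe(ch: str) -> str:
    """Return the code point and Unicode name (controls have no name)."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch, '<control>')}"


# Byte 0x93 is a control character in ISO 8859-1, but a curly quote
# in Windows-1252. Content labeled ISO 8859-1 that contains it is
# almost always Windows-1252 in disguise.
as_latin1 = decode_byte(0x93, "iso-8859-1")
as_cp1252 = decode_byte(0x93, "cp1252")
print(describe(as_latin1))
print(describe(as_cp1252))

# Every byte from 0x80 to 0x9F decodes to a C1 control under ISO 8859-1.
all_controls = all(
    is_c1_control(decode_byte(b, "iso-8859-1")) for b in range(0x80, 0xA0)
)
print(all_controls)

# Byte 0xA4 shows the Latin-1 / Latin-9 difference: the generic
# currency sign in ISO 8859-1 was replaced by the Euro sign in ISO 8859-15.
print(describe(decode_byte(0xA4, "iso-8859-1")))
print(describe(decode_byte(0xA4, "iso-8859-15")))
```

A decoder that finds bytes in the 80 through 9F range in text labeled as ISO 8859-1 could use a check like `is_c1_control` as a hint to retry with Windows-1252, though that is a heuristic rather than anything the standard prescribes.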