ISO 8859-1

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Add note on misuse of iso-8859-1 header for windows-1252)
 
(9 intermediate revisions by 3 users not shown)
Line 1: Line 1:
 
{{FormatInfo
 
{{FormatInfo
 
|formattype=electronic
 
|formattype=electronic
|subcat=Character Encodings
+
|subcat=Character encoding
 +
|subcat2=ISO 8859
 +
|released=1987
 +
|charset=ISO-8859-1
 +
|charsetaliases=iso-ir-100, ISO_8859-1, latin1, l1, IBM819, CP819, csISOLatin1
 +
|mibenum=4
 +
|codepage=819, 28591
 +
|cfstringencoding=513
 +
|nsstringencoding=5
 +
|wikidata={{wikidata|Q935289}}
 +
|released=1987
 
}}
 
}}
ISO 8859-1, aka Latin-1, is the most widely used encoding of the [[ISO 8859]] family for English-language text. [[Windows 1252]] encoding includes all the printing characters of ISO 8859-1 and includes additional printing characters by replacing some control characters of the C1 (128-159) range (and often web developers, e-mail software, and other people and programs that generate or transmit text, mistakenly identify Windows-1252-encoded documents as ISO-8859-1 in headers and parameters).
+
'''ISO 8859-1''', aka Latin-1, is the most widely used encoding of the [[ISO 8859]] family for text in English and other western European languages. [[Windows 1252]] encoding includes all the printing characters of ISO 8859-1 and includes additional printing characters by replacing some [[C1 controls|control characters of the C1 (128-159) range]] (and often web developers, e-mail software, and other people and programs that generate or transmit text, mistakenly identify Windows-1252-encoded documents as ISO-8859-1 in headers and parameters).
  
ISO 8859-1 was updated to [[ISO 8859-15]], called Latin-9, replacing some of the less used characters and adding the Euro (€) sign. However, those who have decided to update from Latin-1 have generally gone straight to Unicode, so Latin-9 has not become a widespread replacement. ISO [[8859-9]] or Latin-5, designed for use with Turkish scripts, also differs from Latin-1 in only a few code points.
+
ISO 8859-1 was updated to [[ISO 8859-15]], called Latin-9, replacing some of the less used characters and adding the Euro (€) sign. However, those who have decided to update from Latin-1 have generally gone straight to [[Unicode]], so Latin-9 has not become a widespread replacement. [[ISO 8859-9]] or Latin-5, designed for use with Turkish scripts, also differs from Latin-1 in only a few code points.
  
[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-1.kte Code table]
+
The Latin-1, Latin-9, and Windows 1252 encodings all incorporate the [[ASCII]] characters in the first 128 code points (0-127), including the [[C0 controls]].
 +
 
 +
== Links ==
 +
* [http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-1.kte Code table]
 +
* [[Wikipedia:ISO/IEC 8859-1|Wikipedia article]]

Latest revision as of 17:30, 25 May 2019

File Format
Name ISO 8859-1
Ontology
Wikidata ID Q935289
IANA charset ISO-8859-1
IANA aliases iso-ir-100, ISO_8859-1, latin1, l1, IBM819, CP819, csISOLatin1
IANA MIBenum 4
Code Page 819, 28591
CFStringEncoding 513
NSStringEncoding 5
Released 1987

ISO 8859-1, aka Latin-1, is the most widely used encoding of the ISO 8859 family for text in English and other western European languages. Windows 1252 encoding includes all the printing characters of ISO 8859-1 and includes additional printing characters by replacing some control characters of the C1 (128-159) range (and often web developers, e-mail software, and other people and programs that generate or transmit text, mistakenly identify Windows-1252-encoded documents as ISO-8859-1 in headers and parameters).

ISO 8859-1 was updated to ISO 8859-15, called Latin-9, replacing some of the less used characters and adding the Euro (€) sign. However, those who have decided to update from Latin-1 have generally gone straight to Unicode, so Latin-9 has not become a widespread replacement. ISO 8859-9 or Latin-5, designed for use with Turkish scripts, also differs from Latin-1 in only a few code points.

The Latin-1, Latin-9, and Windows 1252 encodings all incorporate the ASCII characters in the first 128 code points (0-127), including the C0 controls.

[edit] Links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox