ISO 8859-1

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Add note on misuse of iso-8859-1 header for windows-1252)
Line 3: Line 3:
 
|subcat=Character Encodings
 
|subcat=Character Encodings
 
}}
 
}}
ISO 8859-1, aka Latin-1, is the most widely used encoding of the [[ISO 8859]] family for English-language text. [[Windows 1252]] encoding includes all the printing characters of ISO 8859-1 and includes additional printing characters by replacing some control characters of the C1 (128-159) range (and often web developers, e-mail software, and other people and programs that generate or transmit text, mistakenly identify Windows-1252-encoded documents as ISO-8859-1 in headers and parameters).
+
ISO 8859-1, aka Latin-1, is the most widely used encoding of the [[ISO 8859]] family for English-language text. [[Windows 1252]] encoding includes all the printing characters of ISO 8859-1 and includes additional printing characters by replacing some [[C1 controls|control characters of the C1 (128-159) range]] (and often web developers, e-mail software, and other people and programs that generate or transmit text, mistakenly identify Windows-1252-encoded documents as ISO-8859-1 in headers and parameters).
  
 
ISO 8859-1 was updated to [[ISO 8859-15]], called Latin-9, replacing some of the less used characters and adding the Euro (€) sign. However, those who have decided to update from Latin-1 have generally gone straight to Unicode, so Latin-9 has not become a widespread replacement. ISO [[8859-9]] or Latin-5, designed for use with Turkish scripts, also differs from Latin-1 in only a few code points.
 
ISO 8859-1 was updated to [[ISO 8859-15]], called Latin-9, replacing some of the less used characters and adding the Euro (€) sign. However, those who have decided to update from Latin-1 have generally gone straight to Unicode, so Latin-9 has not become a widespread replacement. ISO [[8859-9]] or Latin-5, designed for use with Turkish scripts, also differs from Latin-1 in only a few code points.
 +
 +
The Latin-1, Latin-9, and Windows 1252 encodings all incorporate the [[ASCII]] characters in the first 128 code points (0-127), including the [[C0 controls]].
  
 
[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-1.kte Code table]
 
[http://www.kreativekorp.com/charset/encoding.php?file=iso-8859-1.kte Code table]

Revision as of 13:49, 27 November 2012

File Format
Name ISO 8859-1
Ontology

ISO 8859-1, aka Latin-1, is the most widely used encoding of the ISO 8859 family for English-language text. Windows 1252 encoding includes all the printing characters of ISO 8859-1 and includes additional printing characters by replacing some control characters of the C1 (128-159) range (and often web developers, e-mail software, and other people and programs that generate or transmit text, mistakenly identify Windows-1252-encoded documents as ISO-8859-1 in headers and parameters).

ISO 8859-1 was updated to ISO 8859-15, called Latin-9, replacing some of the less used characters and adding the Euro (€) sign. However, those who have decided to update from Latin-1 have generally gone straight to Unicode, so Latin-9 has not become a widespread replacement. ISO 8859-9 or Latin-5, designed for use with Turkish scripts, also differs from Latin-1 in only a few code points.

The Latin-1, Latin-9, and Windows 1252 encodings all incorporate the ASCII characters in the first 128 code points (0-127), including the C0 controls.

Code table

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox