Windows encodings
From Just Solve the File Format Problem
(Difference between revisions)
(Created page with "{{FormatInfo |formattype=electronic |subcat=Character encoding }} '''Windows encodings''' refers to the various legacy character encodings used by the non-Unicode Microsof...") |
|||
Line 15: | Line 15: | ||
* [[Windows 1257]] (Baltic Rim) | * [[Windows 1257]] (Baltic Rim) | ||
* [[Windows 1258]] (Vietnamese) | * [[Windows 1258]] (Vietnamese) | ||
+ | |||
+ | == Links == | ||
+ | * [https://unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/ Unicode mappings] | ||
[[Category:Microsoft]] | [[Category:Microsoft]] | ||
[[Category:Windows]] | [[Category:Windows]] |
Revision as of 17:10, 25 January 2018
Windows encodings refers to the various legacy character encodings used by the non-Unicode Microsoft Windows API, and most non-Unicode-aware Windows applications. (GUI applications, anyway. Windows console applications default to using the MS-DOS encodings in many cases.)
In Windows jargon, these encodings are misleadingly called "ANSI", or in some cases (also somewhat misleadingly), "multi-byte" encodings.
All too many file formats use one of these encodings, with no reliable way to determine which one.
List of encodings
- Windows 1252 (ISO 8859-1 plus additional characters)
- Windows 1255 (Hebrew)
- Windows 1256 (Arabic, Farsi, Urdu)
- Windows 1257 (Baltic Rim)
- Windows 1258 (Vietnamese)