Windows encodings
From Just Solve the File Format Problem
(Difference between revisions)
(Created page with "{{FormatInfo |formattype=electronic |subcat=Character encoding }} '''Windows encodings''' refers to the various legacy character encodings used by the non-Unicode Microsof...") |
Revision as of 17:05, 25 January 2018
Windows encodings refers to the various legacy character encodings used by the non-Unicode Microsoft Windows API, and most non-Unicode-aware Windows applications. (GUI applications, anyway. Windows console applications default to using the MS-DOS encodings in many cases.)
In Windows jargon, these encodings are misleadingly called "ANSI", or in some cases (also somewhat misleadingly), "multi-byte" encodings.
All too many file formats use one of these encodings, with no reliable way to determine which one.
List of encodings
- Windows 1252 (ISO 8859-1 plus additional characters)
- Windows 1255 (Hebrew)
- Windows 1256 (Arabic, Farsi, Urdu)
- Windows 1257 (Baltic Rim)
- Windows 1258 (Vietnamese)