Windows encodings
From Just Solve the File Format Problem
Windows encodings refers to the various legacy character encodings used by the non-Unicode Microsoft Windows API, and most non-Unicode-aware Windows applications. (GUI applications, anyway. Windows console applications default to using the MS-DOS encodings in many cases.)
In Windows jargon, these encodings are misleadingly called "ANSI", or in some cases (also somewhat misleadingly), "multi-byte" encodings.
All too many file formats use one of these encodings, with no reliable way to determine which one.
List of encodings
- Windows 1252 (ISO 8859-1 plus additional characters)
- Windows 1255 (Hebrew)
- Windows 1256 (Arabic, Farsi, Urdu)
- Windows 1257 (Baltic Rim)
- Windows 1258 (Vietnamese)