Windows encodings

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
m
 
(2 intermediate revisions by one user not shown)
Line 2: Line 2:
 
|formattype=electronic
 
|formattype=electronic
 
|subcat=Character encoding
 
|subcat=Character encoding
 +
|thiscat=Windows encodings
 
}}
 
}}
 
'''Windows encodings''' (or '''Windows code pages''') refers to the various legacy character encodings used by the non-[[Unicode]] Microsoft Windows API, and most non-Unicode-aware Windows applications.
 
'''Windows encodings''' (or '''Windows code pages''') refers to the various legacy character encodings used by the non-[[Unicode]] Microsoft Windows API, and most non-Unicode-aware Windows applications.
Line 12: Line 13:
  
 
== List of encodings ==
 
== List of encodings ==
* [[Windows 1250]] (Central European)
+
* [[Windows 1250]] (Central European) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1250 code table]
* [[Windows 1251]] (Cyrillic)
+
* [[Windows 1251]] (Cyrillic) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1251 code table]
* [[Windows 1252]] (Western European; ISO 8859-1 plus additional characters)
+
* [[Windows 1252]] (Western European; ISO 8859-1 plus additional characters) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1252 code table]
* [[Windows 1253]] (Greek)
+
* [[Windows 1253]] (Greek) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1253 code table]
* [[Windows 1254]] (Turkish)
+
* [[Windows 1254]] (Turkish) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1254 code table]
* [[Windows 1255]] (Hebrew)
+
* [[Windows 1255]] (Hebrew) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1255 code table]
* [[Windows 1256]] (Arabic, Farsi, Urdu)
+
* [[Windows 1256]] (Arabic, Farsi, Urdu) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1256 code table]
* [[Windows 1257]] (Baltic Rim)
+
* [[Windows 1257]] (Baltic Rim) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1257 code table]
* [[Windows 1258]] (Vietnamese)
+
* [[Windows 1258]] (Vietnamese) - [http://www.kreativekorp.com/charset/encoding.php?name=CP1258 code table]
  
 
== Links ==
 
== Links ==
 
* [https://unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/ Unicode mappings]
 
* [https://unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/ Unicode mappings]
 +
* [https://msdn.microsoft.com/en-us/library/windows/desktop/dd317756(v=vs.85).aspx Windows Dev Center: Code Page Identifiers]
 
* [[Wikipedia: Windows code page]]
 
* [[Wikipedia: Windows code page]]
  
 
[[Category:Microsoft]]
 
[[Category:Microsoft]]
 
[[Category:Windows]]
 
[[Category:Windows]]

Latest revision as of 02:38, 21 May 2019

File Format
Name Windows encodings
Ontology

Windows encodings (or Windows code pages) refers to the various legacy character encodings used by the non-Unicode Microsoft Windows API, and most non-Unicode-aware Windows applications.

In many contexts, it means "whatever the user's default non-Unicode encoding happens to be", which is bad from a portability perspective. All too many file formats use one of these encodings, with no reliable way to determine which one.

The term is somewhat ambiguous. In the strictest sense, it refers to the so-called "ANSI" encodings such as Windows 1252, but it can also encompass many of the even-more-legacy MS-DOS encodings (a.k.a. "OEM" encodings) supported by Windows, such as CP437.

The native encoding of Windows NT-based systems is UTF-16 (or UCS-2 for very old systems), but that is usually not considered to be a "Windows encoding". Sufficiently modern versions of Windows even support UTF-8 as a "legacy" encoding, though it is preferable to use the Unicode API instead.

[edit] List of encodings

[edit] Links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox