From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
Line 2: Line 2:
|subcat=Character encoding
|subcat=Character encoding
|charsetaliases=csBOCU1, csBOCU-1
|charsetaliases=csBOCU1, csBOCU-1

Latest revision as of 02:34, 21 May 2019

File Format
Name BOCU-1
IANA charset BOCU-1
IANA aliases csBOCU1, csBOCU-1
IANA MIBenum 1020
Released ≤2001
BOCU-1 (Binary Ordered Compression for Unicode) is a Unicode character encoding. It is associated with the ICU software, and apparently owned and patented by IBM. It is designed for small size (especially when encoding code points that are mostly from the same language), compatibility with MIME "text" media types, and to have certain desirable sorting characteristics.

Despite its name, it does not have many of the attributes typical of a data compression format. It should probably be thought of simply as a Unicode encoding.

See also SCSU, to which it is sometimes compared.

[edit] Links

Personal tools