BOCU-1
From Just Solve the File Format Problem
(Difference between revisions)
(Created page with "{{FormatInfo |formattype=electronic |subcat=Character encoding |released=≤2001 }} BOCU-1 ('''Binary Ordered Compression for Unicode''') is a Unicode character encodi...") |
Dan Tobias (Talk | contribs) |
||
(One intermediate revision by one user not shown) | |||
Line 2: | Line 2: | ||
|formattype=electronic | |formattype=electronic | ||
|subcat=Character encoding | |subcat=Character encoding | ||
+ | |subcat2=Unicode | ||
+ | |charset=BOCU-1 | ||
+ | |charsetaliases=csBOCU1, csBOCU-1 | ||
+ | |mibenum=1020 | ||
|released=≤2001 | |released=≤2001 | ||
}} | }} |
Latest revision as of 02:34, 21 May 2019
BOCU-1 (Binary Ordered Compression for Unicode) is a Unicode character encoding. It is associated with the ICU software, and apparently owned and patented by IBM. It is designed for small size (especially when encoding code points that are mostly from the same language), compatibility with MIME "text" media types, and to have certain desirable sorting characteristics.Despite its name, it does not have many of the attributes typical of a data compression format. It should probably be thought of simply as a Unicode encoding.
See also SCSU, to which it is sometimes compared.