BOCU-1

File Format
Name	BOCU-1
Ontology	Electronic File Formats Character encoding Unicode BOCU-1 ; ; ; ;
IANA charset	BOCU-1
IANA aliases	csBOCU1, csBOCU-1
IANA MIBenum	1020
Released	≤2001

Latest revision as of 02:34, 21 May 2019

BOCU-1 (Binary Ordered Compression for Unicode) is a Unicode character encoding. It is associated with the ICU software, and apparently owned and patented by IBM. It is designed for small size (especially when encoding code points that are mostly from the same language), compatibility with MIME "text" media types, and to have certain desirable sorting characteristics.

Despite its name, it does not have many of the attributes typical of a data compression format. It should probably be thought of simply as a Unicode encoding.

See also SCSU, to which it is sometimes compared.

[edit] Links

Unicode Technical Note #6 BOCU-1: MIME-Compatible Unicode Compression
Wikipedia article
ICU: BOCU-1
BOCU Draft 2001-05-30
A survey of Unicode compression: BOCU-1
BOCU-1 charset registration
ICU

@@ Line 2: / Line 2: @@
 |formattype=electronic
 |subcat=Character encoding
+|subcat2=Unicode
+|charset=BOCU-1
+|charsetaliases=csBOCU1, csBOCU-1
+|mibenum=1020
 |released=≤2001
 }}

BOCU-1

Latest revision as of 02:34, 21 May 2019

[edit] Links

Personal tools

Namespaces

Variants

Views

Actions

Search

Navigation

Toolbox