DjVu
DjVu is a multi-layer raster image file format for digital documents. It was originally developed at AT&T Labs, and is commonly used in book digitization, for example by the Internet Archive.
DjVu documents may include a plain text layer (e.g. from OCR), as well as other data such as a document outline, so the format can serve some of the same purposes as PDF.
Format
Files have a 4-byte preamble ("AT&T"). The rest of the file uses IFF format, starting with a "FORM" chunk.
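As a rough illustration of this layout, the Python sketch below splits the first 16 bytes of a file into the preamble and the fields of the top-level IFF chunk. It assumes the usual IFF convention of a 4-byte chunk ID followed by a big-endian 32-bit length and a 4-byte form type. The function name `read_djvu_header` is hypothetical and not part of any DjVu library.

```python
import struct


def read_djvu_header(data: bytes):
    """Split the first 16 bytes into (preamble, chunk ID, chunk size, form type).

    Assumes the standard IFF layout after the 4-byte preamble:
    4-byte chunk ID, big-endian 32-bit length, 4-byte form type.
    """
    if len(data) < 16:
        raise ValueError("need at least 16 bytes")
    preamble = data[0:4]
    # ">" selects big-endian with no padding: 4 + 4 + 4 = 12 bytes.
    chunk_id, size, form_type = struct.unpack(">4sI4s", data[4:16])
    return preamble, chunk_id, size, form_type
```

For a well-formed file, the call would be expected to return `b"AT&T"` and `b"FORM"` as the first two values. The size field is returned as read and is not checked against the actual file length.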
Identifiers
The MIME type is image/vnd.djvu, but IANA may list it as image/vnd-djvu, apparently in error.
Identification
Files begin with ASCII characters "AT&TFORM".
At offset 12 should be a tag indicating the specific file type. For DjVu v3, the possibilities are "DJVM", "DJVU", "DJVI", and "THUM".
There is an extension of DjVu called Secure DjVu. Secure DjVu files begin with "SDJV".
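The checks described in this section can be sketched in Python as follows. This is a minimal example, not a full validator: it only looks at the signature bytes named above, and the names `identify_djvu` and `DJVU_FORM_TYPES` are hypothetical.

```python
from typing import Optional

# File-type tags listed above for DjVu v3, expected at offset 12.
DJVU_FORM_TYPES = {b"DJVM", b"DJVU", b"DJVI", b"THUM"}


def identify_djvu(head: bytes) -> Optional[str]:
    """Classify a file from its first 16 bytes, or return None if unrecognized."""
    # Secure DjVu files begin with "SDJV".
    if head[:4] == b"SDJV":
        return "Secure DjVu"
    # Regular DjVu files begin with "AT&TFORM", with the type tag at offset 12.
    if head[:8] == b"AT&TFORM" and head[12:16] in DJVU_FORM_TYPES:
        return "DjVu (" + head[12:16].decode("ascii") + ")"
    return None
```

A caller would typically read the first 16 bytes of a file opened in binary mode and pass them to `identify_djvu`. Input shorter than 16 bytes, or an unknown tag at offset 12, yields `None`.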
Specifications
- DjVu v3 Reference (requires DjVu plug-in)
- DjVu 1999-04-29 (v2) Reference (requires DjVu plug-in)
- Secure DjVu Specification (requires DjVu plug-in)
Software
- DjVuLibre: Viewers, tools, C++ reference library
- Viewers & Plug-ins
Sample files
- The Specifications documents listed above
- The DjVuLibre distributions include some DjVu files.