DjVu
From Just Solve the File Format Problem
Revision as of 16:44, 29 November 2014
DjVu is a multi-layer raster image file format for digital documents. It was originally developed at AT&T Labs, and is commonly used in book digitization, for example by the Internet Archive.
DjVu documents may include a plain text layer (e.g. from OCR), as well as other data such as a document outline, so the format can serve some of the same purposes as PDF.
Format
Files have a 4-byte preamble. The rest of the file uses IFF format.
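As a rough illustration of that layout, the sketch below skips the 4-byte preamble and lists the child chunks of the top-level container. It assumes the usual IFF conventions (a 4-byte chunk ID, a 4-byte big-endian length, and one pad byte after odd-length chunk data). The function name `list_chunks` and the chunk IDs in the sample are made up for this example, and the sample bytes are synthetic rather than a real DjVu file.

```python
import struct

def list_chunks(data: bytes):
    """Return (form_type, [(chunk_id, length), ...]) for an in-memory file.

    Assumed layout: 4-byte preamble, then an IFF "FORM" chunk whose data
    starts with a 4-byte type tag followed by the child chunks.
    """
    if len(data) < 16 or data[4:8] != b"FORM":
        raise ValueError("no FORM chunk after the 4-byte preamble")
    (form_len,) = struct.unpack(">I", data[8:12])    # big-endian length
    form_type = data[12:16].decode("ascii", "replace")
    end = min(12 + form_len, len(data))              # FORM data starts at offset 12
    pos = 16                                         # first child chunk
    chunks = []
    while pos + 8 <= end:
        chunk_id = data[pos:pos + 4].decode("ascii", "replace")
        (length,) = struct.unpack(">I", data[pos + 4:pos + 8])
        chunks.append((chunk_id, length))
        pos += 8 + length + (length & 1)             # skip header, data, pad byte
    return form_type, chunks

# Synthetic sample with two placeholder chunks ("AAAA" and "BBBB").
body = (b"DJVU"
        + b"AAAA" + struct.pack(">I", 3) + b"abc" + b"\x00"
        + b"BBBB" + struct.pack(">I", 2) + b"de")
sample = b"AT&T" + b"FORM" + struct.pack(">I", len(body)) + body
print(list_chunks(sample))
```

Nested containers are not descended into here; a fuller reader would recurse whenever a child chunk is itself a `FORM`.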
Identification
Files begin with the ASCII characters "AT&TFORM".
At offset 12 there should be a 4-byte tag indicating the specific file type. For DjVu v3, the possibilities are "DJVM", "DJVU", "DJVI", and "THUM".
There is an extension of DjVu called Secure DjVu. Secure DjVu files begin with "SDJV".
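The identification rules in this section can be sketched as a small check on the first 16 bytes of a file. The function name `identify_djvu` and the strings it returns are hypothetical labels chosen for this example; only the signatures and the offset come from the text above.

```python
# Type tags listed for DjVu v3.
DJVU_FORM_TYPES = {b"DJVM", b"DJVU", b"DJVI", b"THUM"}

def identify_djvu(header: bytes) -> str:
    """Classify a file from its first 16 bytes (labels are illustrative)."""
    if header.startswith(b"SDJV"):
        return "Secure DjVu"
    if header.startswith(b"AT&TFORM"):
        form_type = header[12:16]          # tag at offset 12
        if form_type in DJVU_FORM_TYPES:
            return "DjVu (" + form_type.decode("ascii") + ")"
        return "AT&T IFF container, unrecognized type"
    return "not DjVu"

# Typical use (the path is a placeholder):
#   with open("example.djvu", "rb") as f:
#       print(identify_djvu(f.read(16)))
```

Reading only 16 bytes keeps the check cheap, but it confirms the signature only and says nothing about whether the rest of the file is well formed.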
Specifications
- DjVu v3 Reference (requires DjVu plug-in)
- DjVu 1999-04-29 (v2) Reference (requires DjVu plug-in)
- Secure DjVu Specification (requires DjVu plug-in)
Software
- DjVuLibre: Viewers, tools, C++ reference library
- Viewers & Plug-ins
Sample files
- The Specifications documents listed above
- The DjVuLibre distributions include some DjVu files.