WACZ

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(PRONOM)
 
Line 2: Line 2:
 
|subcat=Archiving
 
|subcat=Archiving
 
|extensions={{ext|wacz}}
 
|extensions={{ext|wacz}}
|pronom={{PRONOM|application/warc}}
+
|pronom={{PRONOM|fmt/1840}}
 
|wikidata={{wikidata|Q104903124}}
 
|wikidata={{wikidata|Q104903124}}
 
|mimetypes={{mimetype|application/x-wacz}}
 
|mimetypes={{mimetype|application/x-wacz}}
Line 25: Line 25:
 
==Software==
 
==Software==
 
* https://github.com/webrecorder/py-wacz
 
* https://github.com/webrecorder/py-wacz
 +
* https://github.com/bodleian/wacksy
  
 
==References==
 
==References==

Latest revision as of 14:47, 30 September 2025

File Format
Name WACZ
Ontology
Extension(s) .wacz
MIME Type(s) application/x-wacz
LoCFDD fdd000586
PRONOM fmt/1840
Wikidata ID Q104903124

A Web Archive Collection Zipped[1][2] is a file format designed to package a standard WARC with accompanying metadata into a single file.[3]

[edit] Format Information

A WACZ file is a ZIP compressed format which can include:

Archive:  sample.wacz
  Length      Date    Time    Name
---------  ---------- -----   ----
    77751  12-19-2023 10:27   pages/pages.jsonl
 19477775  12-19-2023 10:27   archive/data.warc.gz
   525986  12-19-2023 10:27   indexes/index.cdx
      828  12-19-2023 10:27   datapackage.json
      676  12-19-2023 10:27   datapackage-digest.json

[edit] Software

[edit] References

  1. https://specs.webrecorder.net/wacz/latest/
  2. https://webrecorder.net/2023/05/03/an-update-on-wacz.html
  3. https://replayweb.page/docs/wacz-format
Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox