WACZ
From Just Solve the File Format Problem
(Difference between revisions)
(PRONOM) |
|||
| Line 2: | Line 2: | ||
|subcat=Archiving | |subcat=Archiving | ||
|extensions={{ext|wacz}} | |extensions={{ext|wacz}} | ||
| − | |pronom={{PRONOM| | + | |pronom={{PRONOM|fmt/1840}} |
|wikidata={{wikidata|Q104903124}} | |wikidata={{wikidata|Q104903124}} | ||
|mimetypes={{mimetype|application/x-wacz}} | |mimetypes={{mimetype|application/x-wacz}} | ||
| Line 25: | Line 25: | ||
==Software== | ==Software== | ||
* https://github.com/webrecorder/py-wacz | * https://github.com/webrecorder/py-wacz | ||
| + | * https://github.com/bodleian/wacksy | ||
==References== | ==References== | ||
Latest revision as of 14:47, 30 September 2025
A Web Archive Collection Zipped[1][2] is a file format designed to package a standard WARC with accompanying metadata into a single file.[3]
[edit] Format Information
A WACZ file is a ZIP compressed format which can include:
Archive: sample.wacz
Length Date Time Name
--------- ---------- ----- ----
77751 12-19-2023 10:27 pages/pages.jsonl
19477775 12-19-2023 10:27 archive/data.warc.gz
525986 12-19-2023 10:27 indexes/index.cdx
828 12-19-2023 10:27 datapackage.json
676 12-19-2023 10:27 datapackage-digest.json