WACZ
From Just Solve the File Format Problem
(Difference between revisions)
(PRONOM) |
|||
Line 2: | Line 2: | ||
|subcat=Archiving | |subcat=Archiving | ||
|extensions={{ext|wacz}} | |extensions={{ext|wacz}} | ||
− | |pronom={{PRONOM| | + | |pronom={{PRONOM|fmt/1840}} |
|wikidata={{wikidata|Q104903124}} | |wikidata={{wikidata|Q104903124}} | ||
|mimetypes={{mimetype|application/x-wacz}} | |mimetypes={{mimetype|application/x-wacz}} | ||
Line 25: | Line 25: | ||
==Software== | ==Software== | ||
* https://github.com/webrecorder/py-wacz | * https://github.com/webrecorder/py-wacz | ||
+ | * https://github.com/bodleian/wacksy | ||
==References== | ==References== |
Latest revision as of 14:47, 30 September 2025
A Web Archive Collection Zipped[1][2] is a file format designed to package a standard WARC with accompanying metadata into a single file.[3]
[edit] Format Information
A WACZ file is a ZIP compressed format which can include:
Archive: sample.wacz Length Date Time Name --------- ---------- ----- ---- 77751 12-19-2023 10:27 pages/pages.jsonl 19477775 12-19-2023 10:27 archive/data.warc.gz 525986 12-19-2023 10:27 indexes/index.cdx 828 12-19-2023 10:27 datapackage.json 676 12-19-2023 10:27 datapackage-digest.json