WACZ
From Just Solve the File Format Problem
(Difference between revisions)
(Created page with "{{FormatInfo |subcat=Archiving |extensions={{ext|wacz}} |pronom={{PRONOM|application/warc}} |wikidata={{wikidata|Q104903124}} |mimetypes={{mimetype|application/x-wacz}} |locfd...") |
|||
Line 22: | Line 22: | ||
676 12-19-2023 10:27 datapackage-digest.json | 676 12-19-2023 10:27 datapackage-digest.json | ||
</pre> | </pre> | ||
+ | |||
+ | ==Software== | ||
+ | * https://github.com/webrecorder/py-wacz | ||
==References== | ==References== |
Latest revision as of 22:41, 19 December 2023
A Web Archive Collection Zipped[1][2] is a file format designed to package a standard WARC with accompanying metadata into a single file.[3]
[edit] Format Information
A WACZ file is a ZIP compressed format which can include:
Archive: sample.wacz Length Date Time Name --------- ---------- ----- ---- 77751 12-19-2023 10:27 pages/pages.jsonl 19477775 12-19-2023 10:27 archive/data.warc.gz 525986 12-19-2023 10:27 indexes/index.cdx 828 12-19-2023 10:27 datapackage.json 676 12-19-2023 10:27 datapackage-digest.json