WACZ
From Just Solve the File Format Problem
				
								
				(Difference between revisions)
				
																
				
				
								
				 (Created page with "{{FormatInfo |subcat=Archiving |extensions={{ext|wacz}} |pronom={{PRONOM|application/warc}} |wikidata={{wikidata|Q104903124}} |mimetypes={{mimetype|application/x-wacz}} |locfd...")  | 
			|||
| Line 22: | Line 22: | ||
       676  12-19-2023 10:27   datapackage-digest.json  |        676  12-19-2023 10:27   datapackage-digest.json  | ||
</pre>  | </pre>  | ||
| + | |||
| + | ==Software==  | ||
| + | * https://github.com/webrecorder/py-wacz  | ||
==References==  | ==References==  | ||
Revision as of 22:41, 19 December 2023
A Web Archive Collection Zipped[1][2] is a file format designed to package a standard WARC with accompanying metadata into a single file.[3]
Format Information
A WACZ file is a ZIP compressed format which can include:
Archive:  sample.wacz
  Length      Date    Time    Name
---------  ---------- -----   ----
    77751  12-19-2023 10:27   pages/pages.jsonl
 19477775  12-19-2023 10:27   archive/data.warc.gz
   525986  12-19-2023 10:27   indexes/index.cdx
      828  12-19-2023 10:27   datapackage.json
      676  12-19-2023 10:27   datapackage-digest.json