WARC
From Just Solve the File Format Problem
(Difference between revisions)
m |
(Added link to sample files) |
||
Line 11: | Line 11: | ||
{{FormatInfo | {{FormatInfo | ||
|subcat=Compression | |subcat=Compression | ||
− | |extensions={{ext|warc}} | + | |extensions={{ext|warc}}<br>{{ext|warc.gz}} |
}} | }} | ||
Successor to the [[ARC (Internet Archive)]] format. Standardized as ISO 28500:2009, Information and documentation -- WARC file format. Developed under the auspices of the International Internet Preservation Consortium. WARC was developed as an extension to ARC in part to provide better capabilities for managing Web archives for the long term, allowing for capture of more metadata about the circumstances of archiving. | Successor to the [[ARC (Internet Archive)]] format. Standardized as ISO 28500:2009, Information and documentation -- WARC file format. Developed under the auspices of the International Internet Preservation Consortium. WARC was developed as an extension to ARC in part to provide better capabilities for managing Web archives for the long term, allowing for capture of more metadata about the circumstances of archiving. | ||
− | + | == Sample files == | |
− | + | * [http://archive.org/details/testWARCfiles Test WARC Files] warc.gz file from Internet Archive. | |
== References == | == References == |
Revision as of 19:02, 12 November 2012
File Formats | > | Electronic File Formats | > | Compression | > | WARC |
Successor to the ARC (Internet Archive) format. Standardized as ISO 28500:2009, Information and documentation -- WARC file format. Developed under the auspices of the International Internet Preservation Consortium. WARC was developed as an extension to ARC in part to provide better capabilities for managing Web archives for the long term, allowing for capture of more metadata about the circumstances of archiving.
Sample files
- Test WARC Files warc.gz file from Internet Archive.