WARC

From Just Solve the File Format Problem
(Difference between revisions)
(Other links and references)
Line 3: Line 3:
  |extensions={{ext|warc}}, {{ext|warc.gz}}
  |pronom={{PRONOM|fmt/289}}
+ |mimetypes={{mimetype|application/warc}}, {{mimetype|application/warc-fields}}
  }}

Line 40: Line 41:
  * [http://www.digitalstudies.org/ojs/index.php/digital_studies/article/view/325/412 The great WARC adventure: Using SIPS, AIPS, and DIPS to document SLAAPs]
  * [http://inkdroid.org/2016/04/14/warc-work/ WARC Work]
+ * [https://kris-sigur.blogspot.com/2016/05/warc-mime-type.html?spref=tw WARC MIME Media Type] (as of now unregistered, but a suggested value exists)

  [[Category:Internet Archive]]
  [[Category:Web]]

Revision as of 13:42, 17 May 2016

File Format
Name WARC
Ontology
Extension(s) .warc, .warc.gz
MIME Type(s) application/warc, application/warc-fields
PRONOM fmt/289

WARC is the successor to the ARC (Internet Archive) format. It is standardized as ISO 28500:2009, Information and documentation -- WARC file format, and was developed under the auspices of the International Internet Preservation Consortium. WARC was developed as an extension to ARC, in part to provide better capabilities for managing Web archives over the long term by allowing more metadata about the circumstances of archiving to be captured.

WARC files are often compressed using gzip, resulting in a .warc.gz extension.
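
As a rough illustration of what reading such a file involves, the following Python sketch walks the records of a WARC stream, decompressing it first if it starts with the gzip magic bytes. The function names (open_warc_stream, iter_warc_records) are hypothetical and not part of any library. The sketch assumes every record carries a Content-Length header and does not handle folded (multi-line) header values, so it is a simplification rather than a full implementation of the standard.

```python
import gzip
import io

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of any gzip stream


def open_warc_stream(raw: bytes):
    """Return a binary stream over WARC data, decompressing gzip if needed.

    Hypothetical helper: takes the whole file as bytes for simplicity.
    """
    if raw[:2] == GZIP_MAGIC:
        # GzipFile reads concatenated gzip members transparently,
        # which covers files that compress each record separately.
        return gzip.GzipFile(fileobj=io.BytesIO(raw))
    return io.BytesIO(raw)


def iter_warc_records(stream):
    """Yield (version, headers, block) for each record in the stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of data
        if line in (b"\r\n", b"\n"):
            continue  # blank lines separating one record from the next
        version = line.strip().decode("utf-8")
        if not version.startswith("WARC/"):
            raise ValueError(f"unexpected version line: {version!r}")

        # Named header fields, one per line, ended by an empty line.
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break
            name, _, value = line.decode("utf-8").partition(":")
            headers[name.strip()] = value.strip()

        # Assumption: Content-Length is present and gives the block size.
        length = int(headers["Content-Length"])
        block = stream.read(length)
        yield version, headers, block
```

For example, passing the bytes of a .warc or .warc.gz file to open_warc_stream and iterating over iter_warc_records on the result gives one tuple per record. Header values are returned as plain strings.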

There is also a specification for a Web Archive Metadata File. Another metadata format used with WARC files is CDX.


Specifications

Sample files

Tools

Other links and references
