Archiving

From Just Solve the File Format Problem
(Difference between revisions)
(Filesystem backup and restoration)
 
(46 intermediate revisions by 5 users not shown)
Line 15: Line 15:
 
* [[ARC (Internet Archive)]] (.arc)
 
* [[ARC (Internet Archive)]] (.arc)
 
* [[BagIt]]
 
* [[BagIt]]
 +
* [[BAR (Age of Mythology)]]
 
* [[bar]] (SunOS 4)
 
* [[bar]] (SunOS 4)
 
* [[cpio]] (.cpio)
 
* [[cpio]] (.cpio)
 +
* [[Hamster archive]]
 
* [[LBR]] (.lbr)
 
* [[LBR]] (.lbr)
 
* [[LBR (Commodore)]] (.lbr)
 
* [[LBR (Commodore)]] (.lbr)
Line 22: Line 24:
 
* [[Mozilla Archive]] (.mar)
 
* [[Mozilla Archive]] (.mar)
 
* [[pax]]
 
* [[pax]]
 +
* [[Quake PAK]]
 +
* [[SeqBox]] (.sbx)
 
* [[Tape Archive]] (.tar)
 
* [[Tape Archive]] (.tar)
-
* [[VIV]] (.viv; used in Electronic Arts games)
+
* [[VIV]] (.viv, .big; used in Electronic Arts games)
 +
* [[Webarchive (Safari)]]
  
 
== Compression + archiving ==
 
== Compression + archiving ==
 
(Multiple files => 1 file, makes it smaller. See [[Compression]] for formats that compress single files/streams, and [[Disk Image Formats]] for formats that capture the low-level structure of a disk)
 
(Multiple files => 1 file, makes it smaller. See [[Compression]] for formats that compress single files/streams, and [[Disk Image Formats]] for formats that capture the low-level structure of a disk)
  
 +
* [[4Q]] (.4q)
 
* [[7z]] (.7z)
 
* [[7z]] (.7z)
 
* [[ACE]] (.ace)
 
* [[ACE]] (.ace)
Line 39: Line 45:
 
* [[ARJ]]
 
* [[ARJ]]
 
* [[Astrotite]] (.afa)
 
* [[Astrotite]] (.afa)
 +
* [[B6Z]] (.b6z)
 
* [[BlakHole]] (.bh)
 
* [[BlakHole]] (.bh)
 
* [[BRU]] - (RSX-11M backup)
 
* [[BRU]] - (RSX-11M backup)
 
* [[Cabinet]] (.cab)
 
* [[Cabinet]] (.cab)
 +
* [[CAR]] (a SAP archive format; also see [[SAR]])
 +
* [[CMZ (archive format)|CMZ]]
 
* [[Compact File Set]] (.cfs)
 
* [[Compact File Set]] (.cfs)
 
* [[Compact Pro]] (.cpt)
 
* [[Compact Pro]] (.cpt)
Line 48: Line 57:
 
* [[DWC]] (.dwc)
 
* [[DWC]] (.dwc)
 
* [[EGG (ALZip)]] (.egg)
 
* [[EGG (ALZip)]] (.egg)
 +
* [[FTCOMP]]
 
* [[GCA]] (.gca)
 
* [[GCA]] (.gca)
 
* [[Greenleaf ArchiveLib]]
 
* [[Greenleaf ArchiveLib]]
 
* [[HA]] (.ha)
 
* [[HA]] (.ha)
 
* [[HKI]] (WinHKI) (.hki)
 
* [[HKI]] (WinHKI) (.hki)
-
* [[ICE]] (.ice)
+
* [[ICE (ICEOWS)]] (.ice)
 
* [[IMP]] (.imp)
 
* [[IMP]] (.imp)
 
* [[JAR (ARJ Software)]] (.j)
 
* [[JAR (ARJ Software)]] (.j)
Line 58: Line 68:
 
* [[LHA]] (.lzh, .lha)
 
* [[LHA]] (.lzh, .lha)
 
* [[LZX]] (.lzx)
 
* [[LZX]] (.lzx)
 +
* [[MAFF]] (Mozilla web archive; uses ZIP for compression)
 
* [[NuFX]] (.bxy)
 
* [[NuFX]] (.bxy)
 +
* [[packARC]] (.pja)
 +
* [[Packdir]] (RISC OS)
 
* [[PackIt]] (.pit)
 
* [[PackIt]] (.pit)
 
* [[PAK]] (.pak)  
 
* [[PAK]] (.pak)  
Line 69: Line 82:
 
* [[Quantum compressed archive]] (.q)
 
* [[Quantum compressed archive]] (.q)
 
* [[RAR]] (.rar)
 
* [[RAR]] (.rar)
 +
* [[RARC (Nintendo)]]
 
* [[RK]] (WinRK)
 
* [[RK]] (WinRK)
 
* [[S7z]] (.s7z)
 
* [[S7z]] (.s7z)
 +
* [[SAR]] (a SAP archive format; also see [[CAR]])
 
* [[Scifer]] (.sen, .ba, .xml)
 
* [[Scifer]] (.sen, .ba, .xml)
 
* [[ShrinkIt]] (.shk)
 
* [[ShrinkIt]] (.shk)
Line 81: Line 96:
 
* [[UltraCompressor II]] (.uc, .uc0, .uc2, .ucn, .ur2, .ue2)
 
* [[UltraCompressor II]] (.uc, .uc0, .uc2, .ucn, .ur2, .ue2)
 
* [[WARC]] (.warc, .warc.gz)
 
* [[WARC]] (.warc, .warc.gz)
-
* [[xar]] (.xar)
+
* [[xar (Extensible Archive)]] (.xar)
 
* [[XP3]] (.xp3) - used in [[KiriKiri Adventure Game System]]
 
* [[XP3]] (.xp3) - used in [[KiriKiri Adventure Game System]]
 
* [[Yamazaki zipper archive]] (.yz1) - used in DeepFreezer
 
* [[Yamazaki zipper archive]] (.yz1) - used in DeepFreezer
Line 87: Line 102:
 
* [[Zipx]] (.zipx)
 
* [[Zipx]] (.zipx)
 
* [[ZOO]] (.zoo)
 
* [[ZOO]] (.zoo)
 +
* [[ZPAQ]]
 
* [[Zzip]] (.zz)
 
* [[Zzip]] (.zz)
  
 
== Diff files ==
 
== Diff files ==
 
(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)
 
(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)
 +
* [[binpatch (Jaap Korthals Altes)]]
 
* [[bsdiff]]
 
* [[bsdiff]]
 
* [[Courgette]]
 
* [[Courgette]]
Line 96: Line 113:
 
* [[FidoNet nodediff]]
 
* [[FidoNet nodediff]]
 
* [[GDIFF]]
 
* [[GDIFF]]
 +
* [[IPS (binary patch format)]]
 +
* [[JSON Patch]]
 
* [[RTPatch]]
 
* [[RTPatch]]
 
* [[Unified diff]]
 
* [[Unified diff]]
 +
* [[UPS (binary patch format)]]
 +
* [[VCDIFF]]
  
-
== Filesystem backup ==
+
See also [[Wikipedia:Data differencing]].
 +
 
 +
== Encrypted or cryptographically signed archives ==
 +
* [[XIP]]
 +
* [[ZED]]
 +
 
 +
== Filesystem backup and restoration ==
 
Formats designed for backing up disks, usually specifically targeted at particular [[filesystem]]s (but backing it up in the form of file structures, not raw sector images like [[Disk Image Formats]]).
 
Formats designed for backing up disks, usually specifically targeted at particular [[filesystem]]s (but backing it up in the form of file structures, not raw sector images like [[Disk Image Formats]]).
 
* [[BACKUP (MS-DOS)]]
 
* [[BACKUP (MS-DOS)]]
 +
* [[BlockHashLoc]]
 
* [[Partimage]]  
 
* [[Partimage]]  
 
* [[ufsdump]]
 
* [[ufsdump]]
 +
* [[Unix dump]]
 
* [[Windows Imaging Format]] (.wim, .swm)
 
* [[Windows Imaging Format]] (.wim, .swm)
  
 
== Institutional archiving ==
 
== Institutional archiving ==
 +
* [[ARK]] (identifier for archived documents)
 
* [[OAIS]] (Open Archival Information System, an organization that does archiving)
 
* [[OAIS]] (Open Archival Information System, an organization that does archiving)
 
* [[Planets Core Registry]]
 
* [[Planets Core Registry]]
 
* [[PRONOM]]
 
* [[PRONOM]]
 
** [[PUID]]
 
** [[PUID]]
 +
* [[SIRF]] (Self-Contained Information Retention Format)
 
* [[Submission Information Package]]
 
* [[Submission Information Package]]
 +
* [[UPF]] (Universal Preservation Format)
  
 
== Metadata formats ==
 
== Metadata formats ==
 
* [[Archive Team hostname file]]
 
* [[Archive Team hostname file]]
 +
* [[CDX]] (associated with web archives)
 
* [[DFDL]] (Data Format Description Language) - a file format for describing file formats
 
* [[DFDL]] (Data Format Description Language) - a file format for describing file formats
 
* [[Internet Archive metadata]]
 
* [[Internet Archive metadata]]
 
* [[mtree]]
 
* [[mtree]]
 +
* [[Portland Common Data Model]]
 
* [[TOSEC Naming Convention]]
 
* [[TOSEC Naming Convention]]
 +
* [[Web Archive Metadata File]] (metadata for [[WARC]])
  
 
(see also [[Metadata]], [[Bibliographic data]])
 
(see also [[Metadata]], [[Bibliographic data]])
Line 130: Line 165:
 
* [[Bundle file (Linux)]]
 
* [[Bundle file (Linux)]]
 
* [[Bundle file (OS X)]]
 
* [[Bundle file (OS X)]]
 +
* [[Cabal]]
 +
* [[Cross-Platform Installer Module]]
 
* [[deb]] (Debian package)
 
* [[deb]] (Debian package)
-
* [[GEOS Convert]] (.cvt)
 
 
* [[IPS archive|Image Packaging System (IPS) archive]] (.p5p)
 
* [[IPS archive|Image Packaging System (IPS) archive]] (.p5p)
 
* [[IPS manifest|Image Packaging System (IPS) manifest]] (.p5m)
 
* [[IPS manifest|Image Packaging System (IPS) manifest]] (.p5m)
 
* [[Inno Setup self-extracting archive]]
 
* [[Inno Setup self-extracting archive]]
 
* [[InstallShield CAB]]
 
* [[InstallShield CAB]]
 +
* [[InstallShield installer archive]]
 
* [[InstallShield Z]]
 
* [[InstallShield Z]]
 
* [[IPA]] (.ipa) => iOS app in archived form; actually in zip format
 
* [[IPA]] (.ipa) => iOS app in archived form; actually in zip format
 
* [[Jar]] (.j, .jar) (Java) => this is just a renamed zip file with some mandatory files
 
* [[Jar]] (.j, .jar) (Java) => this is just a renamed zip file with some mandatory files
 
* [[Meta-Package File]] (.mpkg) (Mac OS X)
 
* [[Meta-Package File]] (.mpkg) (Mac OS X)
 +
* [[Microsoft Update Standalone Package]]
 
* [[NSIS]] (Nullsoft Scriptable Install System)
 
* [[NSIS]] (Nullsoft Scriptable Install System)
 +
* [[Olive]] (executable virtual-machine archive of software)
 
* [[Package File (OS X)]] (.pkg)
 
* [[Package File (OS X)]] (.pkg)
 
* [[RPM]] (RedHat Package Manager)
 
* [[RPM]] (RedHat Package Manager)
 
* [[SIS]] (Symbian)
 
* [[SIS]] (Symbian)
-
* [[TI variable file]]
+
* [[Snap]] [https://docs.snapcraft.io/the-snap-format/698]
 
* [[Windows Installer]] (.msi, .msp)
 
* [[Windows Installer]] (.msi, .msp)
 +
* [[XAR (Executable Archive)]]
  
 
== Self-extracting archives ==
 
== Self-extracting archives ==
Line 155: Line 195:
 
* [[shar]] (Shell Archive, in Unix-like systems)
 
* [[shar]] (Shell Archive, in Unix-like systems)
  
-
== Transfer encodings ==
+
== Transfer formats ==
-
(convert binaries to printable ASCII for download; encode resource forks or metadata together with file, etc.)
+
(encapsulate a system-specific file structure to make it compatible with foreign systems, file transfer protocols, etc.)
-
* [[Ascii85]]
+
* [[AppleDouble]]
 +
* [[AppleSingle]]
 +
* [[Bento]] (1990s Apple cross-platform compound content format)
 
* [[Binary II]] (.bny) (for Apple II series) (often [[squeeze]]d as .bqy)
 
* [[Binary II]] (.bny) (for Apple II series) (often [[squeeze]]d as .bqy)
 
* [[BinHex]] (.hqx)
 
* [[BinHex]] (.hqx)
 +
* [[GEOS Convert]] (.cvt)
 
* [[MacBinary]]
 
* [[MacBinary]]
 +
* [[TI variable file]]
  
-
See also the main article, [[Transfer Encodings]].
+
For low-level transfer encodings, see [[Transfer Encodings]].
  
 
== Links ==
 
== Links ==
 
* [http://www.coredumps.de/doc/dump/zwicky/testdump.doc.html Torture-testing Backup and Archive Programs: Things You Ought to Know But Probably Would Rather Not]
 
* [http://www.coredumps.de/doc/dump/zwicky/testdump.doc.html Torture-testing Backup and Archive Programs: Things You Ought to Know But Probably Would Rather Not]
 
* [http://www.ietf.org/mail-archive/web/apps-discuss/current/msg13094.html Discussion of proposed top-level MIME type for 'archive']
 
* [http://www.ietf.org/mail-archive/web/apps-discuss/current/msg13094.html Discussion of proposed top-level MIME type for 'archive']
 +
* [http://blogs.loc.gov/digitalpreservation/2014/02/what-do-you-mean-by-archive-genres-of-usage-for-digital-preservers/ What Do you Mean by Archive? Genres of Usage for Digital Preservers]

Latest revision as of 05:18, 26 January 2019

File Format
Name Archiving
Ontology

(Lossless, for generic data + file archives)

See also: Compression, Error detection and correction

Archiving only

(many files => 1 file, no compression attempted)

Compression + archiving

(Multiple files => 1 file, makes it smaller. See Compression for formats that compress single files/streams, and Disk Image Formats for formats that capture the low-level structure of a disk)
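As a rough illustration of the difference between "archiving only" and "compression + archiving", the following Python sketch packs the same files into a plain tar container and into a gzip-compressed one, using the standard tarfile module. The helper names (build_tar, list_tar) and the sample file names are made up for this example and are not part of any format listed here.

```python
import io
import tarfile


def build_tar(members: dict[str, bytes], mode: str) -> bytes:
    """Pack in-memory files into a tar stream.

    mode "w" only archives; mode "w:gz" archives and compresses.
    """
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode=mode) as tar:
        for name, data in members.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(data)  # the header must carry the payload size
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def list_tar(blob: bytes) -> list[str]:
    """Return member names; "r:*" lets tarfile detect the compression."""
    with tarfile.open(fileobj=io.BytesIO(blob), mode="r:*") as tar:
        return tar.getnames()


if __name__ == "__main__":
    # Hypothetical sample data, chosen to be highly repetitive.
    sample = {"a.txt": b"A" * 50000, "b.txt": b"B" * 50000}
    plain = build_tar(sample, "w")
    packed = build_tar(sample, "w:gz")
    print(len(plain), len(packed), list_tar(packed))
```

With repetitive input like this, the "w:gz" result should come out much smaller than the "w" result, while both list the same member names.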

Diff files

(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)

See also Wikipedia:Data differencing.
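The idea can be sketched with a toy delta made of "copy from the old file" and "add new bytes" instructions. This is only an illustration of the general principle, not the on-disk layout of any format in this category; the instruction tuples and the function names make_delta and apply_delta are assumptions made for this example, and the matching is done with Python's difflib.SequenceMatcher rather than any format's own algorithm.

```python
from difflib import SequenceMatcher


def make_delta(old: bytes, new: bytes) -> list[tuple]:
    """Describe `new` as copies from `old` plus literal additions."""
    delta = []
    matcher = SequenceMatcher(None, old, new, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Unchanged run: store only where it lives in the old file.
            delta.append(("copy", i1, i2 - i1))
        elif tag in ("replace", "insert"):
            # Changed or new run: store the new bytes themselves.
            delta.append(("add", new[j1:j2]))
        # "delete" needs no instruction: those old bytes are never copied.
    return delta


def apply_delta(old: bytes, delta: list[tuple]) -> bytes:
    """Rebuild the new file from the old file and the delta."""
    out = bytearray()
    for op in delta:
        if op[0] == "copy":
            _, start, length = op
            out += old[start:start + length]
        else:
            out += op[1]
    return bytes(out)


if __name__ == "__main__":
    old = b"The quick brown fox"
    new = b"The quick red fox jumps"
    delta = make_delta(old, new)
    assert apply_delta(old, delta) == new
    print(delta)
```

Only the "add" payloads carry new data, which is why such a delta can be much smaller than the full new file when most of the content is unchanged.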

Encrypted or cryptographically signed archives

Filesystem backup and restoration

Formats designed for backing up disks, usually targeted at particular filesystems (but storing the contents as file structures, not as raw sector images like Disk Image Formats).

Institutional archiving

Metadata formats

(see also Metadata, Bibliographic data)

Miscellaneous

Program/App/Applet/Installer specialized archive formats

Self-extracting archives

(In addition, some of the other archivers can produce executable files for a given platform that bundle the archived data with a program to extract it. These generally carry the normal extension for executables, such as .exe for DOS/Windows. In many cases, renaming the file to the extension of the underlying archive type, e.g., .zip, lets it open in the corresponding archiver, which is handy for extracting the contents on systems not compatible with the original executable.)
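For ZIP-based self-extractors specifically, the reason this tends to work is that a ZIP reader locates the archive from its end-of-central-directory record near the end of the file, so data placed in front of the archive (such as an extractor program) need not get in the way. The Python sketch below imitates that situation with the standard zipfile module. The "stub" here is just placeholder bytes, not a real executable, and the function names make_fake_sfx and read_members are invented for this example; other archive types may not behave the same way.

```python
import io
import zipfile


def make_fake_sfx(stub: bytes, members: dict[str, bytes]) -> bytes:
    """Build a ZIP in memory and put arbitrary stub bytes in front of it."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        for name, data in members.items():
            zf.writestr(name, data)
    return stub + buf.getvalue()


def read_members(blob: bytes) -> dict[str, bytes]:
    """Return {name: contents} if the blob holds a ZIP archive, else {}."""
    fileobj = io.BytesIO(blob)
    # is_zipfile looks for the ZIP end record, not at a file extension.
    if not zipfile.is_zipfile(fileobj):
        return {}
    fileobj.seek(0)
    with zipfile.ZipFile(fileobj) as zf:
        return {name: zf.read(name) for name in zf.namelist()}


if __name__ == "__main__":
    # Placeholder stub standing in for an extractor program.
    fake_stub = b"MZ" + b"\x00" * 62
    blob = make_fake_sfx(fake_stub, {"readme.txt": b"hello"})
    print(read_members(blob))
```

The same check on data that contains no ZIP end record yields an empty result, so it can serve as a cheap first test before trying an archiver on an unknown executable.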

Transfer formats

(encapsulate a system-specific file structure to make it compatible with foreign systems, file transfer protocols, etc.)

For low-level transfer encodings, see Transfer Encodings.

Links
