Archiving

From Just Solve the File Format Problem
(Difference between revisions)
(Institutional archiving)
(Filesystem backup and restoration)
 
(34 intermediate revisions by 4 users not shown)
Line 18: Line 18:
  * [[bar]] (SunOS 4)
  * [[cpio]] (.cpio)
+ * [[Hamster archive]]
  * [[LBR]] (.lbr)
  * [[LBR (Commodore)]] (.lbr)
Line 23: Line 24:
  * [[Mozilla Archive]] (.mar)
  * [[pax]]
+ * [[Quake PAK]]
+ * [[SeqBox]] (.sbx)
  * [[Tape Archive]] (.tar)
  * [[VIV]] (.viv, .big; used in Electronic Arts games)
Line 42: Line 45:
  * [[ARJ]]
  * [[Astrotite]] (.afa)
+ * [[B6Z]] (.b6z)
  * [[BlakHole]] (.bh)
  * [[BRU]] - (RSX-11M backup)
  * [[Cabinet]] (.cab)
+ * [[CAR]] (a SAP archive format; also see [[SAR]])
+ * [[CMZ (archive format)|CMZ]]
  * [[Compact File Set]] (.cfs)
  * [[Compact Pro]] (.cpt)
Line 51: Line 57:
  * [[DWC]] (.dwc)
  * [[EGG (ALZip)]] (.egg)
+ * [[FTCOMP]]
  * [[GCA]] (.gca)
  * [[Greenleaf ArchiveLib]]
  * [[HA]] (.ha)
  * [[HKI]] (WinHKI) (.hki)
- * [[ICE]] (.ice)
+ * [[ICE (ICEOWS)]] (.ice)
  * [[IMP]] (.imp)
  * [[JAR (ARJ Software)]] (.j)
Line 63: Line 70:
  * [[MAFF]] (Mozilla web archive; uses ZIP for compression)
  * [[NuFX]] (.bxy)
+ * [[packARC]] (.pja)
  * [[Packdir]] (RISC OS)
  * [[PackIt]] (.pit)
Line 74: Line 82:
  * [[Quantum compressed archive]] (.q)
  * [[RAR]] (.rar)
+ * [[RARC (Nintendo)]]
  * [[RK]] (WinRK)
  * [[S7z]] (.s7z)
+ * [[SAR]] (a SAP archive format; also see [[CAR]])
  * [[Scifer]] (.sen, .ba, .xml)
  * [[ShrinkIt]] (.shk)
Line 86: Line 96:
  * [[UltraCompressor II]] (.uc, .uc0, .uc2, .ucn, .ur2, .ue2)
  * [[WARC]] (.warc, .warc.gz)
- * [[xar]] (.xar)
+ * [[xar (Extensible Archive)]] (.xar)
  * [[XP3]] (.xp3) - used in [[KiriKiri Adventure Game System]]
  * [[Yamazaki zipper archive]] (.yz1) - used in DeepFreezer
Line 92: Line 102:
  * [[Zipx]] (.zipx)
  * [[ZOO]] (.zoo)
+ * [[ZPAQ]]
  * [[Zzip]] (.zz)

Line 102: Line 113:
  * [[FidoNet nodediff]]
  * [[GDIFF]]
+ * [[IPS (binary patch format)]]
+ * [[JSON Patch]]
  * [[RTPatch]]
  * [[Unified diff]]
+ * [[UPS (binary patch format)]]
+ * [[VCDIFF]]

- == Filesystem backup ==
+ See also [[Wikipedia:Data differencing]].
+
+ == Encrypted or cryptographically signed archives ==
+ * [[XIP]]
+ * [[ZED]]
+
+ == Filesystem backup and restoration ==
  Formats designed for backing up disks, usually specifically targeted at particular [[filesystem]]s (but backing it up in the form of file structures, not raw sector images like [[Disk Image Formats]]).
  * [[BACKUP (MS-DOS)]]
+ * [[BlockHashLoc]]
  * [[Partimage]]
  * [[ufsdump]]
+ * [[Unix dump]]
  * [[Windows Imaging Format]] (.wim, .swm)

Line 120: Line 143:
  * [[SIRF]] (Self-Contained Information Retention Format)
  * [[Submission Information Package]]
+ * [[UPF]] (Universal Preservation Format)

  == Metadata formats ==
  * [[Archive Team hostname file]]
+ * [[CDX]] (associated with web archives)
  * [[DFDL]] (Data Format Description Language) - a file format for describing file formats
  * [[Internet Archive metadata]]
  * [[mtree]]
+ * [[Portland Common Data Model]]
  * [[TOSEC Naming Convention]]
  * [[Web Archive Metadata File]] (metadata for [[WARC]])
Line 139: Line 165:
  * [[Bundle file (Linux)]]
  * [[Bundle file (OS X)]]
+ * [[Cabal]]
+ * [[Cross-Platform Installer Module]]
  * [[deb]] (Debian package)
- * [[GEOS Convert]] (.cvt)
  * [[IPS archive|Image Packaging System (IPS) archive]] (.p5p)
  * [[IPS manifest|Image Packaging System (IPS) manifest]] (.p5m)
  * [[Inno Setup self-extracting archive]]
  * [[InstallShield CAB]]
+ * [[InstallShield installer archive]]
  * [[InstallShield Z]]
  * [[IPA]] (.ipa) => iOS app in archived form; actually in zip format
  * [[Jar]] (.j, .jar) (Java) => this is just a renamed zip file with some mandatory files
  * [[Meta-Package File]] (.mpkg) (Mac OS X)
+ * [[Microsoft Update Standalone Package]]
  * [[NSIS]] (Nullsoft Scriptable Install System)
  * [[Olive]] (executable virtual-machine archive of software)
Line 154: Line 183:
  * [[RPM]] (RedHat Package Manager)
  * [[SIS]] (Symbian)
- * [[TI variable file]]
+ * [[Snap]] [https://docs.snapcraft.io/the-snap-format/698]
  * [[Windows Installer]] (.msi, .msp)
+ * [[XAR (Executable Archive)]]

  == Self-extracting archives ==
Line 165: Line 195:
  * [[shar]] (Shell Archive, in Unix-like systems)

- == Transfer encodings ==
+ == Transfer formats ==
- (convert binaries to printable ASCII for download; encode resource forks or metadata together with file, etc.)
+ (encapsulate a system-specific file structure to make it compatible with foreign systems, file transfer protocols, etc.)
- * [[Ascii85]]
+ * [[AppleDouble]]
+ * [[AppleSingle]]
+ * [[Bento]] (1990s Apple cross-platform compound content format)
  * [[Binary II]] (.bny) (for Apple II series) (often [[squeeze]]d as .bqy)
  * [[BinHex]] (.hqx)
+ * [[GEOS Convert]] (.cvt)
  * [[MacBinary]]
+ * [[TI variable file]]

- See also the main article, [[Transfer Encodings]].
+ For low-level transfer encodings, see [[Transfer Encodings]].

  == Links ==
  * [http://www.coredumps.de/doc/dump/zwicky/testdump.doc.html Torture-testing Backup and Archive Programs: Things You Ought to Know But Probably Would Rather Not]
  * [http://www.ietf.org/mail-archive/web/apps-discuss/current/msg13094.html Discussion of proposed top-level MIME type for 'archive']
+ * [http://blogs.loc.gov/digitalpreservation/2014/02/what-do-you-mean-by-archive-genres-of-usage-for-digital-preservers/ What Do you Mean by Archive? Genres of Usage for Digital Preservers]

Latest revision as of 05:18, 26 January 2019

File Format
Name Archiving
Ontology


(Lossless, for generic data + file archives)

See also: Compression, Error detection and correction


== Archiving only ==

(many files => 1 file, no compression attempted)
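
As a rough illustration of "many files => 1 file" with no compression, the sketch below uses Python's standard `tarfile` module to write a plain tar stream in memory. The helper names `pack` and `names` and the sample file contents are assumptions made for this example; they do not describe any particular tool listed in this section.

```python
import io
import tarfile


def pack(files: dict[str, bytes]) -> bytes:
    """Bundle named byte strings into one uncompressed tar archive."""
    buf = io.BytesIO()
    # Mode "w" writes a plain tar stream with no compression layer.
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for name, data in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def names(archive: bytes) -> list[str]:
    """List member names, accepting only an uncompressed tar stream."""
    with tarfile.open(fileobj=io.BytesIO(archive), mode="r:") as tar:
        return tar.getnames()


# Hypothetical sample input.
files = {"a.txt": b"hello " * 100, "b.txt": b"world " * 100}
archive = pack(files)
print(names(archive), len(archive))
```

Because nothing is compressed, the archive is larger than the combined input (headers and padding are added) and each file's bytes appear in it verbatim.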

== Compression + archiving ==

(Multiple files => 1 file, compressed to make the result smaller. See Compression for formats that compress single files/streams, and Disk Image Formats for formats that capture the low-level structure of a disk)
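
A minimal sketch of combining both steps, assuming ZIP with Deflate as the example format (any of the formats in this category would illustrate the same idea). It uses Python's standard `zipfile` module; `pack_compressed`, `unpack`, and the sample data are names invented for this example.

```python
import io
import zipfile


def pack_compressed(files: dict[str, bytes]) -> bytes:
    """Bundle named byte strings into one Deflate-compressed ZIP archive."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for name, data in files.items():
            zf.writestr(name, data)
    return buf.getvalue()


def unpack(archive: bytes) -> dict[str, bytes]:
    """Read every member back into a name -> bytes mapping."""
    with zipfile.ZipFile(io.BytesIO(archive)) as zf:
        return {name: zf.read(name) for name in zf.namelist()}


# Hypothetical, highly repetitive sample input so the size reduction is visible.
samples = {"a.txt": b"hello " * 1000, "b.txt": b"world " * 1000}
zipped = pack_compressed(samples)
print(len(zipped), "bytes for", sum(map(len, samples.values())), "bytes of input")
```

How much smaller the result gets depends entirely on the input; already-compressed data may not shrink at all.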

== Diff files ==

(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)

See also Wikipedia:Data differencing.
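
The idea of storing only the changed parts and applying them to an existing file can be sketched as a list of "copy from the old file" and "insert these new bytes" instructions. The example below builds such a delta with Python's standard `difflib.SequenceMatcher`. The instruction layout, the function names `make_delta` and `apply_delta`, and the sample strings are assumptions for illustration only, not the encoding of any specific patch format.

```python
from difflib import SequenceMatcher


def make_delta(old: bytes, new: bytes) -> list[tuple]:
    """Describe `new` as copies from `old` plus literal inserted bytes."""
    delta = []
    matcher = SequenceMatcher(None, old, new, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Unchanged range: store only its position in the old file.
            delta.append(("copy", i1, i2))
        elif tag in ("replace", "insert"):
            # Changed range: store the new bytes themselves.
            delta.append(("data", new[j1:j2]))
        # "delete" needs no entry: those old bytes are simply never copied.
    return delta


def apply_delta(old: bytes, delta: list[tuple]) -> bytes:
    """Rebuild the new file from the old file and the delta."""
    out = bytearray()
    for op in delta:
        if op[0] == "copy":
            out += old[op[1]:op[2]]
        else:
            out += op[1]
    return bytes(out)


# Hypothetical sample data.
old = b"The quick brown fox"
new = b"The quick red fox jumps"
delta = make_delta(old, new)
print(delta)
print(apply_delta(old, delta))
```

The delta is only useful together with the exact old file it was computed against, which is why such formats suit update distribution and incremental backups.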

== Encrypted or cryptographically signed archives ==

== Filesystem backup and restoration ==

Formats designed for backing up disks, usually targeted at particular filesystems (but backing them up in the form of file structures, not as raw sector images like Disk Image Formats).

== Institutional archiving ==

== Metadata formats ==

(see also Metadata, Bibliographic data)

== Miscellaneous ==

== Program/App/Applet/Installer specialized archive formats ==

== Self-extracting archives ==

(In addition, some of the other archivers can produce executable files for a given platform that bundle the archived data with a program to extract it. These files generally carry the platform's normal executable extension, such as .exe for DOS/Windows. In many cases, renaming the extension to the one for the underlying archive type, e.g. .zip, lets the file open in the corresponding archiver, which is handy for extracting it on systems that cannot run the original executable.)
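
For ZIP-based self-extractors, the reason this works can be sketched in a few lines: a ZIP reader locates the archive's directory from the end of the file, so leading executable code is ignored. The example below fakes a self-extracting file by prepending placeholder bytes to a ZIP and then reads it with Python's standard `zipfile` module. `STUB`, `build_sfx`, and `extract_all` are hypothetical names, and the stub is filler rather than a real extractor program; other archive types may not tolerate leading data in the same way.

```python
import io
import zipfile

# Hypothetical stand-in for an extractor program; a real stub is
# platform-specific executable code, not zero padding.
STUB = b"MZ" + b"\x00" * 510


def build_sfx(files: dict[str, bytes]) -> bytes:
    """Return placeholder 'executable' bytes followed by a ZIP archive."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for name, data in files.items():
            zf.writestr(name, data)
    return STUB + buf.getvalue()


def extract_all(blob: bytes) -> dict[str, bytes]:
    """Read the members without running (or understanding) the stub."""
    # zipfile finds the end-of-central-directory record near the end of
    # the data and adjusts member offsets for the bytes in front of it.
    with zipfile.ZipFile(io.BytesIO(blob)) as zf:
        return {name: zf.read(name) for name in zf.namelist()}


sfx = build_sfx({"readme.txt": b"extracted without running the stub\n"})
print(extract_all(sfx))
```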

== Transfer formats ==

(encapsulate a system-specific file structure to make it compatible with foreign systems, file transfer protocols, etc.)

For low-level transfer encodings, see Transfer Encodings.

== Links ==
