Archiving

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Filesystem backup: add ufsdump)
(Compression + archiving)
(40 intermediate revisions by 5 users not shown)
Line 15: Line 15:
 
* [[ARC (Internet Archive)]] (.arc)
 
* [[ARC (Internet Archive)]] (.arc)
 
* [[BagIt]]
 
* [[BagIt]]
 +
* [[BAR (Age of Mythology)]]
 
* [[bar]] (SunOS 4)
 
* [[bar]] (SunOS 4)
 
* [[cpio]] (.cpio)
 
* [[cpio]] (.cpio)
 +
* [[Hamster archive]]
 
* [[LBR]] (.lbr)
 
* [[LBR]] (.lbr)
 
* [[LBR (Commodore)]] (.lbr)
 
* [[LBR (Commodore)]] (.lbr)
Line 22: Line 24:
 
* [[Mozilla Archive]] (.mar)
 
* [[Mozilla Archive]] (.mar)
 
* [[pax]]
 
* [[pax]]
 +
* [[SeqBox]] (.sbx)
 
* [[Tape Archive]] (.tar)
 
* [[Tape Archive]] (.tar)
 +
* [[VIV]] (.viv, .big; used in Electronic Arts games)
 +
* [[Webarchive (Safari)]]
  
 
== Compression + archiving ==
 
== Compression + archiving ==
 
(Multiple files => 1 file, makes it smaller. See [[Compression]] for formats that compress single files/streams, and [[Disk Image Formats]] for formats that capture the low-level structure of a disk)
 
(Multiple files => 1 file, makes it smaller. See [[Compression]] for formats that compress single files/streams, and [[Disk Image Formats]] for formats that capture the low-level structure of a disk)
  
 +
* [[4Q]] (.4q)
 
* [[7z]] (.7z)
 
* [[7z]] (.7z)
 
* [[ACE]] (.ace)
 
* [[ACE]] (.ace)
Line 38: Line 44:
 
* [[ARJ]]
 
* [[ARJ]]
 
* [[Astrotite]] (.afa)
 
* [[Astrotite]] (.afa)
 +
* [[B6Z]] (.b6z)
 
* [[BlakHole]] (.bh)
 
* [[BlakHole]] (.bh)
 
* [[BRU]] - (RSX-11M backup)
 
* [[BRU]] - (RSX-11M backup)
 
* [[Cabinet]] (.cab)
 
* [[Cabinet]] (.cab)
 +
* [[CAR]] (a SAP archive format; also see [[SAR]])
 
* [[Compact File Set]] (.cfs)
 
* [[Compact File Set]] (.cfs)
 
* [[Compact Pro]] (.cpt)
 
* [[Compact Pro]] (.cpt)
Line 47: Line 55:
 
* [[DWC]] (.dwc)
 
* [[DWC]] (.dwc)
 
* [[EGG (ALZip)]] (.egg)
 
* [[EGG (ALZip)]] (.egg)
 +
* [[FTCOMP]]
 
* [[GCA]] (.gca)
 
* [[GCA]] (.gca)
 
* [[Greenleaf ArchiveLib]]
 
* [[Greenleaf ArchiveLib]]
 
* [[HA]] (.ha)
 
* [[HA]] (.ha)
 
* [[HKI]] (WinHKI) (.hki)
 
* [[HKI]] (WinHKI) (.hki)
* [[ICE]] (.ice)
+
* [[ICE (ICEOWS)]] (.ice)
 
* [[IMP]] (.imp)
 
* [[IMP]] (.imp)
 
* [[JAR (ARJ Software)]] (.j)
 
* [[JAR (ARJ Software)]] (.j)
Line 57: Line 66:
 
* [[LHA]] (.lzh, .lha)
 
* [[LHA]] (.lzh, .lha)
 
* [[LZX]] (.lzx)
 
* [[LZX]] (.lzx)
 +
* [[MAFF]] (Mozilla web archive; uses ZIP for compression)
 
* [[NuFX]] (.bxy)
 
* [[NuFX]] (.bxy)
 +
* [[packARC]] (.pja)
 +
* [[Packdir]] (RISC OS)
 
* [[PackIt]] (.pit)
 
* [[PackIt]] (.pit)
 
* [[PAK]] (.pak)  
 
* [[PAK]] (.pak)  
Line 68: Line 80:
 
* [[Quantum compressed archive]] (.q)
 
* [[Quantum compressed archive]] (.q)
 
* [[RAR]] (.rar)
 
* [[RAR]] (.rar)
 +
* [[RARC (Nintendo)]]
 
* [[RK]] (WinRK)
 
* [[RK]] (WinRK)
 
* [[S7z]] (.s7z)
 
* [[S7z]] (.s7z)
 +
* [[SAR]] (a SAP archive format; also see [[CAR]])
 
* [[Scifer]] (.sen, .ba, .xml)
 
* [[Scifer]] (.sen, .ba, .xml)
 
* [[ShrinkIt]] (.shk)
 
* [[ShrinkIt]] (.shk)
Line 86: Line 100:
 
* [[Zipx]] (.zipx)
 
* [[Zipx]] (.zipx)
 
* [[ZOO]] (.zoo)
 
* [[ZOO]] (.zoo)
 +
* [[ZPAQ]]
 
* [[Zzip]] (.zz)
 
* [[Zzip]] (.zz)
  
 
== Diff files ==
 
== Diff files ==
 
(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)
 
(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)
 +
* [[binpatch (Jaap Korthals Altes)]]
 
* [[bsdiff]]
 
* [[bsdiff]]
 
* [[Courgette]]
 
* [[Courgette]]
Line 95: Line 111:
 
* [[FidoNet nodediff]]
 
* [[FidoNet nodediff]]
 
* [[GDIFF]]
 
* [[GDIFF]]
 +
* [[IPS (binary patch format)]]
 +
* [[JSON Patch]]
 
* [[RTPatch]]
 
* [[RTPatch]]
 
* [[Unified diff]]
 
* [[Unified diff]]
 +
* [[UPS (binary patch format)]]
 +
* [[VCDIFF]]
  
== Filesystem backup ==
+
See also [[Wikipedia:Data differencing]].
 +
 
 +
== Encrypted or cryptographically signed archives ==
 +
* [[XIP]]
 +
* [[ZED]]
 +
 
 +
== Filesystem backup and restoration ==
 
Formats designed for backing up disks, usually specifically targeted at particular [[filesystem]]s (but backing it up in the form of file structures, not raw sector images like [[Disk Image Formats]]).
 
Formats designed for backing up disks, usually specifically targeted at particular [[filesystem]]s (but backing it up in the form of file structures, not raw sector images like [[Disk Image Formats]]).
 
* [[BACKUP (MS-DOS)]]
 
* [[BACKUP (MS-DOS)]]
 +
* [[BlockHashLoc]]
 
* [[Partimage]]  
 
* [[Partimage]]  
 
* [[ufsdump]]
 
* [[ufsdump]]
Line 106: Line 133:
  
 
== Institutional archiving ==
 
== Institutional archiving ==
 +
* [[ARK]] (identifier for archived documents)
 
* [[OAIS]] (Open Archival Information System, an organization that does archiving)
 
* [[OAIS]] (Open Archival Information System, an organization that does archiving)
 
* [[Planets Core Registry]]
 
* [[Planets Core Registry]]
 
* [[PRONOM]]
 
* [[PRONOM]]
 
** [[PUID]]
 
** [[PUID]]
 +
* [[SIRF]] (Self-Contained Information Retention Format)
 
* [[Submission Information Package]]
 
* [[Submission Information Package]]
 +
* [[UPF]] (Universal Preservation Format)
  
 
== Metadata formats ==
 
== Metadata formats ==
 
* [[Archive Team hostname file]]
 
* [[Archive Team hostname file]]
 +
* [[CDX]] (associated with web archives)
 
* [[DFDL]] (Data Format Description Language) - a file format for describing file formats
 
* [[DFDL]] (Data Format Description Language) - a file format for describing file formats
 
* [[Internet Archive metadata]]
 
* [[Internet Archive metadata]]
 
* [[mtree]]
 
* [[mtree]]
 +
* [[Portland Common Data Model]]
 
* [[TOSEC Naming Convention]]
 
* [[TOSEC Naming Convention]]
 +
* [[Web Archive Metadata File]] (metadata for [[WARC]])
  
 
(see also [[Metadata]], [[Bibliographic data]])
 
(see also [[Metadata]], [[Bibliographic data]])
Line 127: Line 160:
 
== Program/App/Applet/Installer specialized archive formats ==
 
== Program/App/Applet/Installer specialized archive formats ==
 
* [[APK]] (Android app)
 
* [[APK]] (Android app)
 +
* [[Bento]] (1990s Apple cross-platform compound content format)
 
* [[Bundle file (Linux)]]
 
* [[Bundle file (Linux)]]
 
* [[Bundle file (OS X)]]
 
* [[Bundle file (OS X)]]
 +
* [[Cabal]]
 +
* [[Cross-Platform Installer Module]]
 
* [[deb]] (Debian package)
 
* [[deb]] (Debian package)
 
* [[GEOS Convert]] (.cvt)
 
* [[GEOS Convert]] (.cvt)
Line 135: Line 171:
 
* [[Inno Setup self-extracting archive]]
 
* [[Inno Setup self-extracting archive]]
 
* [[InstallShield CAB]]
 
* [[InstallShield CAB]]
 +
* [[InstallShield installer archive]]
 
* [[InstallShield Z]]
 
* [[InstallShield Z]]
 
* [[IPA]] (.ipa) => iOS app in archived form; actually in zip format
 
* [[IPA]] (.ipa) => iOS app in archived form; actually in zip format
 
* [[Jar]] (.j, .jar) (Java) => this is just a renamed zip file with some mandatory files
 
* [[Jar]] (.j, .jar) (Java) => this is just a renamed zip file with some mandatory files
 
* [[Meta-Package File]] (.mpkg) (Mac OS X)
 
* [[Meta-Package File]] (.mpkg) (Mac OS X)
 +
* [[Microsoft Update Standalone Package]]
 
* [[NSIS]] (Nullsoft Scriptable Install System)
 
* [[NSIS]] (Nullsoft Scriptable Install System)
 +
* [[Olive]] (executable virtual-machine archive of software)
 
* [[Package File (OS X)]] (.pkg)
 
* [[Package File (OS X)]] (.pkg)
 
* [[RPM]] (RedHat Package Manager)
 
* [[RPM]] (RedHat Package Manager)
Line 166: Line 205:
 
* [http://www.coredumps.de/doc/dump/zwicky/testdump.doc.html Torture-testing Backup and Archive Programs: Things You Ought to Know But Probably Would Rather Not]
 
* [http://www.coredumps.de/doc/dump/zwicky/testdump.doc.html Torture-testing Backup and Archive Programs: Things You Ought to Know But Probably Would Rather Not]
 
* [http://www.ietf.org/mail-archive/web/apps-discuss/current/msg13094.html Discussion of proposed top-level MIME type for 'archive']
 
* [http://www.ietf.org/mail-archive/web/apps-discuss/current/msg13094.html Discussion of proposed top-level MIME type for 'archive']
 +
* [http://blogs.loc.gov/digitalpreservation/2014/02/what-do-you-mean-by-archive-genres-of-usage-for-digital-preservers/ What Do you Mean by Archive? Genres of Usage for Digital Preservers]

Revision as of 17:02, 16 May 2018

File Format
Name Archiving
Ontology

{{{caption}}}

(Lossless, for generic data + file archives)

See also: Compression, Error detection and correction

Contents

Archiving only

(many files => 1 file, no compression attempted)

Compression + archiving

(Multiple files => 1 file, makes it smaller. See Compression for formats that compress single files/streams, and Disk Image Formats for formats that capture the low-level structure of a disk)

Diff files

(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)

See also Wikipedia:Data differencing.

Encrypted or cryptographically signed archives

Filesystem backup and restoration

Formats designed for backing up disks, usually specifically targeted at particular filesystems (but backing it up in the form of file structures, not raw sector images like Disk Image Formats).

Institutional archiving

Metadata formats

(see also Metadata, Bibliographic data)

Miscellaneous

Program/App/Applet/Installer specialized archive formats

Self-extracting archives

(in addition, some of the other archivers are able to produce executable files for some platform which include the archived data and a program to extract them, and generally have the file extension normal for executables, such as .exe for DOS/Windows. In many cases, if you rename the extension to the appropriate one for the archive type involved, e.g., .zip, the file will open in the appropriate archiver, which is handy for extracting them on systems not compatible with the original executable.)

Transfer encodings

(convert binaries to printable ASCII for download; encode resource forks or metadata together with file, etc.)

See also the main article, Transfer Encodings.

Links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox