Archiving

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Metadata formats)
(Program/App/Applet/Installer specialized archive formats)
(39 intermediate revisions by 5 users not shown)
Line 15: Line 15:
 
* [[ARC (Internet Archive)]] (.arc)
 
* [[ARC (Internet Archive)]] (.arc)
 
* [[BagIt]]
 
* [[BagIt]]
 +
* [[BAR (Age of Mythology)]]
 
* [[bar]] (SunOS 4)
 
* [[bar]] (SunOS 4)
 
* [[cpio]] (.cpio)
 
* [[cpio]] (.cpio)
 +
* [[Hamster archive]]
 
* [[LBR]] (.lbr)
 
* [[LBR]] (.lbr)
 
* [[LBR (Commodore)]] (.lbr)
 
* [[LBR (Commodore)]] (.lbr)
Line 22: Line 24:
 
* [[Mozilla Archive]] (.mar)
 
* [[Mozilla Archive]] (.mar)
 
* [[pax]]
 
* [[pax]]
 +
* [[SeqBox]] (.sbx)
 
* [[Tape Archive]] (.tar)
 
* [[Tape Archive]] (.tar)
* [[VIV]] (.viv; used in Electronic Arts games)
+
* [[VIV]] (.viv, .big; used in Electronic Arts games)
 +
* [[Webarchive (Safari)]]
  
 
== Compression + archiving ==
 
== Compression + archiving ==
 
(Multiple files => 1 file, makes it smaller. See [[Compression]] for formats that compress single files/streams, and [[Disk Image Formats]] for formats that capture the low-level structure of a disk)
 
(Multiple files => 1 file, makes it smaller. See [[Compression]] for formats that compress single files/streams, and [[Disk Image Formats]] for formats that capture the low-level structure of a disk)
  
 +
* [[4Q]] (.4q)
 
* [[7z]] (.7z)
 
* [[7z]] (.7z)
 
* [[ACE]] (.ace)
 
* [[ACE]] (.ace)
Line 39: Line 44:
 
* [[ARJ]]
 
* [[ARJ]]
 
* [[Astrotite]] (.afa)
 
* [[Astrotite]] (.afa)
 +
* [[B6Z]] (.b6z)
 
* [[BlakHole]] (.bh)
 
* [[BlakHole]] (.bh)
 
* [[BRU]] - (RSX-11M backup)
 
* [[BRU]] - (RSX-11M backup)
 
* [[Cabinet]] (.cab)
 
* [[Cabinet]] (.cab)
 +
* [[CAR]] (a SAP archive format; also see [[SAR]])
 
* [[Compact File Set]] (.cfs)
 
* [[Compact File Set]] (.cfs)
 
* [[Compact Pro]] (.cpt)
 
* [[Compact Pro]] (.cpt)
Line 48: Line 55:
 
* [[DWC]] (.dwc)
 
* [[DWC]] (.dwc)
 
* [[EGG (ALZip)]] (.egg)
 
* [[EGG (ALZip)]] (.egg)
 +
* [[FTCOMP]]
 
* [[GCA]] (.gca)
 
* [[GCA]] (.gca)
 
* [[Greenleaf ArchiveLib]]
 
* [[Greenleaf ArchiveLib]]
 
* [[HA]] (.ha)
 
* [[HA]] (.ha)
 
* [[HKI]] (WinHKI) (.hki)
 
* [[HKI]] (WinHKI) (.hki)
* [[ICE]] (.ice)
+
* [[ICE (ICEOWS)]] (.ice)
 
* [[IMP]] (.imp)
 
* [[IMP]] (.imp)
 
* [[JAR (ARJ Software)]] (.j)
 
* [[JAR (ARJ Software)]] (.j)
Line 58: Line 66:
 
* [[LHA]] (.lzh, .lha)
 
* [[LHA]] (.lzh, .lha)
 
* [[LZX]] (.lzx)
 
* [[LZX]] (.lzx)
 +
* [[MAFF]] (Mozilla web archive; uses ZIP for compression)
 
* [[NuFX]] (.bxy)
 
* [[NuFX]] (.bxy)
 +
* [[packARC]] (.pja)
 +
* [[Packdir]] (RISC OS)
 
* [[PackIt]] (.pit)
 
* [[PackIt]] (.pit)
 
* [[PAK]] (.pak)  
 
* [[PAK]] (.pak)  
Line 69: Line 80:
 
* [[Quantum compressed archive]] (.q)
 
* [[Quantum compressed archive]] (.q)
 
* [[RAR]] (.rar)
 
* [[RAR]] (.rar)
 +
* [[RARC (Nintendo)]]
 
* [[RK]] (WinRK)
 
* [[RK]] (WinRK)
 
* [[S7z]] (.s7z)
 
* [[S7z]] (.s7z)
 +
* [[SAR]] (a SAP archive format; also see [[CAR]])
 
* [[Scifer]] (.sen, .ba, .xml)
 
* [[Scifer]] (.sen, .ba, .xml)
 
* [[ShrinkIt]] (.shk)
 
* [[ShrinkIt]] (.shk)
Line 81: Line 94:
 
* [[UltraCompressor II]] (.uc, .uc0, .uc2, .ucn, .ur2, .ue2)
 
* [[UltraCompressor II]] (.uc, .uc0, .uc2, .ucn, .ur2, .ue2)
 
* [[WARC]] (.warc, .warc.gz)
 
* [[WARC]] (.warc, .warc.gz)
* [[xar]] (.xar)
+
* [[Xar (eXtensible ARchive)|xar (eXtensible ARchive)]] (.xar)
 
* [[XP3]] (.xp3) - used in [[KiriKiri Adventure Game System]]
 
* [[XP3]] (.xp3) - used in [[KiriKiri Adventure Game System]]
 
* [[Yamazaki zipper archive]] (.yz1) - used in DeepFreezer
 
* [[Yamazaki zipper archive]] (.yz1) - used in DeepFreezer
Line 87: Line 100:
 
* [[Zipx]] (.zipx)
 
* [[Zipx]] (.zipx)
 
* [[ZOO]] (.zoo)
 
* [[ZOO]] (.zoo)
 +
* [[ZPAQ]]
 
* [[Zzip]] (.zz)
 
* [[Zzip]] (.zz)
  
 
== Diff files ==
 
== Diff files ==
 
(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)
 
(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)
 +
* [[binpatch (Jaap Korthals Altes)]]
 
* [[bsdiff]]
 
* [[bsdiff]]
 
* [[Courgette]]
 
* [[Courgette]]
Line 96: Line 111:
 
* [[FidoNet nodediff]]
 
* [[FidoNet nodediff]]
 
* [[GDIFF]]
 
* [[GDIFF]]
 +
* [[IPS (binary patch format)]]
 +
* [[JSON Patch]]
 
* [[RTPatch]]
 
* [[RTPatch]]
 
* [[Unified diff]]
 
* [[Unified diff]]
 +
* [[UPS (binary patch format)]]
 +
* [[VCDIFF]]
  
== Filesystem backup ==
+
See also [[Wikipedia:Data differencing]].
 +
 
 +
== Encrypted or cryptographically signed archives ==
 +
* [[XIP]]
 +
* [[ZED]]
 +
 
 +
== Filesystem backup and restoration ==
 
Formats designed for backing up disks, usually specifically targeted at particular [[filesystem]]s (but backing it up in the form of file structures, not raw sector images like [[Disk Image Formats]]).
 
Formats designed for backing up disks, usually specifically targeted at particular [[filesystem]]s (but backing it up in the form of file structures, not raw sector images like [[Disk Image Formats]]).
 
* [[BACKUP (MS-DOS)]]
 
* [[BACKUP (MS-DOS)]]
 +
* [[BlockHashLoc]]
 
* [[Partimage]]  
 
* [[Partimage]]  
 
* [[ufsdump]]
 
* [[ufsdump]]
Line 107: Line 133:
  
 
== Institutional archiving ==
 
== Institutional archiving ==
 +
* [[ARK]] (identifier for archived documents)
 
* [[OAIS]] (Open Archival Information System, an organization that does archiving)
 
* [[OAIS]] (Open Archival Information System, an organization that does archiving)
 
* [[Planets Core Registry]]
 
* [[Planets Core Registry]]
 
* [[PRONOM]]
 
* [[PRONOM]]
 
** [[PUID]]
 
** [[PUID]]
 +
* [[SIRF]] (Self-Contained Information Retention Format)
 
* [[Submission Information Package]]
 
* [[Submission Information Package]]
 +
* [[UPF]] (Universal Preservation Format)
  
 
== Metadata formats ==
 
== Metadata formats ==
 
* [[Archive Team hostname file]]
 
* [[Archive Team hostname file]]
 +
* [[CDX]] (associated with web archives)
 
* [[DFDL]] (Data Format Description Language) - a file format for describing file formats
 
* [[DFDL]] (Data Format Description Language) - a file format for describing file formats
 
* [[Internet Archive metadata]]
 
* [[Internet Archive metadata]]
 
* [[mtree]]
 
* [[mtree]]
 +
* [[Portland Common Data Model]]
 
* [[TOSEC Naming Convention]]
 
* [[TOSEC Naming Convention]]
 
* [[Web Archive Metadata File]] (metadata for [[WARC]])
 
* [[Web Archive Metadata File]] (metadata for [[WARC]])
Line 129: Line 160:
 
== Program/App/Applet/Installer specialized archive formats ==
 
== Program/App/Applet/Installer specialized archive formats ==
 
* [[APK]] (Android app)
 
* [[APK]] (Android app)
 +
* [[Bento]] (1990s Apple cross-platform compound content format)
 
* [[Bundle file (Linux)]]
 
* [[Bundle file (Linux)]]
 
* [[Bundle file (OS X)]]
 
* [[Bundle file (OS X)]]
 +
* [[Cabal]]
 +
* [[Cross-Platform Installer Module]]
 
* [[deb]] (Debian package)
 
* [[deb]] (Debian package)
 
* [[GEOS Convert]] (.cvt)
 
* [[GEOS Convert]] (.cvt)
Line 137: Line 171:
 
* [[Inno Setup self-extracting archive]]
 
* [[Inno Setup self-extracting archive]]
 
* [[InstallShield CAB]]
 
* [[InstallShield CAB]]
 +
* [[InstallShield installer archive]]
 
* [[InstallShield Z]]
 
* [[InstallShield Z]]
 
* [[IPA]] (.ipa) => iOS app in archived form; actually in zip format
 
* [[IPA]] (.ipa) => iOS app in archived form; actually in zip format
 
* [[Jar]] (.j, .jar) (Java) => this is just a renamed zip file with some mandatory files
 
* [[Jar]] (.j, .jar) (Java) => this is just a renamed zip file with some mandatory files
 
* [[Meta-Package File]] (.mpkg) (Mac OS X)
 
* [[Meta-Package File]] (.mpkg) (Mac OS X)
 +
* [[Microsoft Update Standalone Package]]
 
* [[NSIS]] (Nullsoft Scriptable Install System)
 
* [[NSIS]] (Nullsoft Scriptable Install System)
 +
* [[Olive]] (executable virtual-machine archive of software)
 
* [[Package File (OS X)]] (.pkg)
 
* [[Package File (OS X)]] (.pkg)
 
* [[RPM]] (RedHat Package Manager)
 
* [[RPM]] (RedHat Package Manager)
Line 147: Line 184:
 
* [[TI variable file]]
 
* [[TI variable file]]
 
* [[Windows Installer]] (.msi, .msp)
 
* [[Windows Installer]] (.msi, .msp)
 +
* [[XAR (Executable Archive)]]
  
 
== Self-extracting archives ==
 
== Self-extracting archives ==
Line 168: Line 206:
 
* [http://www.coredumps.de/doc/dump/zwicky/testdump.doc.html Torture-testing Backup and Archive Programs: Things You Ought to Know But Probably Would Rather Not]
 
* [http://www.coredumps.de/doc/dump/zwicky/testdump.doc.html Torture-testing Backup and Archive Programs: Things You Ought to Know But Probably Would Rather Not]
 
* [http://www.ietf.org/mail-archive/web/apps-discuss/current/msg13094.html Discussion of proposed top-level MIME type for 'archive']
 
* [http://www.ietf.org/mail-archive/web/apps-discuss/current/msg13094.html Discussion of proposed top-level MIME type for 'archive']
 +
* [http://blogs.loc.gov/digitalpreservation/2014/02/what-do-you-mean-by-archive-genres-of-usage-for-digital-preservers/ What Do you Mean by Archive? Genres of Usage for Digital Preservers]

Revision as of 15:20, 22 September 2018

File Format
Name Archiving
Ontology

{{{caption}}}

(Lossless, for generic data + file archives)

See also: Compression, Error detection and correction

Contents

Archiving only

(many files => 1 file, no compression attempted)

Compression + archiving

(Multiple files => 1 file, makes it smaller. See Compression for formats that compress single files/streams, and Disk Image Formats for formats that capture the low-level structure of a disk)

Diff files

(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)

See also Wikipedia:Data differencing.

Encrypted or cryptographically signed archives

Filesystem backup and restoration

Formats designed for backing up disks, usually specifically targeted at particular filesystems (but backing it up in the form of file structures, not raw sector images like Disk Image Formats).

Institutional archiving

Metadata formats

(see also Metadata, Bibliographic data)

Miscellaneous

Program/App/Applet/Installer specialized archive formats

Self-extracting archives

(in addition, some of the other archivers are able to produce executable files for some platform which include the archived data and a program to extract them, and generally have the file extension normal for executables, such as .exe for DOS/Windows. In many cases, if you rename the extension to the appropriate one for the archive type involved, e.g., .zip, the file will open in the appropriate archiver, which is handy for extracting them on systems not compatible with the original executable.)

Transfer encodings

(convert binaries to printable ASCII for download; encode resource forks or metadata together with file, etc.)

See also the main article, Transfer Encodings.

Links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox