Archiving
Dan Tobias (Talk | contribs) (→Metadata formats) |
Revision as of 20:36, 15 May 2021
(Lossless, for generic data + file archives)
See also: Compression, Error detection and correction
Archiving only
(many files => 1 file, no compression attempted)
- AR file format (.a, .lib) -- Unix Archiver (ar) format as used by various compilers/linkers
- ARC (Internet Archive) (.arc)
- BagIt
- BAR (Age of Mythology)
- bar (SunOS 4)
- CAR (CarComp)
- cpio (.cpio)
- FAR (The Sims) (.far)
- Flash (Micro TECH Systems) (.flh)
- Hamster archive
- LBR (.lbr)
- LBR (Commodore) (.lbr)
- Lynx archive (Commodore) (.lnx)
- Mozilla Archive (.mar)
- pax
- Quake PAK
- SAVF
- SeqBox (.sbx)
- siva (.siva)
- Tape Archive (.tar)
- TPAC (Tim Gordon)
- VIV (.viv, .big; used in Electronic Arts games)
- Webarchive (Safari)
- XMIT
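As a rough illustration of the "many files => 1 file, no compression attempted" idea, the sketch below bundles a few in-memory files into a plain tar archive using Python's standard `tarfile` module. The helper names (`pack_uncompressed`, `unpack`) and the sample data are made up for this example; only tar is shown, and other formats in this list have their own layouts.

```python
import io
import tarfile


def pack_uncompressed(files: dict[str, bytes]) -> bytes:
    """Bundle named byte strings into one uncompressed tar archive."""
    buf = io.BytesIO()
    # Mode "w" with no ":gz"/":bz2"/":xz" suffix writes plain tar.
    with tarfile.open(fileobj=buf, mode="w") as tar:
        for name, data in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def unpack(archive: bytes) -> dict[str, bytes]:
    """Read every regular file back out of an uncompressed tar archive."""
    out = {}
    # Mode "r:" accepts only uncompressed tar input.
    with tarfile.open(fileobj=io.BytesIO(archive), mode="r:") as tar:
        for member in tar.getmembers():
            if member.isfile():
                out[member.name] = tar.extractfile(member).read()
    return out


# Hypothetical sample data for the demonstration.
tar_sample = {"a.txt": b"hello\n" * 100, "b.txt": b"world\n" * 100}
tar_archive = pack_uncompressed(tar_sample)
tar_restored = unpack(tar_archive)
```

Because nothing is compressed, the member data appears verbatim inside the archive, and the archive is larger than the sum of its inputs once headers and padding are counted.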
Compression + archiving
(Multiple files => 1 file, makes it smaller. See Compression for formats that compress single files/streams, and Disk Image Formats for formats that capture the low-level structure of a disk)
- 4Q (.4q)
- 777
- 7z (.7z)
- ACB
- ACE (.ace)
- afio
- AIN (.ain)
- ALZ (.alz) (ALZip)
- AMG (AMGC)
- AR7
- ARC (PC/MS-DOS) (.arc)
- ARC (Commodore) (.arc)
- ARC (FreeArc) (.arc)
- ARC (Internet Archive) (gzipped form: .arc.gz)
- ArcFS (RISC OS) (.arc)
- ArcMac
- ARG
- ARHANGEL
- ARJ
- ARX
- ASD Archiver
- Astrotite (.afa)
- B6Z (.b6z)
- BIX
- BlakHole (.bh)
- BRU (RSX-11M backup)
- BSArc and BSA
- Cabinet (.cab)
- CAR (a SAP archive format; also see SAR)
- CAR (MylesHi!)
- CFL (Compressed file library) (.cfl)
- ChiefLZ (LZA)
- CMZ
- CODEC (Telvox)
- Compact File Set (.cfs)
- Compact Pro (.cpt)
- CRUSH
- Disk Archiver (.dar)
- DGCA (.dgc)
- Diamond
- Doom WAD (.wad)
- DWC (.dwc)
- Dzip (.dz)
- Eazel (.eaz)
- EGG (ALZip) (.egg)
- Freeze!
- FTCOMP
- GCA (.gca)
- Genus Graphics Library
- Greenleaf ArchiveLib
- HA (.ha)
- HAP
- HKI (WinHKI) (.hki)
- HPACK (.hpk)
- HUF (CP/M)
- Hyper archive (.hyp)
- ICE (ICEOWS) (.ice)
- IMP (.imp)
- JAR (ARJ Software) (.j)
- JARCS (.jar)
- JRchive (.jrc)
- KGB Archiver (.kgb)
- Konqueror web archive (.war)
- LArc (.lzs)
- LHA (.lzh, .lha)
- LHARK
- LHice
- LIMIT (.lim)
- LZX (.lzx)
- MAFF (Mozilla web archive; uses ZIP for compression)
- MAr (Melting-Pot)
- MDCD (.md)
- Micrognosis Compression Archiver (MAR)
- Now Compress
- NuFX (.bxy)
- oPAQue
- packARC (.pja)
- Packdir (RISC OS)
- PackIt (.pit)
- PAK (.pak)
- PAK (ARC extension)
- PAQ (.pa6, .pa7, .pa8, others)
- PCX Library (.pcl)
- PEA (PeaZIP) (.pea)
- PerfectCompress (.uca)
- PIM (.pim)
- PLA (ARCHICAD Archive) (.pla)
- PMA (.pma)
- PPMd
- PUT (MicroFox) (.put)
- Quadruple D Archiver (.qda)
- Quantum compressed archive (.q)
- RAR (.rar)
- RARC (Nintendo)
- RAX
- RK (WinRK)
- RKIVE (.rkv)
- S7z (.s7z)
- SAR (a SAP archive format; also see CAR)
- SAR (Streamline Design)
- SARJ (.srj)
- Scifer (.sen, .ba, .xml)
- ShrinkIt (.shk)
- Slim!
- Softlib (Softdisk Publishing) (.slb)
- Spark (.spk)
- SQLite Archive (SQLAR)
- Squash (.arh)
- Squeeze It (.sqz)
- SQWEZ 2.x (.sqz)
- SQX (.sqx)
- StuffIt (.sit)
- StuffIt X (.sitx)
- TBAFS (RISC OS)
- TSComp (.tsc)
- UFA
- UHARC (.uha)
- UltraCompressor II (.uc, .uc0, .uc2, .ucn, .ur2, .ue2)
- WARC (.warc, .warc.gz)
- X (X1) (.x)
- xar (Extensible Archive) (.xar)
- XP3 (.xp3) - used in KiriKiri Adventure Game System
- XPACK (.xpa)
- YAC (.yc)
- Yamazaki zipper archive (.yz1) - used in DeepFreezer
- ZIP (.zip)
- Zipx (.zipx)
- Zoo (.zoo)
- ZPack Lite (.zpk) - See Z/Install Lite
- ZPAQ
- Zzip (.zz)
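A minimal sketch of "multiple files => 1 file, makes it smaller", using ZIP with Deflate through Python's standard `zipfile` module. The helper name `pack_compressed` and the repetitive sample data are assumptions chosen so that the size reduction is easy to see; real-world ratios depend entirely on the input.

```python
import io
import zipfile


def pack_compressed(files: dict[str, bytes]) -> bytes:
    """Bundle named byte strings into one Deflate-compressed ZIP archive."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for name, data in files.items():
            zf.writestr(name, data)
    return buf.getvalue()


# Hypothetical, highly repetitive sample data (compresses very well).
zip_sample = {"a.txt": b"hello\n" * 1000, "b.txt": b"world\n" * 1000}
zip_archive = pack_compressed(zip_sample)

# Read the members back to confirm the round trip is lossless.
with zipfile.ZipFile(io.BytesIO(zip_archive)) as zf:
    zip_restored = {name: zf.read(name) for name in zf.namelist()}

zip_raw_size = sum(len(data) for data in zip_sample.values())
print(f"raw: {zip_raw_size} bytes, archive: {len(zip_archive)} bytes")
```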
Diff files
(containing only the parts of a file that have changed, so they can be applied to an existing file to update it; used for update distribution and incremental backups)
- ACB
- binpatch (Jaap Korthals Altes)
- bsdiff
- Courgette
- diff
- FidoNet nodediff
- GDIFF
- IPS (binary patch format)
- JSON Patch
- LDIFF LZD
- RTPatch
- Unified diff
- UPS (binary patch format)
- VCDIFF
See also Wikipedia:Data differencing.
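The idea of storing only the changed parts and applying them to an existing file can be sketched with a toy delta built on Python's standard `difflib.SequenceMatcher`. The operation names ("copy", "insert") and the helpers `make_delta` and `apply_delta` are invented for this example; this is not the encoding used by bsdiff, VCDIFF, IPS, or any other format listed above.

```python
from difflib import SequenceMatcher


def make_delta(old: bytes, new: bytes) -> list[tuple]:
    """Describe `new` as copies from `old` plus literal inserted bytes."""
    ops = []
    matcher = SequenceMatcher(None, old, new, autojunk=False)
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            # Unchanged region: store only its position in the old file.
            ops.append(("copy", i1, i2))
        elif tag in ("replace", "insert"):
            # Changed or new region: store the new bytes themselves.
            ops.append(("insert", new[j1:j2]))
        # "delete" needs no entry: the old bytes are simply not copied.
    return ops


def apply_delta(old: bytes, ops: list[tuple]) -> bytes:
    """Rebuild the new file from the old file and a delta."""
    out = bytearray()
    for op in ops:
        if op[0] == "copy":
            out += old[op[1]:op[2]]
        else:
            out += op[1]
    return bytes(out)


# Hypothetical sample: one word changes in an otherwise identical file.
old_version = b"The quick brown fox jumps over the lazy dog.\n" * 20
new_version = old_version.replace(b"lazy", b"sleepy", 1)

delta = make_delta(old_version, new_version)
patched = apply_delta(old_version, delta)
literal_bytes = sum(len(op[1]) for op in delta if op[0] == "insert")
```

Only the literal bytes need to travel with the delta; everything else is recovered from the copy the recipient already has, which is why such files suit update distribution and incremental backups.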
Encrypted or cryptographically signed archives
File splitting
(1 file => multiple files. Most file splitting utilities just produce raw file fragments, and don't have a native file format. But some use a container format, and some create a separate "control" file for metadata.)
Also, the two sorts of Horcrux, Horcrux (jesseduffield) and Horcrux (kndyry), are split-and-encrypted files.
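A minimal sketch of the raw-fragment approach to splitting, with a separate "control" record for metadata. The field names in the control record (`total_size`, `parts`, `sha256`) and the helper names are assumptions made for illustration and do not correspond to any particular splitting utility.

```python
import hashlib


def split_bytes(data: bytes, chunk_size: int) -> tuple[list[bytes], dict]:
    """Cut data into raw fragments and build a small control record."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    parts = [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]
    control = {
        "total_size": len(data),
        "parts": len(parts),
        "sha256": hashlib.sha256(data).hexdigest(),
    }
    return parts, control


def join_parts(parts: list[bytes], control: dict) -> bytes:
    """Concatenate fragments and verify them against the control record."""
    data = b"".join(parts)
    if (len(parts) != control["parts"]
            or len(data) != control["total_size"]
            or hashlib.sha256(data).hexdigest() != control["sha256"]):
        raise ValueError("reassembled data does not match control record")
    return data


# Hypothetical payload: 2560 bytes split into 1000-byte fragments.
payload = bytes(range(256)) * 10
fragments, control_record = split_bytes(payload, 1000)
rejoined = join_parts(fragments, control_record)
```

The fragments themselves carry no header, so without the control record the only way to reassemble them is to know their order, which is why some tools add a container or metadata file.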
Filesystem backup and restoration
Formats designed for backing up disks, usually targeted at particular filesystems (but backing them up in the form of file structures, not raw sector images like Disk Image Formats).
Institutional archiving
- ARK (identifier for archived documents)
- OAIS (Open Archival Information System, a reference model for organizations that do archiving)
- Planets Core Registry
- PRONOM
- SIRF (Self-Contained Information Retention Format)
- Submission Information Package
- UPF (Universal Preservation Format)
Metadata formats
- Archive Team hostname file
- CDX (associated with web archives)
- DFDL (Data Format Description Language) - a file format for describing file formats
- FilmConservationMetadata
- Internet Archive metadata
- mtree
- Portland Common Data Model
- TOSEC Naming Convention
- Web Archive Metadata File (metadata for WARC)
(see also Metadata, Bibliographic data)
Miscellaneous
Program/App/Applet/Installer specialized archive formats
- APK (Android app)
- Bundle file (Linux)
- Bundle file (OS X)
- Cabal
- Cakewalk Bundle (BUN)
- Cakewalk Bundle (CWB)
- COM-Pack library
- Cross-Platform Installer Module
- deb (Debian package)
- Image Packaging System (IPS) archive (.p5p)
- Image Packaging System (IPS) manifest (.p5m)
- Inno Setup self-extracting archive
- InstallShield CAB
- InstallShield installer archive
- InstallShield Z
- IPA (.ipa) - iOS app in archived form; actually in ZIP format
- Jar (.j, .jar) (Java) - this is just a renamed ZIP file with some mandatory files
- Meta-Package File (.mpkg) (Mac OS X)
- Microsoft Update Standalone Package
- NSIS (Nullsoft Scriptable Install System)
- Olive (executable virtual-machine archive of software)
- Package File (OS X) (.pkg)
- PackageForTheWeb
- RPM (Red Hat Package Manager)
- SIS (Symbian)
- Snap
- SWG (SWAG)
- Unity package file
- WAD (Wii)
- Windows Installer (.msi, .msp)
- Wise installer package
- XAR (Executable Archive)
- Z/Install Lite
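Several entries above are described as ZIP files under another name. As a rough sketch, the code below builds a small JAR-like archive in memory and then inspects it with Python's generic `zipfile` module, ignoring whatever extension the file might carry. The member names and contents are placeholders (the class file is four dummy bytes, not real bytecode), and the helper names are hypothetical.

```python
import io
import zipfile


def build_jar_like() -> bytes:
    """Create a tiny ZIP laid out like a JAR (placeholder contents only)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        # Conventional manifest location inside a JAR.
        zf.writestr("META-INF/MANIFEST.MF", "Manifest-Version: 1.0\r\n\r\n")
        # Placeholder standing in for a compiled class; not valid bytecode.
        zf.writestr("Hello.class", b"\xca\xfe\xba\xbe")
    return buf.getvalue()


def list_members(blob: bytes) -> list[str]:
    """List member names if the blob is a ZIP, whatever it is called."""
    if not zipfile.is_zipfile(io.BytesIO(blob)):
        raise ValueError("not a ZIP-based package")
    with zipfile.ZipFile(io.BytesIO(blob)) as zf:
        return zf.namelist()


jar_bytes = build_jar_like()
jar_members = list_members(jar_bytes)
```

The same check applies to any package that is a ZIP underneath: the reader looks at the structure of the data, not at the file extension.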
Self-extracting archives
(In addition, some of the other archivers can produce executable files for a given platform that bundle the archived data with a program to extract it. These generally carry the platform's normal executable extension, such as .exe for DOS/Windows. In many cases, renaming the file to the extension of the underlying archive type, e.g. .zip, lets the appropriate archiver open it, which is handy for extracting on systems that cannot run the original executable.)
- COMPACK / BUILDSFX
- SDA (Self Dissolving Archive)
- SEA (StuffIt) (StuffIt Self-Extracting Archive)
- Self-extracting ZIP
- SFX (Self-Extracting Archive)
- shar (Shell Archive, in Unix-like systems)
- SQWEZ
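The renaming trick described above works for ZIP-based self-extractors because a ZIP reader locates the archive from its end-of-file directory rather than from the first bytes. The sketch below simulates this in memory: a dummy "stub" (64 placeholder bytes, not a real executable) is placed in front of an ordinary ZIP, and Python's standard `zipfile` module still reads it. Other self-extracting formats may not tolerate this, so treat it as an illustration for the ZIP case only.

```python
import io
import zipfile

# Build an ordinary ZIP archive in memory.
plain = io.BytesIO()
with zipfile.ZipFile(plain, "w", compression=zipfile.ZIP_DEFLATED) as zf:
    zf.writestr("readme.txt", b"extracted without running the stub\n")
zip_bytes = plain.getvalue()

# Placeholder for the extractor program; NOT a working executable.
stub = b"MZ" + b"\x00" * 62

# A self-extracting ZIP is, in this simplified model, stub + archive.
sfx_bytes = stub + zip_bytes

# A generic ZIP reader can still open it and read the payload.
sfx_is_zip = zipfile.is_zipfile(io.BytesIO(sfx_bytes))
with zipfile.ZipFile(io.BytesIO(sfx_bytes)) as zf:
    sfx_names = zf.namelist()
    sfx_payload = zf.read("readme.txt")
```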
Software
Refer to Compression and archiving software.
Transfer formats
(encapsulate a system-specific file structure to make it compatible with foreign systems, file transfer protocols, etc.)
- AppleDouble
- AppleSingle
- Bento (1990s Apple cross-platform compound content format)
- Binary II (.bny) (for Apple II series) (often squeezed as .bqy)
- BinHex (.hqx)
- GEOS Convert (.cvt)
- MacBinary
- oric-dsk-manager exported file
- TI variable file
For low-level transfer encodings, see Transfer Encodings.