File identification software
From Just Solve the File Format Problem
				
								
				(Difference between revisions)
				
																
				
				
								
				 (updated DROID requirements)  | 
			Dan Tobias  (Talk | contribs)   | 
			||
| Line 17: | Line 17: | ||
* [[JHOVE]] (tool to classify/identify/validate file formats)  | * [[JHOVE]] (tool to classify/identify/validate file formats)  | ||
* [[MediaInfo]] (cross-platform, open source, [http://mediainfo.sourceforge.net/en website]): "MediaInfo is a convenient unified display of the most relevant technical and tag data for video and audio files."  | * [[MediaInfo]] (cross-platform, open source, [http://mediainfo.sourceforge.net/en website]): "MediaInfo is a convenient unified display of the most relevant technical and tag data for video and audio files."  | ||
| + | * [[PHP PRONOM drip]]: Recognize file formats using PRONOM registry (open source, [http://www.phpclasses.org/package/9095-PHP-Recognize-file-formats-using-PRONOM-registry.html website])  | ||
* [[Siegfried]] (signature-based file identification tool) [http://www.itforarchivists.com/siegfried website] [http://www.openplanetsfoundation.org/blogs/2014-09-27-siegfried-pronom-based-file-format-identification-tool blog post]  | * [[Siegfried]] (signature-based file identification tool) [http://www.itforarchivists.com/siegfried website] [http://www.openplanetsfoundation.org/blogs/2014-09-27-siegfried-pronom-based-file-format-identification-tool blog post]  | ||
* [[TrID]] (Windows/Linux, free for non-commercial use, [http://mark0.net/soft-trid-e.html website]): identifies files using a database of filetype signatures. Also has an [http://mark0.net/onlinetrid.aspx online version].  | * [[TrID]] (Windows/Linux, free for non-commercial use, [http://mark0.net/soft-trid-e.html website]): identifies files using a database of filetype signatures. Also has an [http://mark0.net/onlinetrid.aspx online version].  | ||
Revision as of 04:06, 11 April 2015
| Software | > | File identification software | 
Software that automates the process of Identifying Files.
- Apache Tika (cross-platform, open source, website): "The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries." Written in Java.
 - DROID (cross-platform, open source, website): "DROID is a software tool developed by The National Archives [of the United Kingdom] to perform automated batch identification of file formats." Requires Java 7 or 8 (Version 6.1.5).
 - FIDO (cross-platform, open source) website: Format Identification for Digital Objects, written in Python.
 - FIDOO (web-based online file identification): website
 - File command (various implementations): a standard Unix command, found on almost all Unix and Unix-like (i.e., Linux) systems. See the Debian man page for an overview.
 - File Information Tool Set: software from the Harvard University library to identify file formats and extract metadata
 - FI Tools (Windows, commercial, website)
 - G-Spot (Windows, freeware, website): Identifies audio and video codecs need to play a media file.
 - JHOVE (tool to classify/identify/validate file formats)
 - MediaInfo (cross-platform, open source, website): "MediaInfo is a convenient unified display of the most relevant technical and tag data for video and audio files."
 - PHP PRONOM drip: Recognize file formats using PRONOM registry (open source, website)
 - Siegfried (signature-based file identification tool) website blog post
 - TrID (Windows/Linux, free for non-commercial use, website): identifies files using a database of filetype signatures. Also has an online version.