File identification software

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
m
Line 9: Line 9:
 
* [[Apache Tika]] (cross-platform, open source, [http://tika.apache.org/ website]): "The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries." Written in [[Java]].
 
* [[Apache Tika]] (cross-platform, open source, [http://tika.apache.org/ website]): "The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries." Written in [[Java]].
 
* [[DROID]] (cross-platform, open source, [http://digital-preservation.github.com/droid/ website]): ''"DROID is a software tool developed by The National Archives [of the United Kingdom] to perform automated batch identification of file formats."'' Requires [[Java]] 6, will not run on Java 7 as of 28 Oct 2012.
 
* [[DROID]] (cross-platform, open source, [http://digital-preservation.github.com/droid/ website]): ''"DROID is a software tool developed by The National Archives [of the United Kingdom] to perform automated batch identification of file formats."'' Requires [[Java]] 6, will not run on Java 7 as of 28 Oct 2012.
* [[FIDO]] (cross-platform, open source, [http://www.openplanetsfoundation.org/software/fido website]: Format Identification for Digital Objects, written in [[Python]].
+
* [[FIDO]] (cross-platform, open source) [http://www.openplanetsfoundation.org/software/fido website]: Format Identification for Digital Objects, written in [[Python]].
 +
* [[FIDOO]] (web-based online file identification): [http://www.techmaurice.com/fidoo/ website]
 
* [[File command]] (various implementations): a standard Unix command, found on almost all Unix and Unix-like (i.e., Linux) systems. See the [http://manpages.debian.net/cgi-bin/man.cgi?query=file&apropos=0&sektion=0&manpath=Debian+6.0+squeeze&format=html&locale=en Debian man page] for an overview.
 
* [[File command]] (various implementations): a standard Unix command, found on almost all Unix and Unix-like (i.e., Linux) systems. See the [http://manpages.debian.net/cgi-bin/man.cgi?query=file&apropos=0&sektion=0&manpath=Debian+6.0+squeeze&format=html&locale=en Debian man page] for an overview.
 
* [[File Information Tool Set]]: software from the Harvard University library to identify file formats and extract metadata
 
* [[File Information Tool Set]]: software from the Harvard University library to identify file formats and extract metadata

Revision as of 23:49, 3 November 2014

Software > File identification software

Software that automates the process of Identifying Files.

  • Apache Tika (cross-platform, open source, website): "The Apache Tika™ toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries." Written in Java.
  • DROID (cross-platform, open source, website): "DROID is a software tool developed by The National Archives [of the United Kingdom] to perform automated batch identification of file formats." Requires Java 6, will not run on Java 7 as of 28 Oct 2012.
  • FIDO (cross-platform, open source) website: Format Identification for Digital Objects, written in Python.
  • FIDOO (web-based online file identification): website
  • File command (various implementations): a standard Unix command, found on almost all Unix and Unix-like (i.e., Linux) systems. See the Debian man page for an overview.
  • File Information Tool Set: software from the Harvard University library to identify file formats and extract metadata
  • FI Tools (Windows, commercial, website)
  • G-Spot (Windows, freeware, website): Identifies audio and video codecs need to play a media file.
  • JHOVE (tool to classify/identify/validate file formats)
  • MediaInfo (cross-platform, open source, website): "MediaInfo is a convenient unified display of the most relevant technical and tag data for video and audio files."
  • Siegfried (signature-based file identification tool) website blog post
  • TrID (Windows/Linux, free for non-commercial use, website): identifies files using a database of filetype signatures. Also has an online version.

References

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox