BAM (Binary Alignment Map)
From Just Solve the File Format Problem
(Difference between revisions)
Dan Tobias (Talk | contribs) (Created page with "{{FormatInfo |formattype=electronic |subcat=Scientific Data formats }} '''BAM''' (Binary Alignment Map) is a data format used for mapping sequences of DNA. It is a compressed ...") |
Dan Tobias (Talk | contribs) |
||
(3 intermediate revisions by one user not shown) | |||
Line 2: | Line 2: | ||
|formattype=electronic | |formattype=electronic | ||
|subcat=Scientific Data formats | |subcat=Scientific Data formats | ||
+ | |extensions={{ext|bam}}, {{ext|bai}} | ||
}} | }} | ||
'''BAM''' (Binary Alignment Map) is a data format used for mapping sequences of DNA. It is a compressed binary version of the data in the [[SAM]] format. It is classified as an alignment format, as is [[CRAM]]. This is as opposed to sequence-only, unaligned, formats such as [[FASTA and FASTQ]]. | '''BAM''' (Binary Alignment Map) is a data format used for mapping sequences of DNA. It is a compressed binary version of the data in the [[SAM]] format. It is classified as an alignment format, as is [[CRAM]]. This is as opposed to sequence-only, unaligned, formats such as [[FASTA and FASTQ]]. | ||
BAM is compressed as a series of compressed blocks, each one a [[gzip]] archive. This is known as [[BGZF]] compression. | BAM is compressed as a series of compressed blocks, each one a [[gzip]] archive. This is known as [[BGZF]] compression. | ||
+ | |||
+ | An index of a BAM is stored in a format called BAI, also described in the spec. | ||
== Specifications == | == Specifications == | ||
Line 12: | Line 15: | ||
== Links == | == Links == | ||
− | * [[Wikipedia: | + | * [[Wikipedia:Binary Alignment Map|Wikipedia article]] |
* [https://github.com/samtools/hts-specs HTS-Specs Github] | * [https://github.com/samtools/hts-specs HTS-Specs Github] | ||
* [https://sourceforge.net/projects/samtools/lists/samtools-devel samtools-devel mailing list] | * [https://sourceforge.net/projects/samtools/lists/samtools-devel samtools-devel mailing list] |
Latest revision as of 03:51, 4 August 2020
BAM (Binary Alignment Map) is a data format used for mapping sequences of DNA. It is a compressed binary version of the data in the SAM format. It is classified as an alignment format, as is CRAM. This is as opposed to sequence-only, unaligned, formats such as FASTA and FASTQ.
BAM is compressed as a series of compressed blocks, each one a gzip archive. This is known as BGZF compression.
An index of a BAM is stored in a format called BAI, also described in the spec.