BAM (Binary Alignment Map)
From Just Solve the File Format Problem
(Difference between revisions)
Dan Tobias (Talk | contribs) (→Links) |
Dan Tobias (Talk | contribs) |
||
(2 intermediate revisions by one user not shown) | |||
Line 2: | Line 2: | ||
|formattype=electronic | |formattype=electronic | ||
|subcat=Scientific Data formats | |subcat=Scientific Data formats | ||
+ | |extensions={{ext|bam}}, {{ext|bai}} | ||
}} | }} | ||
'''BAM''' (Binary Alignment Map) is a data format used for mapping sequences of DNA. It is a compressed binary version of the data in the [[SAM]] format. It is classified as an alignment format, as is [[CRAM]]. This is as opposed to sequence-only, unaligned, formats such as [[FASTA and FASTQ]]. | '''BAM''' (Binary Alignment Map) is a data format used for mapping sequences of DNA. It is a compressed binary version of the data in the [[SAM]] format. It is classified as an alignment format, as is [[CRAM]]. This is as opposed to sequence-only, unaligned, formats such as [[FASTA and FASTQ]]. | ||
BAM is compressed as a series of compressed blocks, each one a [[gzip]] archive. This is known as [[BGZF]] compression. | BAM is compressed as a series of compressed blocks, each one a [[gzip]] archive. This is known as [[BGZF]] compression. | ||
+ | |||
+ | An index of a BAM is stored in a format called BAI, also described in the spec. | ||
== Specifications == | == Specifications == |
Latest revision as of 03:51, 4 August 2020
BAM (Binary Alignment Map) is a data format used for mapping sequences of DNA. It is a compressed binary version of the data in the SAM format. It is classified as an alignment format, as is CRAM. This is as opposed to sequence-only, unaligned, formats such as FASTA and FASTQ.
BAM is compressed as a series of compressed blocks, each one a gzip archive. This is known as BGZF compression.
An index of a BAM is stored in a format called BAI, also described in the spec.