SAM
From Just Solve the File Format Problem
(Difference between revisions)
Dan Tobias (Talk | contribs) |
Dan Tobias (Talk | contribs) |
||
| (One intermediate revision by one user not shown) | |||
| Line 2: | Line 2: | ||
|formattype=electronic | |formattype=electronic | ||
|subcat=Scientific Data formats | |subcat=Scientific Data formats | ||
| + | |extensions={{ext|sam}}, {{ext|sai}} | ||
}} | }} | ||
'''SAM''' (Sequence Alignment/Map) is a data format used for mapping sequences of DNA. It is in text (tab-separated) form, while A companion format [[BAM (Binary Alignment Map)|BAM]] is binary. It is classified as an alignment format, as is [[CRAM]]. This is as opposed to sequence-only, unaligned, formats such as [[FASTA and FASTQ]]. | '''SAM''' (Sequence Alignment/Map) is a data format used for mapping sequences of DNA. It is in text (tab-separated) form, while A companion format [[BAM (Binary Alignment Map)|BAM]] is binary. It is classified as an alignment format, as is [[CRAM]]. This is as opposed to sequence-only, unaligned, formats such as [[FASTA and FASTQ]]. | ||
| + | |||
| + | Indexes associated with a SAM file are called SAI. | ||
== Identification == | == Identification == | ||
Latest revision as of 03:51, 4 August 2020
SAM (Sequence Alignment/Map) is a data format used for mapping sequences of DNA. It is in text (tab-separated) form, while A companion format BAM is binary. It is classified as an alignment format, as is CRAM. This is as opposed to sequence-only, unaligned, formats such as FASTA and FASTQ.
Indexes associated with a SAM file are called SAI.
[edit] Identification
Files will often start with a header line (though this is technically optional), such as @HD VN:1.6 SO:coordinate, where the VN parameter gives the version number of the format and this may be followed with a SO or G0 parameter giving sorting order or grouping of alignments.