Difference between revisions of "Samtools"

From wiki
Jump to: navigation, search
Line 12: Line 12:
 
= Tips =
 
= Tips =
  
* samtools commands always start with samtools and then a subcommand
+
* samtools commands follow the subcommand style: i.e. they always start with a "samtools" followed by subcommand (eg. "sort", "view", etc) which identify the exact operation samtools will perform. This is due to the fact that samtools is a suite of tools for handling alignment files in sam/bam format.
 
* Two input files will commonly be needed, often a sam/bam file and also the reference file.
 
* Two input files will commonly be needed, often a sam/bam file and also the reference file.
* '''view''' is a commonly used subcommand, whose name refers to internal viewing, because user visuals is not samtools strong point
+
* '''view''' is a commonly used subcommand, whose name however refers to internal viewing, because external user visuals is not samtools strong point, though it has some capabilities in this regard ("tview").
  
 
== tview ==
 
== tview ==

Revision as of 09:32, 29 June 2016

Introduction

Samtools hardly needs an introduction, it is one of the cornerstones of bioinformatics processing and is at the heart of the business of sequence aligning.

Despite that introduction, note that samtools does not actually carry out alignments itself. Rather it offers utilities attendant on real aligners such as bwa and bowtie.

Primarily this is due to its providing various tools (available as subcommands) centred on a well defined sequence alignment format, sam, and its binary (and therefore compressed) equivalent, bam.

This wiki page just offers some tips on how to use it, as there is plenty documentation elsewhere, some of which is mentioned in the links.

Tips

  • samtools commands follow the subcommand style: i.e. they always start with a "samtools" followed by subcommand (eg. "sort", "view", etc) which identify the exact operation samtools will perform. This is due to the fact that samtools is a suite of tools for handling alignment files in sam/bam format.
  • Two input files will commonly be needed, often a sam/bam file and also the reference file.
  • view is a commonly used subcommand, whose name however refers to internal viewing, because external user visuals is not samtools strong point, though it has some capabilities in this regard ("tview").

tview

While visualisation is not samtools' strong point, the tview subcommand does provide a raw view of the alignment that a bam files has. Sometimes such raw, and less pretty representations of the alignment can be useful. It is run like so:

samtools tview alignments/sim_reads_aligned.sorted.bam genomes/NC_008253.fna

Links