Difference between revisions of "Bowtie"
Line 21: | Line 21: | ||
= the alignment step = | = the alignment step = | ||
− | We shall deal with colorspace files here, because there is only an added '''-C''' in the options for them, and so it can be ommited when they are not colorspace files. Here is an | + | We shall deal with colorspace files here, because there is only an added '''-C''' in the options for them, and so it can be ommited when they are not colorspace files. Here is an template for such a command: |
bowtie -S -C <label_for_index_of_reference> <uncompressed_fastq_filename> <output_filename_with_sam_extension> | bowtie -S -C <label_for_index_of_reference> <uncompressed_fastq_filename> <output_filename_with_sam_extension> |
Revision as of 12:38, 15 November 2016
Introduction
This article refers to the early version of bowtie, which in fact has a few features that version 2 does not have, such as colorspace data processing.
Indexing the reference
This is done using the bowtie-build command and the input reference with the second argument being a label for the output files. It is an easy command, no options are required. Here is an example:
bowtie-build NC_002127.fna e_coli_O157_H7
Explanation:
- bowtie-build is indexing creation tol of bowtie
- in this example NC_002127.fna is our reference and is the first argument
- the second argument is a label for the output index files, in this case e_coli_O157_H7
- The extensions of the output files will have the extension ebwt. The root name will be the label mentioned above with some changing added characters (1, 2, 3, 4, rev.1, rev.2)
colorspace indexing
A big advantage of this first version of bowtie is its ability to handle colorspace fastq files. For thsi, a special index of the reference sequence must be generated, using bowtie-build and the special option -C. The rest of the command is the same, and you would of course use a different label for later processing with colorspace files.
the alignment step
We shall deal with colorspace files here, because there is only an added -C in the options for them, and so it can be ommited when they are not colorspace files. Here is an template for such a command:
bowtie -S -C <label_for_index_of_reference> <uncompressed_fastq_filename> <output_filename_with_sam_extension>