Difference between revisions of "Tophat"
(Created page with "Introduction = The well known sloice-aware aligner or mapper, now being phased out in favour of HISAT2 = Usage = module load tophat This will load the latest version (2.1...") |
|||
(One intermediate revision by the same user not shown) | |||
Line 1: | Line 1: | ||
− | Introduction = | + | =Introduction = |
The well known sloice-aware aligner or mapper, now being phased out in favour of HISAT2 | The well known sloice-aware aligner or mapper, now being phased out in favour of HISAT2 | ||
Line 12: | Line 12: | ||
Note the reference to the webpage at the end, that proves to be an easier help resource. As it's some what long, the important parts are extracted here. | Note the reference to the webpage at the end, that proves to be an easier help resource. As it's some what long, the important parts are extracted here. | ||
+ | |||
+ | == TopHat Output == | ||
+ | |||
+ | The tophat script produces a number of files in the directory in which it was invoked. Most of these files are internal, intermediate files that are generated for use within the pipeline. The output files you will likely want to look at are: | ||
+ | |||
+ | * <code>accepted_hits.bam</code>, A list of read alignments in binary SAM format. This is the chief output file. | ||
+ | * <code>junctions.bed</code>, A UCSC BED track of junctions reported by TopHat. Each junction consists of two connected BED blocks, where each block is as long as the maximal overhang of any read spanning the junction. The score is the number of alignments spanning the junction. | ||
+ | * <code>insertions.bed</code> and <code>deletions.bed</code>. UCSC BED tracks of insertions and deletions reported by TopHat. | ||
+ | :- <code>insertions.bed</code> - chromLeft refers to the last genomic base before the insertion. | ||
+ | :- <code>deletions.bed</code> - chromLeft refers to the first genomic base of the deletion. |
Latest revision as of 16:51, 7 May 2017
Introduction
The well known sloice-aware aligner or mapper, now being phased out in favour of HISAT2
Usage
module load tophat
This will load the latest version (2.1.1 from 2016). To see the help:
tophat -h
Note the reference to the webpage at the end, that proves to be an easier help resource. As it's some what long, the important parts are extracted here.
TopHat Output
The tophat script produces a number of files in the directory in which it was invoked. Most of these files are internal, intermediate files that are generated for use within the pipeline. The output files you will likely want to look at are:
-
accepted_hits.bam
, A list of read alignments in binary SAM format. This is the chief output file. -
junctions.bed
, A UCSC BED track of junctions reported by TopHat. Each junction consists of two connected BED blocks, where each block is as long as the maximal overhang of any read spanning the junction. The score is the number of alignments spanning the junction. -
insertions.bed
anddeletions.bed
. UCSC BED tracks of insertions and deletions reported by TopHat.
- -
insertions.bed
- chromLeft refers to the last genomic base before the insertion. - -
deletions.bed
- chromLeft refers to the first genomic base of the deletion.