Trimmomatic

Latest revision as of 17:17, 5 July 2016

Introduction

Trimmomatic is another tool for quality trimming, and probably the best known one.

Usage

Trimmomatic is called in an unusual way.

It is a Java program, so the call starts with the jar file, followed by these arguments in order:

  1. whether the reads are single-end or paired-end (SE|PE)
  2. the number of threads, given with a hyphen, i.e. -threads N
  3. whether quality scoring is phred33 or phred64, e.g. -phred33 (although Trimmomatic will try to auto-detect it)
  4. the two paired-end input files, given without hyphens, i.e. just the plain filenames
  5. the output files; four names are required here, because two of these files will hold single-end (unpaired) reads, i.e. reads whose partner in the pair did not reach the quality standard and so was deleted
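The argument order above can be sketched as a command assembled piece by piece. This is only an illustration: the jar location and all input and output file names are hypothetical placeholders, and the command is printed rather than executed so it can be checked first.

```shell
# Hypothetical jar location and file names; substitute your own.
JAR="trimmomatic-0.32.jar"
MODE="PE"                 # 1. SE or PE
THREADS=4                 # 2. passed as -threads N
ENCODING="-phred33"       # 3. or -phred64
INPUTS="in_R1.fastq.gz in_R2.fastq.gz"   # 4. plain filenames, no hyphens
# 5. four outputs: paired and unpaired reads for each input file
OUTPUTS="out_R1_paired.fq out_R1_unpaired.fq out_R2_paired.fq out_R2_unpaired.fq"

# Trimming steps (here only MINLEN:50) follow the output files.
CMD="java -jar $JAR $MODE -threads $THREADS $ENCODING \
$INPUTS \
$OUTPUTS \
MINLEN:50"

# Print the assembled command for inspection instead of running it.
echo "$CMD"
```

Once the printed line looks right, it can be pasted into the terminal or run from a job script.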

Adapter file specification

You can use an environment variable for the adapter FASTA file; these variables are set by the module and point to the right place. They are listed below; the most commonly used one is $TS3PE.

  • $TS3SE is TruSeq3-SE.fa
  • $TS2SE is TruSeq2-SE.fa
  • $TS2PE is TruSeq2-PE.fa
  • $NXPE is NexteraPE-PE.fa
  • $TS3PE is TruSeq3-PE.fa
  • $TS3PE2 is TruSeq3-PE-2.fa
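Because these variables only exist once the module is loaded, it may be worth checking one before starting a long run. The helper below is a minimal sketch; the function name check_adapter is made up for this example and is not part of Trimmomatic or the module.

```shell
# check_adapter is a hypothetical helper, not provided by the module.
# $1: path to an adapter FASTA file, e.g. "$TS3PE"
check_adapter() {
    if [ -z "$1" ]; then
        # An empty value usually means the variable was never set.
        echo "adapter variable is empty; is the module loaded?" >&2
        return 1
    fi
    if [ ! -r "$1" ]; then
        echo "adapter file not readable: $1" >&2
        return 1
    fi
    echo "using adapter file: $1"
}
```

A call such as check_adapter "$TS3PE" returns a non-zero status when the variable is empty or the file cannot be read, so it can guard the trimming command in a script.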

Example use-cases

java -jar $PATHTRIMMOJAR/trimmomatic-0.32.jar PE -threads 6 -phred33 $DATASETPATH/Pair1.fastq.gz $DATASETPATH/Pair2.fastq.gz forward_paired.fq forward_unpaired.fq reverse_paired.fq reverse_unpaired.fq ILLUMINACLIP:$TS3PE:2:30:10 LEADING:5 TRAILING:5 SLIDINGWINDOW:4:15 MINLEN:50
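The trimming steps at the end of the command above are applied in the order given. The sketch below splits them into named pieces with a comment on each. The variable names are invented for this example, and the fallback to a plain file name when $TS3PE is unset is an assumption made only so the snippet runs anywhere.

```shell
# Adapter file: falls back to a plain file name if $TS3PE is unset (assumption).
ADAPTERS="${TS3PE:-TruSeq3-PE.fa}"

# ILLUMINACLIP:<adapter fasta>:<seed mismatches>:<palindrome clip threshold>:<simple clip threshold>
CLIP="ILLUMINACLIP:$ADAPTERS:2:30:10"
# LEADING / TRAILING: remove bases below quality 5 from the start / end of each read
ENDS="LEADING:5 TRAILING:5"
# SLIDINGWINDOW:<window size>:<required average quality within the window>
WINDOW="SLIDINGWINDOW:4:15"
# MINLEN: drop reads shorter than 50 bases after trimming
MINLEN="MINLEN:50"

STEPS="$CLIP $ENDS $WINDOW $MINLEN"
echo "$STEPS"
```

Appending $STEPS unquoted to the java command lets the shell split it into separate arguments; this only works if the adapter path contains no spaces.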