Difference between revisions of "MinION Coverage sensitivity analysis"

From wiki
Jump to: navigation, search
(Created page with " = Subtract Operations = In the spirit of discovering new sequences, this ivolves subtract what is currently annotated from our alignment dataset. This is a highly untarget...")
 
Line 1: Line 1:
 +
 +
 +
 +
= Conversion of base-called fast5 to fasta=
 +
 +
This is done with poretools, and can be performed. Filters may be
  
  
Line 12: Line 18:
  
 
= Intersect Operations =
 
= Intersect Operations =
 +
 +
 +
= Obtaining the sequence behind a range =
 +
 +
As the bed file only holds chromosome names and ranges, there is no sequence information. This is obtain with the bedtools getfasta tool. This however needs a fasta file, so the correct fasta must be obtained from the bam file, because the reference sequence could be used or sequences from the reads.

Revision as of 17:24, 7 April 2017


Conversion of base-called fast5 to fasta

This is done with poretools, and can be performed. Filters may be


Subtract Operations

In the spirit of discovering new sequences, this ivolves subtract what is currently annotated from our alignment dataset.

This is a highly untargeted approach because all currently annotated genes and interestung areas are subtracted so that the coverage can only describe currently unknown areas of the genome, including non-coding areas.

Though this might sound unfruitful to start with, it has some benefits:

  • useful to gain a feeling of general coverage
  • reduces reliance on currently known areas, which may have been chosen becaus eof high coverage from previous studies.

Intersect Operations

Obtaining the sequence behind a range

As the bed file only holds chromosome names and ranges, there is no sequence information. This is obtain with the bedtools getfasta tool. This however needs a fasta file, so the correct fasta must be obtained from the bam file, because the reference sequence could be used or sequences from the reads.