Difference between revisions of "ChIP-Seq Top2 in Yeast"

From wiki
Jump to: navigation, search
Line 96: Line 96:
 
* $BAM1 is the name of the immunoprecipitation alignment file, termed the '''treatment''' and therefore given with the '''-t''' option.
 
* $BAM1 is the name of the immunoprecipitation alignment file, termed the '''treatment''' and therefore given with the '''-t''' option.
 
* $BAM2 is the name of the normal input alignment file, termed the '''control''' and therefore given with the '''-c''' option.
 
* $BAM2 is the name of the normal input alignment file, termed the '''control''' and therefore given with the '''-c''' option.
 +
* BAMPE is just a setting informing the program that these a pair-end alignment files.
 +
* the output directory here is given the IP (rather than the INP) directory for the sample.
 +
* the size of the genome is then given for the -g option.
  
categories and Signal =
+
MACS produces a number of files as output.
 +
 
 +
 
 +
= Categorisation as peak =
  
 
It is not totally clear how MACS decides that a peak should be called. This is apparent when the signal appears to rise and fall, yet MACS does not call a peak.
 
It is not totally clear how MACS decides that a peak should be called. This is apparent when the signal appears to rise and fall, yet MACS does not call a peak.

Revision as of 16:39, 9 October 2017

Introduction

This wiki page combines the two sequencing experiments of ChIP-Seq on yeast samples udring June / July 2017.

Sample quality

In the above previous analyses, we undertook some Duplicate removel as this showed up to be an issue in the FastQC quality files.

However, the effect of this removal was very low. In any case, the MACS2 program takes care of duplicates so samples were only trimmed for adapters and quality.

Alignment to the S288C Reference

Bam quality, no filtering

First Experiment

Second Experiment

Alignment to the new W303 Reference

Bam quality, no filtering

First Experiment

Second Experiment

Bam quality, filtering applied

The bam files were filtered both for mapping quality and low integrity reads in the following manner:

samtools view -b -F 1820 -q 48 <INBAMFILE> -o ${OUTBAMFILE> -@ <NUMTHREADS>

First Experiment

Second Experiment

Bam quality, 2nd filtering applied

This time bam files were filtered to MAPQ 28:

samtools view -b -F 1820 -q 28 <INBAMFILE> -o ${OUTBAMFILE> -@ <NUMTHREADS>

First Experiment

Second Experiment

The MACS/2 program

The main way of running this program is as follows:

macs2 callpeak -B -t $BAM1 -c $BAM2 -f BAMPE --outdir $IPOUTDIR -g 12.07e6 -q 0.01
  • $BAM1 is the name of the immunoprecipitation alignment file, termed the treatment and therefore given with the -t option.
  • $BAM2 is the name of the normal input alignment file, termed the control and therefore given with the -c option.
  • BAMPE is just a setting informing the program that these a pair-end alignment files.
  • the output directory here is given the IP (rather than the INP) directory for the sample.
  • the size of the genome is then given for the -g option.

MACS produces a number of files as output.


Categorisation as peak

It is not totally clear how MACS decides that a peak should be called. This is apparent when the signal appears to rise and fall, yet MACS does not call a peak.

Signal script

The source for this is from the MACS developer and is detailed here.


Differential Peak-calling

Macs2 incorporated differential peak calling somewhat later into its set of tools. The idea is to take two conditions and compare their peaks.

The pipeline to follow is probably Macs2's recommended one at Differential Binding Events