-
Notifications
You must be signed in to change notification settings - Fork 117
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Peak calling parameters might not be ideal #169
Comments
…ixed nf-core#168 (write genome fa and fai for IGV), fixed nf-core#169 (peak calling).
@ktrns and @JoseEspinosa, This may be a silly question but in the region that is shown above, we are assuming the depleted region between the two peaks is where the nucleosome is, right? Can't it be where TF bind instead? How do you tell the difference? Also, if it is TF binding, having a peak that only capture the open chromatin region on either side of the TF will not include their motifs. When we do motif discovery on the peaks, then the expected TF motifs won't be found? Also in this suggested macs callpeak parameter, we no longer call broad peaks? TIA for your insights. |
Hello, I saw that some papers in which they use macs2 for ATAC-seq they previously filter the fragment sizes to get a set corresponding to nucleosome free regions (NFR) and then they apply Best regards, |
Dear all,
We had started to discuss this topic in Slack, but I decided to raise this issue so the comments won't get lost.
In brief, you currently use the
macs2
settings originally suggested for ATAC-seq data:-f "BAMPE" --nomodel
. I have spent some time on reading, and I find this discussion very helpful:https://twitter.com/XiChenUoM/status/1336658454866325506
In brief, by using
-f "BAMPE"
we focus on only 1 of the 2 cut sites of the fragment, which does not seem to be the most appropriate way to handle ATAC-seq data.Based on what I read, converting
bam
tobed
, and then use-f bed
(or bedpe, still have to figure that out) together with--shift -75 --extsize 150
is better. You would basically use both readsR1
andR2
, and create 150bp reads with the mid-point on the cut-site of the transposase. Another twitter reply to the above thread said that Encode3 used--shift -37 --extsize 73
.I will likely start to try this to see if it gives an improvement.
Thanks a lot for your valuable work on the pipeline!
Katrin
The text was updated successfully, but these errors were encountered: