This repository has been archived by the owner on Nov 9, 2019. It is now read-only.

ReadCountCollection pre-processing filters #1055

Closed
mbabadi opened this issue May 18, 2017 · 1 comment

mbabadi commented May 18, 2017

At the moment, we:

  • Remove targets with possibly bad (NaN, infinity, negative) values
  • Remove targets that have uniformly low coverage across all samples
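
As a rough illustration of these two existing filters, here is a minimal Python sketch. It is not the project's actual code: the function names, the dict-of-lists layout, and the `low_coverage_threshold` value are all hypothetical, and "uniformly low" is assumed to mean that every sample falls below the threshold.

```python
import math


def is_bad(value):
    """True for the 'bad' values named above: NaN, infinity, or negative."""
    return math.isnan(value) or math.isinf(value) or value < 0


def filter_targets(counts, low_coverage_threshold=5.0):
    """Return only the targets that pass both filters.

    counts: dict mapping target name -> list of per-sample coverage values.
    low_coverage_threshold: hypothetical cutoff; a target is dropped only
    if *every* sample is below it.
    """
    kept = {}
    for target, values in counts.items():
        # Filter 1: drop targets containing any NaN, infinite, or negative value.
        if any(is_bad(v) for v in values):
            continue
        # Filter 2: drop targets whose coverage is low in all samples.
        if all(v < low_coverage_threshold for v in values):
            continue
        kept[target] = values
    return kept
```

A target with low coverage in only some samples is kept under this reading, since the low coverage is not uniform.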

Perhaps we should consider adding more filters:

  • Remove targets with very high or very low GC content (can be done in the CalculateTargetCoverage step)
  • Remove targets with lots of repeats or anomalously low mappability (can be done in the CalculateTargetCoverage step)
  • In the learning mode, remove a target if it is masked in too many samples (in that case, maximum-likelihood parameter estimation is unreliable)

This must be done only after careful evaluation, i.e. only if certain features make a target prone to bad calls no matter what.
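
To make the proposal concrete, here is a hedged Python sketch of how the three additional filters might look. Everything in it is an assumption for illustration: the per-target fields (`gc`, `mappability`, `masked`), the function name, and all threshold defaults are hypothetical and would have to come out of the evaluation mentioned above.

```python
def apply_proposed_filters(targets, min_gc=0.2, max_gc=0.8,
                           min_mappability=0.9, max_masked_fraction=0.5):
    """Return the names of targets that pass the three proposed filters.

    targets: list of dicts with hypothetical keys
        "name"        -> target identifier
        "gc"          -> GC fraction in [0, 1]
        "mappability" -> mappability score in [0, 1]
        "masked"      -> list of booleans, one per sample (True = masked)
    All thresholds are placeholders, not tuned values.
    """
    kept = []
    for t in targets:
        # Proposed filter 1: extreme GC content.
        if not (min_gc <= t["gc"] <= max_gc):
            continue
        # Proposed filter 2: anomalously low mappability.
        if t["mappability"] < min_mappability:
            continue
        # Proposed filter 3 (learning mode): masked in too many samples.
        masked = t["masked"]
        if masked and sum(masked) / len(masked) > max_masked_fraction:
            continue
        kept.append(t["name"])
    return kept
```

Keeping the thresholds as parameters makes it easy to evaluate each filter separately before deciding whether it is worth enabling.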
