Somatic SV breakpoint boundary issue caused by supplementary alignment boundaries #9

Open
aganezov opened this issue Feb 8, 2024 · 1 comment
Labels
enhancement New feature or request

Comments


aganezov commented Feb 8, 2024

Version: Severus 0.1.2
Data:

  • Tumor: colo829
  • Normal: colo829bl

Commands executed:

Severus-0.1.2/severus.py -t 16 --target-bam colo829.chr15_41M.bam --control-bam colo829bl.chr15_41M.bam --out-dir severus_unhaplotagged --vaf-thr 0.05 --single-bp

and

Severus-0.1.2/severus.py -t 16 --target-bam colo829.chr15_41M.tagged.bam --control-bam colo829bl.chr15_41M.tagged.bam --out-dir severus_haplotagged --vaf-thr 0.05 --single-bp

Resulting somatic SV DUP call can be viewed on the IGV screenshots:
[IGV screenshot: severus_bug1 igv_snapshot 2]
[IGV screenshot: severus_bug1 igv_snapshot 1]
NB: the truthset represents the variant as individual breakends, while the Severus VCF contains a single contiguous DUP call. Both refer to the same underlying somatic variant.

The left boundary of the somatic DUP call extends beyond the truth set DUP breakpoint, out to the outermost position of the supplementary alignment.

The issue persists with both unhaplotagged and haplotagged input data.

@aysegokce
Contributor

Hi @aganezov,
There is an overlapping sequence between the primary and supplementary alignments, possibly due to homology between the breakpoints, and the overlap size (~200 bp) is below our threshold (500 bp). Therefore, Severus directly outputs the positions from the alignments, whereas the truthset, being PCR-validated, has a more precise position. Even if the overlap were above the threshold, Severus would keep the start position and change the end position, so it still might not be an exact match.
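
To illustrate the overlap check described above, here is a minimal sketch in Python. It is not Severus code: the `Segment` type, the function names, and the constant name are hypothetical, the read coordinates in the usage lines are invented, and the strict "less than" comparison is an assumption. Only the 500 bp threshold and the ~200 bp overlap come from this comment.

```python
from typing import NamedTuple


class Segment(NamedTuple):
    """Aligned part of a read in read (query) coordinates, half-open [start, end)."""
    query_start: int
    query_end: int


# Threshold value quoted in the comment; the constant name is hypothetical.
OVERLAP_THRESHOLD = 500


def query_overlap(a: Segment, b: Segment) -> int:
    """Number of read bases covered by both segments (0 if they are disjoint)."""
    return max(0, min(a.query_end, b.query_end) - max(a.query_start, b.query_start))


def positions_reported_as_is(primary: Segment, supplementary: Segment,
                             threshold: int = OVERLAP_THRESHOLD) -> bool:
    """True when the overlap is below the threshold, i.e. the case in which
    the alignment positions would be output without adjustment.
    Assumption: the comparison is strict."""
    return query_overlap(primary, supplementary) < threshold


# Invented coordinates chosen to give a 200-base overlap.
primary = Segment(0, 5200)
supplementary = Segment(5000, 9000)
overlap = query_overlap(primary, supplementary)
as_is = positions_reported_as_is(primary, supplementary)
```

With these invented segments the shared stretch is 200 read bases, which falls under the 500 bp threshold, matching the situation described for this call.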

The truthset VCF is a junction VCF, so it represents everything (except unmapped insertions) as breakends, but the INFO column specifies the SVTYPE, which should be DUP for this SV.
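
As a rough sketch of how the SVTYPE can be read from the INFO column of a breakend record when comparing against the truthset, the following plain-Python helper may be useful. The function names are hypothetical, and the example record is invented for illustration (its coordinates, ID, and MATEID are not taken from the truthset).

```python
from typing import Dict, Optional, Union


def parse_info(info: str) -> Dict[str, Union[str, bool]]:
    """Split a VCF INFO string into a dict; flag-style keys map to True."""
    fields: Dict[str, Union[str, bool]] = {}
    for item in info.split(";"):
        if not item or item == ".":
            continue
        key, sep, value = item.partition("=")
        fields[key] = value if sep else True
    return fields


def svtype_of(vcf_line: str) -> Optional[Union[str, bool]]:
    """Return the SVTYPE from the INFO column (8th column) of a VCF data line."""
    columns = vcf_line.rstrip("\n").split("\t")
    return parse_info(columns[7]).get("SVTYPE")


# Invented breakend record: written in breakend notation, annotated as a DUP.
record = "chr15\t1000\tbnd_1\tN\t]chr15:2000]N\t.\tPASS\tSVTYPE=DUP;MATEID=bnd_2"
sv_type = svtype_of(record)
```

Reading the type from INFO rather than from the ALT notation lets a breakend pair in a junction VCF be matched against a contiguous DUP call.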

Ayse

@aysegokce aysegokce added the enhancement New feature or request label Feb 11, 2024