Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

(SV) slim down REF column for CPX variants #4970

Merged
merged 1 commit into from
Jul 9, 2018

Conversation

SHuang-Broad
Copy link
Contributor

Made a stupid decision to output all bases of the affected reference region, which significantly increased the VCF file size.

Now we follow the example for DEL variants, i.e. we only put the reference base in POS in the REF column.

@codecov-io
Copy link

codecov-io commented Jun 30, 2018

Codecov Report

Merging #4970 into master will increase coverage by 20.593%.
The diff coverage is 100%.

@@               Coverage Diff                @@
##              master     #4970        +/-   ##
================================================
+ Coverage     60.198%   80.791%   +20.593%     
- Complexity     12781     17962      +5181     
================================================
  Files           1095      1095                
  Lines          64604     64594        -10     
  Branches       10394     10392         -2     
================================================
+ Hits           38890     52186     +13296     
+ Misses         21482      8384     -13098     
+ Partials        4232      4024       -208
Impacted Files Coverage Δ Complexity Δ
...nce/SegmentedCpxVariantSimpleVariantExtractor.java 93.96% <100%> (+8.949%) 71 <0> (+5) ⬆️
.../sv/discovery/inference/CpxVariantInterpreter.java 79.839% <100%> (+74.921%) 26 <0> (+25) ⬆️
...tmutpileup/ValidateBasicSomaticShortMutations.java 85.965% <0%> (-0.94%) 7% <0%> (-6%)
.../hellbender/tools/genomicsdb/GenomicsDBImport.java 75.758% <0%> (-0.591%) 53% <0%> (+7%)
...llbender/tools/genomicsdb/GenomicsDBConstants.java 0% <0%> (ø) 0% <0%> (ø) ⬇️
...r/tools/walkers/annotator/ClippingRankSumTest.java 100% <0%> (ø) 4% <0%> (ø) ⬇️
...ender/engine/spark/datasources/ReadsSparkSink.java 77.027% <0%> (+0.676%) 33% <0%> (+1%) ⬆️
...spark/sv/evidence/FindBreakpointEvidenceSpark.java 69.485% <0%> (+0.825%) 61% <0%> (+1%) ⬆️
...utils/test/ReadsPreprocessingPipelineTestData.java 0.862% <0%> (+0.862%) 1% <0%> (+1%) ⬆️
...ute/hellbender/utils/test/IntegrationTestSpec.java 73.118% <0%> (+1.075%) 25% <0%> (ø) ⬇️
... and 588 more

final Allele anchorBaseRefAlleleRear = Allele.create(refBases[refBases.length - 2], true);
final SimpleInterval startAndStop = makeOneBpInterval(complexVC.getContig(), complexVC.getEnd());
final VariantContextBuilder rearIns = getInsFromOneEnd(false, firstRefSegmentIdx, startAndStop, anchorBaseRefAlleleRear, refSegmentLengths, altArrangement, true);
final int pos = complexVC.getEnd();
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why is it the end of the VC here and not the start?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The reason was because this is for extracting rear insertion.
Example:

SEGMENTS=chr1:100-200,chr1:200-300,chr1:300-400,chr1:400-500
ALT_ARRANGEMENT=chrOther:100-158,1,-3,4,UINS-433

Here we would emit four records:
a front insertion of 59 bases at position chr1:100, a deletion of chr1:200-300, an inversion of chr1:300-400, and a rear insertion of 433 bases at position of chr1:500.

Copy link
Member

@cwhelan cwhelan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks fine to me, just one little question about the start position of the variant intervals.

* before all bases from affected region is extracted, leading to bloated VCF, now only the anchor base
* also fixes downstream CPX variant re-interpreter
@SHuang-Broad SHuang-Broad merged commit fa5244f into master Jul 9, 2018
@SHuang-Broad SHuang-Broad deleted the sh-sv-cpx-slim-down-refbase branch July 9, 2018 20:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants