-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MYCN amplification status thresholding #112
Conversation
@afarrel , just noting our discussion in Slack and recap the issue: We are re-evaluating CNV consensus annotation since:
For all NBL samples, we can do the following to check the differences between clinical annotation and our consensus call results:
And the results are:
I was concerned about only have 75 samples in the annotated consensus file (
This particular strategy was introduced since previously, neutral calls get I think we can potentially use ploidy for copy number and that would rescue some neutral calls but would not add any |
Thanks @runjin326 ! |
Also attaching a file with the comparison :) |
Thanks @runjin326 for your feedback. This issue is tough and we may need to do this on a cohort or cancer level. We've done a lot of work to benchmark the brain tumors so I don't want us to alter this methodology globally based on MYCN, which is a very highly amplified gene compared to others. I'm surprised these aren't captured, but we can also use SNP array as default CN and consensus as backup once we have those data integrated. Another option is to determine what the overlaps are for consensus. We implemented consensus if 90% overlap in one direction (CNV A overlaps 90% with CNV B) or 50% in both. Can we assess the actual breaks and overlaps as well @ewafula to see why amps are missed? It's also possible a few clinically amp samples won't have dna amp due to DNA/protein differences and/or tumor collection heterogeneity per assay. FISH is used clinically. |
@afarrel, I have updated the module with comparisons between the TARGET SNP arrays (focal and segment at multiple copy number ratio cutoffs to infer MYCN amp) and clinical MYCN amp status in the histologies files. Details explained in the README.md |
Closing this as we will add SNP arrays at a later date. |
Purpose/implementation Section
What scientific question is your analysis addressing?
The purpose of this analysis module is to assess the status of the MYCN gene amplification status between the OpenPedCan Neuroblastoma GMKF and TARGET consensus copy number variants (CNV) calls from WSX/WGS data and TARGET focal and segment SNP array data compared to the clinical information in the master histologies file.
What was your approach?
Input data
analyses/data/histologies.tsv
analyses/data/consensus_wgs_plus_cnvkit_wxs.tsv.gz
analyses/data/cnv-cnvkit.seg.gz
analyses/data/cnv-controlfreec.tsv.gz
analyses/data/sv-manta.tsv.gz
input/mycn_tumor_focal_SNParray.tsv
input/mycn_tumor_paired_segment_SNParray.tsv
input/mycn_tumor_single_segment_SNParray.tsv
What GitHub issue does your pull request address?
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Check if the results are sufficient for determining copy number and overlap coverage cutoffs for the consensus CNV calling analysis and suggests other additional approaches if needed.
Review the accuracy of the analysis code
Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?
YES
Results
What types of results are included (e.g., table, figure)?
Tables
What is your summary of the results?
results/mycn_consensus_vs_clinical_status.tsv
results/mycn_consensus_vs_clinical_diff_status.tsv
results/mycn_consensus_vs_clinical_metrics.tsv
results/cnvkit_mycn_consensus_vs_clinical_diff_status.tsv
results/ccontrolfreec_mycn_consensus_vs_clinical_diff_status.tsv
results/mantaSV_mycn_consensus_vs_clinical_diff_status.tsv
02-mycn-consensus-clinical-discordant.html
results/mycn_focal_SNParray_vs_clinical_metrics.tsv
results/mycn_focal_SNParray_vs_clinical_status.tsv
results/mycn_paired_segment_SNParray_vs_clinical_metrics.tsv
results/mycn_paired_segment_SNParray_vs_clinical_status.tsv
results/mycn_single_segment_SNParray_vs_clinical_metrics.tsv
results/mycn_single_segment_SNParray_vs_clinical_status.tsv
Reproducibility Checklist
Documentation Checklist
README
and it is up to date.analyses/README.md
and the entry is up to date.