You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jun 16, 2023. It is now read-only.
Optimize the cnv-frequencies module to improve run times. Currently takes 2 hrs to compute and annotate cancer_group and cancer_group_cohort gene-level frequencies variant frequencies for the autosomes CNV consensus results.
What changes need to be made? Please provide enough detail for another participant to make the update.
Reimplement and test runs times of mainly the compute_variant_frequencies function, the slowest portion of code to be more efficient.
What input data should be used? Which data were used in the version being updated?
autosomes/allosomes CNV consensus files
histology file
primary/relapse independents sample files
gene and disease annotations files
When do you expect the revised analysis will be completed?
Run time optimized. Down from 2.38333 hrs (143 min) to 0.65278 hrs (39.167 minutes) using named tuples. Reference issue ticket: PediatricOpenTargets/ticket-tracker#120. Is this acceptable?
I agree. The annotation function needs no further optimization, as it will be replaced by the upcoming annotation module. All other parts should run within 40 minutes, so they are also good. I will close the optimization ticket PediatricOpenTargets/ticket-tracker#120.
What analysis module should be updated and why?
Optimize the
cnv-frequencies
module to improve run times. Currently takes 2 hrs to compute and annotatecancer_group
andcancer_group_cohor
t gene-level frequencies variant frequencies for the autosomes CNV consensus results.What changes need to be made? Please provide enough detail for another participant to make the update.
Reimplement and test runs times of mainly the
compute_variant_frequencies
function, the slowest portion of code to be more efficient.What input data should be used? Which data were used in the version being updated?
When do you expect the revised analysis will be completed?
1-2 days
Who will complete the updated analysis?
@ewafula
cc @logstar @jharenza
The text was updated successfully, but these errors were encountered: