Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update annotate_freq and qual_hists, add split_vds and compute_freq_by_strata #571

Merged
merged 14 commits into from
Aug 4, 2023

Conversation

mike-w-wilson
Copy link
Contributor

This updates annotate_freq to use the added array aggregation functionality in hail, originally added by Tim in #537, adds the ability to add entry aggregation annotations, and generally cleans up the function by splitting the existing annotate_freq into two functions: annotate_freq which calls compute_freq_by_strata.

This also adds split_vds_by_strata which returns a lists of VDSs with as many VDSs as there are unique values for the passed expression.

We also add the ability to return a single struct of qual histograms, containing separate structs of raw and adj qual histograms.

Copy link
Contributor

@jkgoodrich jkgoodrich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just some areas that documentation needs to be added

gnomad/utils/annotations.py Outdated Show resolved Hide resolved
gnomad/utils/annotations.py Show resolved Hide resolved
gnomad/utils/annotations.py Outdated Show resolved Hide resolved
Compute call statistics and, when passed, entry aggregation function(s) by strata.

The computed call statistics are AC, AF, AN, and homozygote_count. Downsamplings are
added to the strata when downsamplings when passed. The entry aggregation functions
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
added to the strata when downsamplings when passed. The entry aggregation functions
added to the strata when `downsamplings` is passed. The entry aggregation functions

gnomad/utils/annotations.py Outdated Show resolved Hide resolved
gnomad/utils/annotations.py Show resolved Hide resolved
gnomad/utils/annotations.py Show resolved Hide resolved
gnomad/utils/annotations.py Show resolved Hide resolved
gnomad/utils/annotations.py Outdated Show resolved Hide resolved
gnomad/utils/annotations.py Outdated Show resolved Hide resolved
Copy link
Contributor

@jkgoodrich jkgoodrich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just a small request

gnomad/utils/annotations.py Outdated Show resolved Hide resolved
gnomad/utils/annotations.py Outdated Show resolved Hide resolved
Copy link
Contributor

@jkgoodrich jkgoodrich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Small doc string changes

gnomad/utils/annotations.py Outdated Show resolved Hide resolved
gnomad/utils/annotations.py Outdated Show resolved Hide resolved
gnomad/utils/annotations.py Outdated Show resolved Hide resolved
Co-authored-by: jkgoodrich <33063077+jkgoodrich@users.noreply.github.com>
Copy link
Contributor

@jkgoodrich jkgoodrich left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@mike-w-wilson mike-w-wilson merged commit d5e495f into main Aug 4, 2023
@mike-w-wilson mike-w-wilson deleted the mw/compute_freq_by_strata branch August 4, 2023 18:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants