-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add stats
submodule with compute_completeness()
function
#118
Conversation
Codecov Report
@@ Coverage Diff @@
## main #118 +/- ##
==========================================
- Coverage 98.67% 98.48% -0.20%
==========================================
Files 41 43 +2
Lines 1590 1712 +122
==========================================
+ Hits 1569 1686 +117
- Misses 21 26 +5
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
For For |
Just to follow up on this, I think that the best option here to resolve the discrepancies in "filler values" across programs will be to add an argument to the call signature called This can then be used to allow the user to achieve the desired behavior. I'll also update the docstring to be a bit more explicit about the behavior with: 1) unmerged data 2) merged data; 1-col anomalous and 3) merged data; 2-col anomalous. I also think I will expand this method to support all forms of completeness that are reported by
|
This is ready to go in my mind:
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
On the whole the code style is good, and the tests seem adequate. I don't have too many comments, but I noted a few things that might be improved. I cannot claim that I know how robust this will be to different data sets, but it is a pretty solid start as far as I can tell.
I am going to approve, this but please consider making some of the changes.
…erved_value` description
This PR adds an initial pass at
rs.stats.compute_completeness()
. This function takes a DataSet as input, and returns the completeness by resolution bins. It is written to work for both merged and unmerged DataSet objects. It also takes ananomalous
flag which can be used to compute the anomalous completeness.docstring:
Example:
To do:
stats
section to the documentation