-
Notifications
You must be signed in to change notification settings - Fork 726
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Specify the correct sample name for empty VCFs instead of default
#334
Comments
Thanks for the suggestion @arostamianfar . Sounds like a good suggestion and should be easy to do : adding a flag to postprocess_variants. If I'll file an internal issue to track this, and we should be able to have this in our next release. |
The functionality has been implemented in the internal codebase and will be available in the next release. |
Like @gunjanbaid said, in our internal codebase, I added the --sample_name flag to postprocess_variants as well, which will allow users to pass in a different sample_name at that stage. But, I do want to point out that even with the current version (0.10.0), if you pass in --sample_name in the one-step command like Quick Start, you should be already able to get the VCF output with the specified sample_name. Because the I tried with exactly the steps in https://github.com/google/deepvariant/blob/r0.10/docs/deepvariant-quick-start.md , and added --sample_name:
After running this command (with the current version, 0.10.0), I do get
Just an FYI, and hopefully I didn't misunderstand the issue you brought up. But, in the new code (which we plan to release soon), the --sample_name flag is added to postprocess_variants as well. |
Thank you for the clarification, @pichuan ! However, my issue isn't with changing the sample_name, but with empty VCF files. In the above example, if you change I believe passing |
Ah yes, thanks @arostamianfar for providing the |
…his to run_deepvariant.py as well. This addresses #334. PiperOrigin-RevId: 327440165
…e for empty VCFs became `default` if we don't specify it. PiperOrigin-RevId: 328370355
Describe the issue:
DeepVariant currently outputs
default
as the sample name for empty VCFs (related to the fix in #186 ). Ideally, thesample_name
should be transferred from the original BAM file, but if that is too difficult to implement (since there are no examples), I think it would also be ok if we let the user customize this ID. One suggestion is that it can use the value provided by the--sample_name
flag (currently, used formake_examples
, but I think it can also be reused forpostprocess_variants
).Using
default
causes issues in pipelines where the VCF is used downstream of DeepVariant (e.g. merging the VCF with other callers; or even within DeepVariant in a pipeline that scatters calling across multiple chromosomes and tries to gather them and some of those VCFs are empty).Setup
The text was updated successfully, but these errors were encountered: