This repository has been archived by the owner on Jun 16, 2023. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 0
Updated analysis: use annotator
CLI in the cnv-frequencies
module
#124
Comments
5 tasks
@ewafula The |
Thanks, @logstar. I Will start working to update the CNV
module immediately.
…On Mon, Jul 26, 2021 at 11:06 AM Yuanchao Zhang ***@***.***> wrote:
@ewafula <https://github.com/ewafula> The annotator CLI is merged to the
dev branch
<d3b-center/OpenPedCan-analysis#59>, so
it could be used for your CNV module now.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#124 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAZN26CB3EJCQDK4QDJYXLLTZV2XJANCNFSM5AWHVLNA>
.
|
@logstar, the long-format-table-utils analysis module annotator might need updating with input files from v7. Works ok with updated CNV module with v6 but not v7. Checked my local branch is up to date.
|
@ewafula Thank you for checking. This will be fixed in d3b-center/OpenPedCan-analysis#66. You could adapt the changes of |
Closed with PR d3b-center/OpenPedCan-analysis#52 merged. |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
What analysis module should be updated and why?
The
cnv-frequencies
module should be updated to use theannotator
CLI atanalyses/long-format-table-utils/annotator/annotator-api.R
, after(EDIT 07/26/2021 YZ: removed the requirement to merge d3b-center/OpenPedCan-analysis#52 before working on this issue)
What changes need to be made? Please provide enough detail for another participant to make the update.
Remove annotation related code in
analyses/cnv-frequencies/01-cnv-frequencies.py
.Use the long-format table annotator CLI in the
cnv-frequencies
module with the following steps:"Gene_symbol", "Gene_Ensembl_ID", "Disease"
(case-sensitive) are not all present in the column names of the table to be annotated, add new columns or rename existing ones to have all these required columns.OpenPedCan-analysis
or a subdirectory ofOpenPedCan-analysis
. This allows theannotator-cli.R
to locate theannotator-api.R
.annotator-cli.R
script withRscript --vanilla path/to/annotator-cli.R
and proper options. TheRscript
command can be invoked by Python (>= 3.5)import subprocess; subprocess.run("Rscript --vanilla analyses/long-format-table-utils/annotator/annotator-cli.R -h".split())
. For more information about Python (>= 3.5)subprocess.run
, https://docs.python.org/3/library/subprocess.html#subprocess.run.Following is an example usage in the
rna-seq-expression-summary-stats
module01-tpm-summary-stats.R
.What input data should be used? Which data were used in the version being updated?
data/histologies.tsv
data/consensus_seg_annotated_cn_autosomes.tsv.gz
data/consensus_seg_annotated_cn_x_and_y.tsv.gz
analyses/independent-samples/results/independent-specimens.wgs.primary.eachcohort.tsv
analyses/independent-samples/results/independent-specimens.wgs.relapse.eachcohort.tsv
When do you expect the revised analysis will be completed?
1-2 days.
Who will complete the updated analysis?
@ewafula
The text was updated successfully, but these errors were encountered: