Update canonical transcript overrides with explanation #75

Merged
merged 6 commits into genome-nexus:master on Mar 1, 2023

Conversation

leexgh
Member

@leexgh leexgh commented Feb 22, 2023

Fix: genome-nexus/genome-nexus#664

  • Use new OncoKB isoform overrides extracted from the OncoKB API (https://www.oncokb.org/api/v1/utils/allCuratedGenes) and split into separate grch37 and grch38 files.
  • Add new columns that explain where each canonical transcript comes from. Options: ensembl only one transcript, ensembl longest, uniprot, oncokb, mskcc, manually (stored in genome-nexus-isoform-overrides).
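
To illustrate the first bullet, here is a minimal sketch of splitting curated-gene records into one override table per genome build. It is not the script from this PR. The field names `hugoSymbol`, `grch37Isoform`, and `grch38Isoform` are assumptions about the shape of the `allCuratedGenes` response, the output columns are chosen for illustration, and the gene names and transcript IDs are made-up placeholders.

```python
import csv
import io

# Assumed field names in each allCuratedGenes record (not confirmed by this PR).
ISOFORM_FIELDS = {"grch37": "grch37Isoform", "grch38": "grch38Isoform"}


def split_oncokb_overrides(curated_genes):
    """Return {build: TSV text} with one row per gene that has an isoform for that build."""
    outputs = {}
    for build, field in ISOFORM_FIELDS.items():
        buffer = io.StringIO()
        writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
        writer.writerow(["gene_name", "enst_id"])
        for gene in curated_genes:
            # Skip genes with a missing or empty isoform for this build.
            transcript = (gene.get(field) or "").strip()
            if transcript:
                writer.writerow([gene["hugoSymbol"], transcript])
        outputs[build] = buffer.getvalue()
    return outputs


# Placeholder records, not real OncoKB data.
example = [
    {"hugoSymbol": "GENE_A", "grch37Isoform": "ENST00000000001", "grch38Isoform": "ENST00000000002"},
    {"hugoSymbol": "GENE_B", "grch37Isoform": "ENST00000000003", "grch38Isoform": ""},
]
files = split_oncokb_overrides(example)
print(files["grch37"])
print(files["grch38"])
```

In this sketch a gene without an isoform for a given build is simply left out of that build's table, so the two files can differ in length.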

Need to merge #74 first

@@ -0,0 +1,3 @@
enst_id gene_name protein_stable_id gene_stable_id comment
ENST00000573679 PTPRC ENSP00000458322 ENSG00000262418
Member Author

@leexgh leexgh Feb 22, 2023

This comes from: 88f0d68. @inodb Do you want to add some comments?

Member

Hmm, I'm having a hard time reconstructing why I picked that one; maybe OK to leave as is for now.
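
For readers unfamiliar with the override file format, here is a minimal sketch of loading a file with the columns shown in the diff for this thread. The header and the PTPRC row are copied from that diff. Tab separation and the `load_overrides` helper are assumptions for illustration, not code from this repository.

```python
import csv
import io

# Header and row copied from the diff in this thread; tab separation is an assumption.
OVERRIDES_TSV = (
    "enst_id\tgene_name\tprotein_stable_id\tgene_stable_id\tcomment\n"
    "ENST00000573679\tPTPRC\tENSP00000458322\tENSG00000262418\t\n"
)


def load_overrides(handle):
    """Map gene_name -> override row, treating a missing comment as an empty string."""
    overrides = {}
    for row in csv.DictReader(handle, delimiter="\t"):
        # DictReader fills absent trailing columns with None, so normalise here.
        row["comment"] = (row.get("comment") or "").strip()
        overrides[row["gene_name"]] = row
    return overrides


overrides = load_overrides(io.StringIO(OVERRIDES_TSV))
print(overrides["PTPRC"]["enst_id"], repr(overrides["PTPRC"]["comment"]))
```

The empty `comment` value mirrors the row discussed here, where the reason for the override could not be reconstructed.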

@@ -0,0 +1 @@
enst_id gene_name protein_stable_id gene_stable_id comment
Member Author

@inodb Do you have more grch38 transcript overrides to add?

Member

@inodb inodb left a comment

LGTM - thank you!!

scripts/make_one_canonical_transcript_per_gene.py (outdated review thread, resolved)
@leexgh leexgh merged commit 04e4f1e into genome-nexus:master Mar 1, 2023
Development

Successfully merging this pull request may close these issues.

Update canonical transcript with new grch37 and grch38 oncokb isoform overrides files
2 participants