Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Hotspots to grch38 #59

Merged
merged 8 commits into from
Aug 30, 2022

Conversation

pieterlukasse
Copy link
Contributor

Fixes #58

@pieterlukasse pieterlukasse changed the title Hotspots update to grch38 Update Hotspots to grch38 Aug 26, 2022
@pieterlukasse
Copy link
Contributor Author

Genes dropped and genes updated are attached
dropped_genes.txt
updated_genes.txt
.

type=str, help="which transcript override source is preferred. Possible values: [mskcc, uniprot, genome_nexus, ensembl]")
parser.add_argument("--grch37_hotspots_2d_3d_file_name", default="../../data/grch37_ensembl92/export/hotspots_v2_and_3d.txt",
type=str, help="combined grch37 2D and 3D cancerhotspots data file")
args = parser.parse_args()
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

FYI - the updated hotspots file in this PR was produced by running this script with the default values for all these arguments ^

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@pieterlukasse Thanks! Is there a place where this would fit in the Makefile? https://github.com/genome-nexus/genome-nexus-importer/blob/master/data/common_input/hotspots/Makefile

We've been trying to use the Makefiles as a way to build all the export/ files. Ideally we can build everything with this command:

https://github.com/genome-nexus/genome-nexus-importer#running

I think that hotspots are separately tho in this Makefile (should prolly be added to the docs how that works at some point):

https://github.com/genome-nexus/genome-nexus-importer/blob/master/data/common_input/hotspots/Makefile

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@inodb good idea. I've pushed up an extra commit with the changes to the hotspots Makefile, and added a subsection to the README. Please take a look and let me know what you think.

Copy link
Member

@leexgh leexgh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you! Neat code! 🥇

../../grch37_ensembl92/export/hotspots_v2_and_3d.txt: hotspots_v2_and_3d_grch37.txt
cp $< $@

../../grch38_ensembl92/export/hotspots_v2_and_3d.txt: hotspots_v2_and_3d_grch37.txt_grch38_ported.txt
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do you have new "../../grch38_ensembl92/export/hotspots_v2_and_3d.txt" uploaded in the pull request?

Copy link
Contributor Author

@pieterlukasse pieterlukasse Aug 30, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I did run it. Do you want me to upload it or do you want to run it to test the process? The final result should be identical to the grch38_ensembl95 one, given the Ensembl api the script is using.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will copy the file over for now on our end - thanks!

Copy link
Member

@inodb inodb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great! Thanks so much! 🏆 - We love the code!

@inodb inodb added the enhancement New feature or request label Aug 30, 2022
@inodb inodb merged commit c080699 into genome-nexus:master Aug 30, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Hotspots data not grch38 compatible
3 participants