Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Delete duplicate layer in AnnData objects #759

Merged
merged 21 commits into from
May 9, 2024

Conversation

jashapiro
Copy link
Member

When working on OpenScPCA-nf, I found that when we move raw data in the processed data, we are leaving in place the original "logcounts" layer, making it meaning we have two copies of it around.

This PR removes that duplication (and merges in changes from main to development)

Unless we were supposed to leave it for some reason, I think it should save us some space! By my metrics (and logical expectation, nice when they match) this saves ~1/3 of the space for the AnnData processed objects.

I couldn't find any reference to the layers in our docs, so this change should not require any updates there.

@jashapiro jashapiro requested a review from allyhawkins May 9, 2024 20:43
Copy link
Member

@allyhawkins allyhawkins left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is fine with me. We don't use the AnnData objects at all after running this script in the workflow so should be good to go. Thanks for merging in those other changes from main too!

@jashapiro jashapiro merged commit 42cca2b into development May 9, 2024
4 checks passed
@jashapiro jashapiro deleted the jashapiro/delete-layer branch May 9, 2024 21:03
jashapiro added a commit to AlexsLemonade/OpenScPCA-nf that referenced this pull request May 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants