Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ValueError: Subset RNA cells not found in population #27

Closed
orbitalse opened this issue Aug 15, 2023 · 2 comments
Closed

ValueError: Subset RNA cells not found in population #27

orbitalse opened this issue Aug 15, 2023 · 2 comments
Labels
question Further information is requested

Comments

@orbitalse
Copy link

Hello Lingfei,

I am encountering the following error when trying to run my data through Dictys. I am at the makefiles step, and this error is generated when I run makefile_check.py to check that everything is in order with the input data before starting the GRN inference.

ValueError: Subset RNA cells not found in population

I am using unpaired snRNA-seq and snATAC-seq data, coming from the same donor but not performed jointly in the same cells. I have already formatted the snRNA-seq expression matrix as well as the clusters/subsets for both snRNA and snATAC according to how it is done in the full-multiome tutorial.

Would you be able to elaborate on what this error means? I've tried debugging the code to trace back the source of the error but to no avail.

Many thanks in advance for your help.

@lingfeiwang lingfeiwang added question Further information is requested and removed tutorial labels Aug 15, 2023
@lingfeiwang
Copy link
Collaborator

Hi orbitalse,

Thank you for the question.

In your input folder, you should have subsets/*/names_rna.txt including cell names contained in each subpopulation. This error means certain cell names in these files cannot be found in your expression matrix file expression.tsv.gz. You need to ensure all cells mentioned in subsets/*/names_rna.txt have their expression profiles available in expression.tsv.gz.

We will make this clear in the error message in the next version. Please reopen the issue if the error persists.

Lingfei

@orbitalse
Copy link
Author

Many thanks for your timely reply! I was able to resolve the issue. The column names of the expression matrix didn't exactly match the cell barcodes in subsets/*/names_rna.txt, which caused the error.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants