Skip to content

Latest commit

 

History

History
 
 

GSE129845

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 

GSE129845 data set

This data set comes from the publication "Single-Cell Transcriptomic Map of the Human and Mouse Bladders" by Yu et al., J Am Soc Nephrol (2019) (DOI, GEO submission). It contains ~13,000 cells from 3 patients.

To test Cerebro, download the .crb file from Seurat v3 and load it into Cerebro.

Workflow

The workflows of all three frameworks are conceptually the same, containing the following steps:

  • Load the transcript counts.
  • Filter cells based on the number of transcripts and expressed genes.
  • Normalize the transcript counts and scaled each cell to contain 10,000 transcripts.
  • Identify variable genes.
  • Scale the expression matrix and regressing out the number of transcripts.
  • Perform cell cycle analysis.
  • Perform principal component analysis.
  • Identify clusters and build a cluster tree.
  • Perform dimensional reduction.

Then, using the functions of cerebroApp, we add some more data:

  • Calculate the percent of mitochondrial and ribosomal gene expression (addPercentMtRibo()).
  • Get the most expressed genes in each sample and cluster (getMostExpressedGenes()).
  • Get marker genes for each sample and cluster (getMarkerGenes()).
  • Perform pathway enrichment analysis using the marker genes of each sample and cluster (getEnrichedPathways()).
  • Perform gene set enrichment analysis for each sample and cluster (performGeneSetEnrichmentAnalysis()).

Next, we calculate trajectories of (1) all cells and (2) a subset of cells (those in G1 phase) using Monocle v2 and the variable features identified by Seurat. We extract these trajectories from the respective Monocle objects and add them to our Seurat object through the extractMonocleTrajectory() function.

Lastly, from the Seurat object we export a Cerebro file (.crb extension) that can be loaded into Cerebro (exportFromSeurat()).

How to reproduce

The example data sets were generated using the official Cerebro Docker image which was built in Docker (Docker Hub) and imported into Singularity (here I used Singularity 2.6.0). Details and descriptions for the workflow can be found in the respective directory Seurat v3.