MetaMorphCells is a bioinformatics research project focused on studying a small population of cancer cells that undergo dedifferentiation into cancer stem cells. This repository contains various scripts and notebooks for preprocessing, analyzing, and visualizing single-cell sequencing data to explore the molecular mechanisms underlying cancer cell dedifferentiation.
Cancer cells can sometimes "rewind" their development, changing into a more primitive, stem-like state. This transformation gives them special abilities, like growing faster, resisting treatments, and spreading to other parts of the body.
In this project, we use advanced data analysis techniques to study these changes in ovarian cancer cells. By examining individual cells, we aim to uncover how they transform and find clues that might help us develop better treatments in the future. We use single-cell genomic sequencing data to model and predict cancer cell behavior. The graph below illustrates the predicted pathways of how different cancer cell clusters differentiate, based on data from a clinical dataset (GSE165897).
🔬 Key Topics:
- Cancer stem cells
- Dedifferentiation
- Single-cell RNA sequencing (scRNA-seq)
- Bioinformatics tools and workflows
This repository is organized into the following sections:
1. Data Preprocessing
- Scripts for data cleaning and normalization of scRNA-seq datasets.
2. Analysis Pipeline
- Includes the full pipeline from clustering, differential expression, imputation, to trajectory inference.
3. Gene Regulatory Network Inference
- Tools for building gene regulatory networks and identifying key regulators.
4. Perturbation Analysis
- Scripts to model and analyze perturbations in the dedifferentiation process.
5. Visualization
- Custom visualizations for cell clusters, gene expression, and trajectories.
Tool/Method | Description |
---|---|
ALRA | Implements the ALRA (Adaptively-thresholded Low Rank Approximation) imputation method ALRA GitHub |
CytoTRACE2 | Predicts differentiation state of single cells based on transcriptional data. CytoTRACE2 GitHub |
scGPT | Applying the scGPT model for gene expression prediction, GRN inference, and perturbation analysis. scGPT GitHub |
scPopcorn | Identification of unique cell clusters using the scPopcorn package. scPopcorn GitHub |
scTour | Model training and lineage trajectory inference using scTour. scTour GitHub |
Scanpy | Comprehensive scRNA-seq analysis toolkit. Scanpy GitHub |
scVelo | RNA velocity analysis. scVelo GitHub |
Velocyto | RNA velocity analysis in scRNA-seq data. Velocyto GitHub |
- Identification of dedifferentiation markers in ovarian cancer cells.
- RNA velocity maps showing trajectory of dedifferentiation.
- Gene regulatory networks highlighting potential therapeutic targets.
This project is still evolving. Future updates may include:
- Integration with spatial transcriptomics data.
- Deeper perturbation studies using additional clinical datasets.