-
Notifications
You must be signed in to change notification settings - Fork 9
Genome size estimation of skimming data (RESPECT)
Kamil S. Jaron edited this page Mar 22, 2024
·
1 revision
We will now use RESPECT to compute the repeat structure of the genomes.
Make sure you are under the skmer-tutorial
directory.
mkdir respect
cd respect
mkdir inputs
cd inputs
## link to both skims and genomes here
ln -s ../../skims/nonhybrids/Saccharomyces_*fq ../../skmer-tutorial/nonhybrids/*fna .
cd ..
Now run the method:
### There is some ugliness having to do with licenses gurobi. We cannot just run respect. I will discuss during the tutorial.
### /cluster/projects/nn9458k/oh
_know/teachers/smirarab/respect/gurobi912/linux64/bin/grbgetkey
### export GRB_LICENSE_FILE=`pwd`/gurobi.lic
respect -d inputs/ -N 100
This will take a minute or two.
Look at the summary of results and note the high level of accuracy comparing to genomes
### Examine genome length estimates
column -t estimated-parameters.txt|sort -k1
### And examine repeat spectra
cat estimated-spectra.txt |sort -k1|column -t
cd ..
Introduction
k-mer spectra analysis
- 📖 Introduction to K-mer spectra analysis
- 📖 Basics of genome modeling
- ⚒ manual model fitting (for better understanding of the underlying model)
- ⚒ simple diploid
- ⚒ demonstrating the effect of sequencing error rate on k-mer coverage
- 📖 Common difficulties in characterisation of diploid genomes using k mer spectra analysis
- ⚒ low coverage (pitfall) - to be merged
- ⚒ very homozygous diploid
- ⚒ highly heterozygous diploid
- ⚒ Genome size of a repetitive genome (pitfall)
- ⚒ Wrong ploidy (pitfall)
- 📖 Characterization of polyploid genomes using k mer spectra analysis
- ⚒ Autotetraploid
- ⚒ Allotetraploid
- ⚒ Estimating ploidy (smudgeplot)
- 📖 Genome modeling as a quality control
- ⚒ Contamination (pitfall)
- ⚒ k-mers in an assembly (Mercury/KAT)
- 📖 Analysing genome skimming data
Separation of chromosomes
- 📖Separate sub-genomes of an allopolyploid
- 📖Separating chromosomes by comparison of sequencing libraries
Species assignment using short k-mers
Others