Skip to content

Plant genome assembly and annotation pipeline using snakemake

License

Notifications You must be signed in to change notification settings

baozg/assembly-annotation-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Snakemake workflow: plant genome HiFi assembly and genome annotation

This Snakemake pipeline aims for plant genome assembly with HiFi data and genome annotation with RNA-Seq/IsoSeq.

Assembly

Modified from the VGP assembly VGP tutorial

Assembly

Annotation

  • Maker running inside snakemake still have the problem since the NSF lock and submit system is already included.
  • IsoSeq integration is still in development. For diploid or polyploidy, FLNC mapping is better than the clustering of the PacBio pipeline if you have haplotype-resolved assembly.
  • Automatic pipeline always confused the complex gene clusters (metabolic gene clusters / NLR gene clusters), please manually curation it before any further interpretation. WebApollo or IGV-GSAman could be a good start for manual curation. (https://www.bilibili.com/video/BV1x84y1z7ZW/)

Annotation

About

Plant genome assembly and annotation pipeline using snakemake

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published