Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads

Ye C, Ma Z. (2016) Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads. PeerJ 4:e2016 https://doi.org/10.7717/peerj.2016

Test data: https://sourceforge.net/projects/sparc-consensus/files/testdata/

To compile from scratch, clone the repository and run the following command in its top-level directory:

g++ -O3 -o Sparc *.cpp

Parameters:

b: backbone file.

m: the read mapping file produced by blasr with option -m 5. (An example blasr command: blasr -nproc 32 query.reads.fasta backbone.fasta -bestn 1 -m 5 -minMatch 19 -out backbone.mapped.m5)

k: k-mer size (suggested range: [1,2]).

c: coverage threshold (range: [1,5], suggested: 2).

t: adaptive threshold (suggested range: [0.0,0.3]).

g: skip size; the larger the value, the more memory-efficient the algorithm (suggested range: [1,3]).

HQ_Prefix: shared prefix of the high-quality read names (e.g. if the second-generation sequences have names >Contig_xxx, then ‘Contig’ is the shared prefix of the high-quality reads).

boost: boosting weight for the high-quality reads (suggested range: [1,10]).

o: output name for the consensus (ConsensusOutput in the example commands).
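
The ranges in the parameter list above can be checked before a long run. The following is a minimal sketch, not part of Sparc: the function names in_range and check_sparc_params are hypothetical, and the limits are copied from the ranges listed above. Because most of those ranges are only suggestions, the helper prints warnings rather than refusing to continue.

```shell
# Hypothetical helper (not part of Sparc): warn when a value falls outside
# the ranges given in the parameter list.

# in_range VALUE LOW HIGH -> exit status 0 if LOW <= VALUE <= HIGH.
# awk does the comparison so that fractional values such as t work too.
in_range() {
    awk -v v="$1" -v lo="$2" -v hi="$3" \
        'BEGIN { exit !(v + 0 >= lo + 0 && v + 0 <= hi + 0) }'
}

# check_sparc_params K C T G -> prints one warning per out-of-range value
# and returns non-zero if any warning was printed.
check_sparc_params() {
    rc=0
    in_range "$1" 1 2     || { echo "warning: k=$1 outside [1,2]"; rc=1; }
    in_range "$2" 1 5     || { echo "warning: c=$2 outside [1,5]"; rc=1; }
    in_range "$3" 0.0 0.3 || { echo "warning: t=$3 outside [0.0,0.3]"; rc=1; }
    in_range "$4" 1 3     || { echo "warning: g=$4 outside [1,3]"; rc=1; }
    return $rc
}
```

For example, check_sparc_params 2 2 0.1 2 prints nothing and returns 0, so it can be chained with && in front of a Sparc command that uses the same values.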

Example commands:

Using third-generation data only:

Sparc b Backbone.fa m backbone.mapped.m5 k 2 g 2 c 2 t 0.1 o ConsensusOutput

Using hybrid data:

Sparc b backbone.fasta m backbone.mapped.m5 k 2 g 2 c 2 t 0.1 HQ_Prefix Contig boost 5 o ConsensusOutput
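
Before a hybrid run, it may be worth confirming that the chosen HQ_Prefix matches the reads you expect. The sketch below is not part of Sparc: count_hq_reads is a hypothetical helper that counts FASTA records whose name starts with a given prefix, under the assumption that the high-quality sequences are in the same FASTA file that was mapped with blasr.

```shell
# Hypothetical helper (not part of Sparc): count FASTA records whose name
# starts with PREFIX, next to the total number of records in the file.
# Usage: count_hq_reads PREFIX FASTA_FILE
count_hq_reads() {
    awk -v p=">$1" '
        # Header lines start with ">"; index(...) == 1 means the header
        # begins with ">" followed by the prefix.
        /^>/ { total++; if (index($0, p) == 1) hq++ }
        END  { printf "%d of %d reads match prefix\n", hq, total }
    ' "$2"
}
```

With the file name from the blasr example, count_hq_reads Contig query.reads.fasta reports how many read names begin with Contig. A count of zero suggests that the prefix does not match any read name in that file.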
