Skip to content
Martin Asser Hansen edited this page Oct 2, 2015 · 8 revisions

Biopiece: read_psl

Description

read_psl read PSL data from file. The PSL format consists of up to 21 columns:

.1 MATCHES - Number of non-repeat matches. .1 MISMATCHES - Number of mismatches. .1 REPMATCHES - Number of repeat matches. .1 NCOUNT - Number of Ns. .1 QNUMINSERT - Number of inserts in query. .1 QBASEINSERT - Number of bases inserted in query. .1 SNUMINSERT - Number of inserts in subject. .1 SBASEINSERT - Number of bases inserted in subject. .1 STRAND - Strand. .1 Q_ID - Query ID. .1 Q_LEN - Query length. .1 Q_BEG - Query begin. .1 Q_END - Query end. .1 S_ID - Subject ID. .1 S_LEN - Subject length. .1 S_BEG - Subject begin. .1 S_END - Subject end. .1 BLOCKCOUNT - Block count. .1 BLOCKSIZES - Block sizes. .1 Q_BEGS - Query sequence blocks begins. .1 S_BEGS - Subject sequence blocks begins.

read_psl adds an additional two keys:

.1 SCORE - Score calculated as in web BLAT results. .1 SPAN - The span of the hit. .1 REC_TYPE - Record type.

For more about the PSL format:

http://genome.ucsc.edu/FAQ/FAQformat#format2

Usage

read_psl [options] -i <PSL file(s)>

Options

[-?          | --help]               #  Print full usage description.
[-i <files!> | --data_in=<files!>]   #  Comma separated list of files or glob expression to read.
[-n <uint>   | --num=<uint>]         #  Limit number of records to read.
[-I <file!>  | --stream_in=<file!>]  #  Read input stream from file  -  Default=STDIN
[-O <file>   | --stream_out=<file>]  #  Write output to stream file  -  Default=STDOUT
[-v          | --verbose]            #  Verbose output.

Examples

To read all PSL entries from a file:

read_psl -i test.psl

To read in only 10 records from a PSL file:

read_psl -n 10 -i test.psl

To read all PSL entries from multiple files:

read_psl -i test1.psl,test2.psl

To read PSL entries from multiple files using a glob expression:

read_psl -i '*.psl'

See also

blat_seq

write_psl

Author

Martin Asser Hansen - Copyright (C) - All rights reserved.

mail@maasha.dk

August 2007

License

GNU General Public License version 2

http://www.gnu.org/copyleft/gpl.html

Help

read_psl is part of the Biopieces framework.

http://www.biopieces.org

Clone this wiki locally