-
Notifications
You must be signed in to change notification settings - Fork 23
read_psl
read_psl read PSL data from file. The PSL format consists of up to 21 columns:
.1 MATCHES - Number of non-repeat matches. .1 MISMATCHES - Number of mismatches. .1 REPMATCHES - Number of repeat matches. .1 NCOUNT - Number of Ns. .1 QNUMINSERT - Number of inserts in query. .1 QBASEINSERT - Number of bases inserted in query. .1 SNUMINSERT - Number of inserts in subject. .1 SBASEINSERT - Number of bases inserted in subject. .1 STRAND - Strand. .1 Q_ID - Query ID. .1 Q_LEN - Query length. .1 Q_BEG - Query begin. .1 Q_END - Query end. .1 S_ID - Subject ID. .1 S_LEN - Subject length. .1 S_BEG - Subject begin. .1 S_END - Subject end. .1 BLOCKCOUNT - Block count. .1 BLOCKSIZES - Block sizes. .1 Q_BEGS - Query sequence blocks begins. .1 S_BEGS - Subject sequence blocks begins.
read_psl adds an additional two keys:
.1 SCORE - Score calculated as in web BLAT results. .1 SPAN - The span of the hit. .1 REC_TYPE - Record type.
For more about the PSL format:
http://genome.ucsc.edu/FAQ/FAQformat#format2
read_psl [options] -i <PSL file(s)>
[-? | --help] # Print full usage description.
[-i <files!> | --data_in=<files!>] # Comma separated list of files or glob expression to read.
[-n <uint> | --num=<uint>] # Limit number of records to read.
[-I <file!> | --stream_in=<file!>] # Read input stream from file - Default=STDIN
[-O <file> | --stream_out=<file>] # Write output to stream file - Default=STDOUT
[-v | --verbose] # Verbose output.
To read all PSL entries from a file:
read_psl -i test.psl
To read in only 10 records from a PSL file:
read_psl -n 10 -i test.psl
To read all PSL entries from multiple files:
read_psl -i test1.psl,test2.psl
To read PSL entries from multiple files using a glob expression:
read_psl -i '*.psl'
Martin Asser Hansen - Copyright (C) - All rights reserved.
August 2007
GNU General Public License version 2
http://www.gnu.org/copyleft/gpl.html
read_psl is part of the Biopieces framework.