This is some code I developed to detect and classify short tandem repeats in read data. This is part of my contribution to the Yuen Lab at the Sick Kids Hospital in Toronto as a volunteer under the supervision of Induja Chandrakumar.
This repo contains:
repeat_util.py
which is a Python script containing various utility methods. Not related to any particular project.- The project
consecutive_kmers
that converts sequences likeACTGATGATGGTGATGATGCGAA
and toACT(GAT)_2 GGT(GAT)_2 GCGAA
and groups them.