Skip to content

Training and using TFFM/PSSM/4-bits + DNA shape classifiers

License

LGPL-3.0, GPL-3.0 licenses found

Licenses found

LGPL-3.0
COPYING.LESSER
GPL-3.0
COPYING
Notifications You must be signed in to change notification settings

martelogan/DNAshapedTFBS

 
 

Repository files navigation

DNAshapedTFBS

Python module for TFFM/PSSM/4-bits + DNA shape classifiers

This module allows for:

  1. Training PSSM/TFFM/4-bits + DNA shape classifiers on ChIP-seq data
  2. Applying PSSM/TFFM/4-bits + DNA shape classifiers on ChIP-seq data

Note that only the best hit per ChIP-sequence is considered in the current version of the module.

Dependencies

The module requires:

  • python2.7 (and does not work with python3 in its current version).
  • the BioPython module www.biopython.org.
  • the TFFM package accessed from your PYTHONPATH environment variable.
  • the scikit-learn module.
  • the pandas module.
  • access to bigWig files providing the values of the DNA shape features HelT, MGW, ProT, and Roll from your genome interest along with the second order computation these features. Please visit the GBshape website.
  • the bwtool.

Tutorial

You can find some examples of how to run the DNAshapedTFBS.py tool in the script test.sh provided in the test/ repository of this package.

The script feature_importance_heatmap.py plots the heatmap(s) of trained classifier(s). Note that the current version only works for PSSM/TFFM + DNA shape classifiers. You can get help on how to use it by typing

python2.7 feature_importance_heatmap.py -h

Project home page

For information on the source tree, examples, issues, and pull requests, see

http://github.com/amathelier/DNAshapedTFBS

Cite

If you use the DNAshapedTFBS tool, please cite

  • A. Mathelier, B. Xin, T.-P. Chiu, L. Yang, R.R. Rohs, and W.W. Wasserman (2016) DNA shape features improve transcription factor binding site predictions in vivo. Cell Systems, DOI:10.1016/j.cels.2016.07.001.

About

Training and using TFFM/PSSM/4-bits + DNA shape classifiers

Resources

License

LGPL-3.0, GPL-3.0 licenses found

Licenses found

LGPL-3.0
COPYING.LESSER
GPL-3.0
COPYING

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 97.7%
  • Shell 2.3%