AMP-LM: A residue-level Language model for AMP sequence generation


Code accompanying the paper: Computational antimicrobial peptide design and evaluation against multidrug-resistant clinical isolates of bacteria

The codebase has four components:

  • data_harvesting: Download and parse pages from YADAMP
  • residue_lm: Train and sample a residue-level language model
  • clustal_pruning: Remove redundant sequences from the sampled sequences using ClustalW
  • bilstm_ranking: Rank the resulting sequences by predicted minimum inhibitory concentration (MIC) values using a bidirectional LSTM model

The inputs and outputs of each component are documented in the run.sh script (see Instructions to run).
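
As an illustration of what "residue-level" means for residue_lm, the sketch below treats each amino acid in a peptide as one token and maps it to an integer id, which is the kind of input a language model consumes. It is not taken from the repository's Lua code: the 20-letter vocabulary, the `<end>` token, and the `encode`/`decode` names are all assumptions made for this example.

```python
# Illustrative sketch only; the repository's own tokenization may differ.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues (assumed vocabulary)
END = "<end>"                          # hypothetical end-of-sequence token

VOCAB = list(AMINO_ACIDS) + [END]
TOKEN_TO_ID = {token: index for index, token in enumerate(VOCAB)}


def encode(sequence):
    """Map a peptide string to integer ids, one per residue, plus an end token."""
    ids = [TOKEN_TO_ID[residue] for residue in sequence.upper()]
    ids.append(TOKEN_TO_ID[END])
    return ids


def decode(ids):
    """Map integer ids back to a peptide string, stopping at the end token."""
    residues = []
    for index in ids:
        token = VOCAB[index]
        if token == END:
            break
        residues.append(token)
    return "".join(residues)
```

A residue outside the assumed vocabulary raises a `KeyError` in `encode`, which makes unexpected characters in the training data visible early.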

Requirements

  • torch7
  • python packages: pip install numpy beautifulsoup4 joblib
  • clustal: sudo apt-get install clustalw

For the BiLSTM service:

  • luarocks install https://raw.githubusercontent.com/benglard/htmlua/master/htmlua-scm-1.rockspec
  • luarocks install https://raw.githubusercontent.com/benglard/waffle/master/waffle-scm-1.rockspec
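
Before running the pipeline, it can help to confirm that the requirements above are visible from the current environment. The following is a minimal sketch, not part of the repository. It assumes the Torch7 launcher is on the PATH as `th` (as in the service command in this README), that the ClustalW package installs a `clustalw` executable, and that beautifulsoup4 is imported as `bs4`. It does not check the luarocks packages.

```python
import importlib.util
import shutil

# Names below are assumptions about how the requirements appear on the system.
EXECUTABLES = ["th", "clustalw"]
PY_MODULES = ["numpy", "bs4", "joblib"]


def check_requirements():
    """Return a dict mapping each requirement name to True if it was found."""
    status = {}
    for exe in EXECUTABLES:
        # shutil.which returns None when the executable is not on the PATH
        status[exe] = shutil.which(exe) is not None
    for module in PY_MODULES:
        # find_spec returns None when a top-level module cannot be located
        status[module] = importlib.util.find_spec(module) is not None
    return status


if __name__ == "__main__":
    for name, found in check_requirements().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```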

Instructions to run

The run.sh script runs the AMP-LM pipeline end to end. Before running it, edit the script to set the GPU ID of a CUDA-capable device in the system, since training is very slow on the CPU.

To run only the BiLSTM model as a service:

cd bilstm_ranking
th service.lua <model.t7> -port 1337
# usage: http://127.0.0.1:1337/blstm/GLKIGKKIGPFLKLVKK
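
The usage URL above can also be called from a script. The sketch below builds the same `/blstm/<sequence>` URL and fetches the response with the Python standard library. The function names `blstm_url` and `query_blstm` are hypothetical, the upper-casing and letters-only check are assumptions about acceptable input, and the response is returned as raw text because its format is not described here.

```python
from urllib.parse import quote
from urllib.request import urlopen


def blstm_url(sequence, host="127.0.0.1", port=1337):
    """Build the service URL for one peptide sequence."""
    seq = sequence.strip().upper()
    # Assumption: sequences are plain ASCII one-letter residue codes.
    if not (seq.isascii() and seq.isalpha()):
        raise ValueError(f"not a plain residue string: {sequence!r}")
    return f"http://{host}:{port}/blstm/{quote(seq)}"


def query_blstm(sequence, timeout=10, **url_options):
    """Send the sequence to a running service and return the raw response text."""
    with urlopen(blstm_url(sequence, **url_options), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

With the service started as shown above, `query_blstm("GLKIGKKIGPFLKLVKK")` requests the same URL as the usage line. Pass `port=` if the service was started on a different port.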

Contact

For queries regarding the LSTM algorithm, contact Tushar Nagarajan (tushar.nagarajan@gmail.com)
For queries regarding the experimental data, contact Deepesh Nagarajan (1337deepesh@gmail.com)
For academic queries, contact Prof. Nagasuma Chandra (sumachandra@gmail.com)

If you find this repository useful, please consider citing:

@article{nagarajan2017computational,
  title={Computational antimicrobial peptide design and evaluation against multidrug-resistant clinical isolates of bacteria},
  author={Nagarajan, Deepesh and Nagarajan, Tushar and Roy, Natasha and Kulkarni, Omkar and Ravichandran, Sathyabaarathi and Mishra, Madhulika and Chakravortty, Dipshikha and Chandra, Nagasuma},
  journal={Journal of Biological Chemistry},
  pages={jbc--M117},
  year={2017},
  publisher={ASBMB}
}
