Description:

This repository contains scripts and sample code for creating and analyzing gene-level networks of phage diversity, as described in our recent preprint, "Gene networks provide a high-resolution view of bacteriophage ecology," available on bioRxiv.
A detailed guide (Walkthrough.txt) is available for download. It gives instructions for going from the genome sequences of a collection of phages to building networks and generating host predictions for novel genomes. The guide can be used to recreate the analysis in our manuscript and can also be applied to new data. See the "R Scripts" folder for the necessary code, "Output Data" for examples of output, and "Input Data" for the necessary input files.
(Updated 12/21/2017 with extended walkthrough and new code for comparing gene network topology to the phylogenetic relatedness of phage hosts.)

Required Software

This code is intended to be run in a UNIX environment and requires:
- R (with the packages 'Matrix' and 'igraph'): required for most of the analysis.
- usearch: used to cluster homologous genes. (Note: only the 32-bit version of usearch is free. Larger projects may not be feasible without purchasing a 64-bit license.)
- MCL: used to identify graphical clusters in networks.
- Cytoscape: used to visualize networks.
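
Before starting the walkthrough, it may help to confirm that the command-line tools listed above are installed. The following is a minimal sketch, not part of the repository: the helper `missing_tools` is hypothetical, and the executable names `Rscript`, `usearch`, and `mcl` are assumptions (a usearch download, for example, is often a versioned binary that you rename or link yourself). Cytoscape is a desktop application and is not checked here.

```shell
#!/bin/sh
# Hypothetical helper: print the name of each command in the argument
# list that cannot be found on PATH (prints nothing if all are found).
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# The executable names below are assumptions; adjust them to match
# how the tools are installed on your system.
missing=$(missing_tools Rscript usearch mcl)
if [ -n "$missing" ]; then
    echo "Missing required tools:" $missing >&2
else
    echo "R, usearch, and MCL were found on PATH."
fi
```

If R is present, running `Rscript -e 'library(Matrix); library(igraph)'` should exit with an error when either package is not installed.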

Contact information

Please contact Jason Shapiro at jshapiro2@luc.edu with any questions.