Skip to content

Repository for scripts, notebooks, data, and analyses associated with the paper on Southern Alaska Tanner crabs and Hematodinium.

Notifications You must be signed in to change notification settings

grace-ac/paper-tanner-crab

 
 

Repository files navigation

paper-tanner-crab

DOI

Repository for scripts, notebooks, data, and analyses associated with the paper entitled:

Characterization of the gene repertoire and environmentally driven expression patterns in Tanner crab (Chionoecetes bairdi)

Grace Crandall1, Pamela C. Jensen2, Sam White1, Steven Roberts1

1 School of Aquatic and Fishery Sciences, University of Washington, Seattle, Washington 98105, United States
2 Resource Assessment and Conservation Engineering Division, Alaska Fisheries Science Center, National Marine Fisheries Service, NOAA, 7600 Sand Point Way NE, Seattle, WA 98115, United States

For broader background information on this project, visit RobertsLab/project-crab

Directory contents

DAVID output files from the single crab genes over time heatmap clusters (Fig. 3)
6 clusters, one file per clusters. Clusters 5 and 6 are very small compared to the other 4, and as such are not discussed in the paper.

DAVID output files from (Fig. 2) for the first three clusters. The fourth cluster didn't have any DAVID results.

The files in this directory are from this Rmd: scripts/DESeq.Rmd

Output files from this jupyter notebook to get gene count data from the 4 libraries used for DESeq2. Jupyter notebook: notebooks/kallisto-4libraries.ipynb

Output files from the jupyter notebook to get gene count data from the individual crab RNAseq libraries for visualization. Jupyter notebook: notebooks/kallisto-individual-crab.ipynb



Directory containing directories for each library that has output files from kallisto.

Notebooks:


  • DESeq.Rmd
    Rmd to use DESeq2 with the count matrix for the 4 libraries (380822, 380823, 380824, 380825).
  • GOslim-pie-transcriptome-v3.1.Rmd
    Rmd to create a pie chart of GOslim terms with counts from transcriptome v 3.1.
  • get-stress_response-genes-transcriptome.Rmd
    Rmd to get the list of trinity IDs from transcriptome v 3.1 that fall undert the GOslim term "Stress Response".
  • heatmaps-4libraries.Rmd
    Rmd to create heatmaps using pheatmap for DEGs for the 4 libraries.
  • heatmaps-DEGs_in-singlecrab_over-time.Rmd
    Rmd to create a heatmap of the temperature-influenced infection DEGs in the single crab ... double check what this Rmd is..
  • heatmaps-single_crab-over-time.Rmd
    Rmd to create heatmap of the contigs in the single crab over the three time points.
  • tempdegs-individualcrab.Rmd
    Rmd to join the cluster heatmap information from Fig. 2 in the manuscript with the cluster heatmap information from the heatmap of single crab genes over time. Purpose is to have a file that includes the cluster assignments for both heatmaps in order to find the temperature-influenced infection DEGs in the individual crab to get an idea of patterning of expression at the individual level over time.

Columns are:
Trinity_ID - contig ID from Trinity assembly
swissprot - swissprot
uniprot_acc_ID - uniprot accession ID for the contig
gene_id - gene ID
pident - percentage of identical matches
length - alignment length
mismatch - number of mismatches
gapopen - number of gap openings
qstart - start of alignment in query
qend - end of alignment in query
sstart - start of alignment in subject
send - end of alignment in subject
evalue - expect value
bitscore - Bit score
Entry.name - entry name
Status - reviewed or not
Protein.names - protein names for contig
Gene.names - gene names for contig
Ogranisms - taxonomic organisms
Length - length
Gene.Ontology.biological.process - biological process gene ontology
Gene.ontology.cellular.component - cellular component gene ontology
Gene.ontology.GO - GO terms
Gene.ontology.molecular.function - molecular function gene ontology
Gene.ontology.IDs - gene ontology IDs

  • Supp05-transcriptome-stress-genes.tab
    Table of the C. bairdi transcripts that fell under the GOslim term "Stress response".

  • Supp06-infection-temp-DEGs-counts_annot.tab
    Table of the differentially expressed contigs between infected and uninfected crab taking temperature difference into account (decreased vs. elevated) at Day 2 (408 DEGs), including library count data and uniprot/swissprot and gene ontology information.

  • Supp07-contrasttemp_DEGs-clusters-annot.tab
    Table of a subset of the above 408 DEGs that are just the contigs directly influenced by temperature - 123 total. Table includes library count data, cluster assignment as seen in Fig. 2, and uniprot/swissprot and gene ontology information.

  • Supp08-single_crab-clusters-blast-GO.tab
    Table containing the count data for the single crab RNAseq libraries over time, with the cluster assignments as shown in Fig. 3, along with uniprot/swissprot and gene ontology information

  • Supp09-temp-influenced-infectionDEGs-singlecrab-overtime.tab
    Table combining Supplemental tables 07 and 08. Allows for finding the 123 infection DEGs that are influenced by temperature treatment in the individual crab over time.

About

Repository for scripts, notebooks, data, and analyses associated with the paper on Southern Alaska Tanner crabs and Hematodinium.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 98.0%
  • R 2.0%