Investigations of the coronavirus SL5 RNA domain

This repository accompanies the publication

Kretsch, Rachael C., Lily Xu, Ivan N. Zheludev, Xueting Zhou, Rui Huang, Grace Nye, Shanshan Li, Kaiming Zhang, Wah Chiu, and Rhiju Das. 2024. “Tertiary Folds of the SL5 RNA from the 5′ Proximal Region of SARS-CoV-2 and Related Coronaviruses.” Proceedings of the National Academy of Sciences 121 (10): e2320493121.

Analysis scripts, including M2-seq analysis, modeling, angle analysis, and model validation, can be found in analysis/.

Secondary structures were drawn in RiboDraw; the scripts can be found in ribodraw_figures.

The primary structure files can be found under the PDB accession codes listed below. All modeling results can be found in models/, which also includes the scripts used to combine structures.

Visuals of the cryo-EM maps and models were made in ChimeraX. The scripts to create these figures can be found in chimerax_figures/.

graph_figures/ contains the code to create the graphs of the data in this paper, including the ERRASER2 analysis, plotting M2-seq matrices, plotting the base-pair probability for HCoV-229E, plotting the distances and angles of SL5 models, and calculating and plotting sequence identity. Each notebook has the code to read the data and make the figure, and displays the resulting graph. Some graphs were cosmetically reformatted from these raw graphs for the final figures, but no substantive changes were made.

Data Availability

The raw sequencing reads can be found in the NIH Sequence Read Archive under BioProject accession number PRJNA1039878, with each individual FASTQ found under the SRA accession listed in the table below. Additionally, the processed reactivity profiles can be found in the RMDB.

| Construct | Experiment | Chemical modifier | Reverse transcriptase | SRA | RMDB |
| --- | --- | --- | --- | --- | --- |
| SARS-CoV-2 SL5 domain | “scarless” M2-seq | No modification | TGIRT-III | SRR26810680 | COVSL5_NOM_0002 |
| SARS-CoV-2 SL5 domain | “scarless” M2-seq | DMS modified | TGIRT-III | SRR26810681 | COVSL5_DMS_0002 |
| SARS-CoV-2 SL5-6 domains | “scarless” M2-seq | No modification | TGIRT-III | SRR26810682 | COVSL5_NOM_0001 |
| SARS-CoV-2 SL5-6 domains | “scarless” M2-seq | DMS modified | TGIRT-III | SRR26810683 | COVSL5_DMS_0001 |
| SARS-CoV-2 SL5 domain | “large-library” M2-seq | No modification | MarathonRT | SRR26827601 | SL5CV2_NOM_0001 |
| SARS-CoV-2 SL5 domain | “large-library” M2-seq | DMS modified | MarathonRT | SRR26827601 | SL5CV2_DMS_0001 |
| SARS-CoV-2 SL5 domain | “large-library” M2-seq | No modification | SuperScript II | SRR26827601 | SL5CV2_NOM_0002 |
| SARS-CoV-2 SL5 domain | “large-library” M2-seq | 2A3 modified | SuperScript II | SRR26827601 | SL5CV2_2A3_0001 |
| MERS SL5 domain | “large-library” M2-seq | No modification | MarathonRT | SRR26827601 | SL5MER_NOM_0001 |
| MERS SL5 domain | “large-library” M2-seq | DMS modified | MarathonRT | SRR26827601 | SL5MER_DMS_0001 |
| MERS SL5 domain | “large-library” M2-seq | No modification | SuperScript II | SRR26827601 | SL5MER_NOM_0002 |
| MERS SL5 domain | “large-library” M2-seq | 2A3 modified | SuperScript II | SRR26827601 | SL5MER_2A3_0001 |
| BtCoV-HKU5 SL5 domain | “large-library” M2-seq | No modification | MarathonRT | SRR26827601 | SL5HKU_NOM_0001 |
| BtCoV-HKU5 SL5 domain | “large-library” M2-seq | DMS modified | MarathonRT | SRR26827601 | SL5HKU_DMS_0001 |
| BtCoV-HKU5 SL5 domain | “large-library” M2-seq | No modification | SuperScript II | SRR26827601 | SL5HKU_NOM_0002 |
| BtCoV-HKU5 SL5 domain | “large-library” M2-seq | 2A3 modified | SuperScript II | SRR26827601 | SL5HKU_2A3_0001 |
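
For readers who want to fetch the raw reads, the following is a minimal sketch (not part of this repository) that turns the SRA run accessions in the table into `fasterq-dump` command lines. It assumes that sra-tools is installed separately; the function name `fasterq_dump_commands` and the output directory `fastq` are hypothetical. All “large-library” rows list the same run, so the accessions are de-duplicated before the commands are built.

```python
import shlex

# Run accessions copied from the table above. Every "large-library" row
# lists the same run (SRR26827601), so it appears only once here.
SRA_RUNS = [
    "SRR26810680",
    "SRR26810681",
    "SRR26810682",
    "SRR26810683",
    "SRR26827601",
]


def fasterq_dump_commands(runs, outdir="fastq"):
    """Return one `fasterq-dump` command line per unique run accession.

    The commands are only built, not executed; `outdir` is a hypothetical
    output directory chosen for illustration.
    """
    unique_runs = list(dict.fromkeys(runs))  # de-duplicate, keep order
    return [
        shlex.join(["fasterq-dump", run, "--outdir", outdir])
        for run in unique_runs
    ]


if __name__ == "__main__":
    for command in fasterq_dump_commands(SRA_RUNS):
        print(command)
```

The printed lines can be pasted into a shell or passed to `subprocess.run` once sra-tools is available on the `PATH`.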

We are in the process of uploading the cryo-EM movies and particle stacks to EMPIAR. The cryo-EM maps can be found in EMDB with the accession codes listed in the table. The models can be found in the PDB with the accession codes listed in the table, or in this repository.

| Construct | EMPIAR | EMDB | PDB |
| --- | --- | --- | --- |
| SARS-CoV-2 SL5 domain | 11827 | EMD-42818 | 8UYS |
| SARS-CoV-2 SL5-6 domains | 11813 | EMD-42821 | N/A |
| SARS-CoV-2 SL5-6 domains, SL5b extended | 11834 | EMD-42820 | N/A |
| SARS-CoV-2 SL5-6 domains, SL5c extended | 11814 | EMD-42819 | N/A |
| SARS-CoV-2 SL5-6 domains, SL6 extended and SL5a, SL5b, and SL5c removed | 11838 | N/A | N/A |
| SARS-CoV-1 SL5 domain | 11815 | EMD-42816 | 8UYP |
| MERS SL5 domain conformation 1 | 11837 | EMD-42809 | 8UYK |
| MERS SL5 domain conformation 2 | 11837 | EMD-42810 | 8UYL |
| MERS SL5 domain conformation 3 | 11837 | EMD-42811 | 8UYM |
| BtCoV-HKU5 SL5 domain conformation 1 | 11836 | EMD-42801 | 8UYE |
| BtCoV-HKU5 SL5 domain conformation 2 | 11836 | EMD-42805 | 8UYG |
| BtCoV-HKU5 SL5 domain conformation 3 | 11836 | EMD-42802 | N/A |
| BtCoV-HKU5 SL5 domain conformation 4 | 11836 | EMD-42808 | 8UYJ |
| HCoV-229E SL5 domain | 11835 | EMD-42803 | N/A |
| HCoV-NL63 SL5 domain conformation 1 | 11848 | EMD-42813 | N/A |
| HCoV-NL63 SL5 domain conformation 2 | 11848 | EMD-42814 | N/A |
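
As a convenience, the sketch below (not part of this repository) shows one way to turn the accession codes in the table into download URLs. It assumes the RCSB file-download URL pattern for mmCIF models and the usual EMDB archive layout for maps; both patterns are assumptions about those services rather than anything defined here, and the helper names are hypothetical. Only a subset of the table rows is included, and entries marked N/A are represented as `None` and skipped.

```python
# (construct, EMDB accession, PDB accession) copied from a subset of the
# rows in the table above; None stands for "N/A".
ENTRIES = [
    ("SARS-CoV-2 SL5 domain", "EMD-42818", "8UYS"),
    ("SARS-CoV-2 SL5-6 domains", "EMD-42821", None),
    ("SARS-CoV-1 SL5 domain", "EMD-42816", "8UYP"),
    ("HCoV-229E SL5 domain", "EMD-42803", None),
]


def pdb_url(pdb_id):
    """mmCIF download URL for a PDB entry (assumed RCSB URL pattern)."""
    return f"https://files.rcsb.org/download/{pdb_id.upper()}.cif"


def emdb_map_url(emdb_id):
    """Map download URL for an EMDB entry (assumed EMDB archive layout)."""
    number = emdb_id.split("-")[1]  # "EMD-42818" -> "42818"
    return (
        "https://ftp.ebi.ac.uk/pub/databases/emdb/structures/"
        f"EMD-{number}/map/emd_{number}.map.gz"
    )


def download_urls(entry):
    """Return the URLs available for one table row, skipping N/A fields."""
    _construct, emdb_id, pdb_id = entry
    urls = []
    if emdb_id is not None:
        urls.append(emdb_map_url(emdb_id))
    if pdb_id is not None:
        urls.append(pdb_url(pdb_id))
    return urls


if __name__ == "__main__":
    for entry in ENTRIES:
        print(entry[0])
        for url in download_urls(entry):
            print("  " + url)
```

The models are also available directly in this repository, so the download step is optional.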