Abstract
The nucleoprotein (N) from SARS-CoV-2 is an essential cofactor of the viral replication transcription complex and as such represents an important target for viral inhibition. It has also been shown to colocalize to the transcriptase-replicase complex, where many copies of N decorate the viral genome, thereby protecting it from the host immune system. N has also been shown to phase separate upon interaction with viral RNA. N is a 419 amino acid multidomain protein, comprising two folded, RNA-binding and dimerization domains spanning residues 45–175 and 264–365 respectively. The remaining 164 amino acids are predicted to be intrinsically disordered, but there is currently no atomic resolution information describing their behaviour. Here we assign the backbone resonances of the first two intrinsically disordered domains (N1, spanning residues 1–44 and N3, spanning residues 176–263). Our assignment provides the basis for the identification of inhibitors and functional and interaction studies of this essential protein.
Similar content being viewed by others
Biological context
The spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) represents a serious threat to the stability of human societies throughout the world. In order to eradicate the disease, it is necessary to foster research aimed at the development of vaccines or effective antiviral inhibitors. The establishment of effective therapeutic strategies requires the identification of molecular targets against which inhibitory strategies can be developed.
The nucleoprotein (N) of SARS-CoV-2 is a crucial component of the replication machinery, localizing to the viral replicase–transcriptase complex (4, 5, 7–9). Beta-coronaviral N encapsidates the viral genome within a nucleocapsid, formed from multiple copies of the protein (Zúñiga et al. 2007; McBride et al. 2014), thereby providing protection from the host immune system, and is essential for regulation of viral gene transcription (Wu et al. 2014). N is also produced in abundance in infected cells and is an important marker for infection. In order to identify possible avenues targeting the essential roles of N in the viral cycle, it is important to understand the structure and function of this complex protein at atomic resolution.
N is a 419 amino acid multi-domain protein, comprising folded, RNA-binding (N2) and dimerization (N4) domains spanning residues 45–175 and 266–365 respectively (Chang et al. 2014). N2 has been described using both X-ray crystallography (Kang et al. 2020; Peng et al. 2020) and NMR spectroscopy (Dinesh et al. 2020). The dimerization domain N4 has been described using X-ray crystallography (Ye et al. 2020; Peng et al. 2020) and N has been shown to form higher order oligomers via N4 (Ye et al. 2020; Zeng et al. 2020; Lutomski et al. 2020). The remaining 164 amino acids comprising N1, N3 and N5 are predicted to be intrinsically disordered, but despite evidence that these domains are essential for function in the related Mouse Hepatitis virus (Keane and Giedroc 2013), there is currently no atomic resolution information describing their behaviour in solution. Nuclear magnetic resonance is the ideal tool for studying the conformational behaviour of highly dynamic or intrinsically disordered proteins (Jensen et al. 2014).
Here we present the assignment of the backbone resonances of the first two intrinsically disordered domains (N1, spanning residues 1–44 and N3, the central disordered domain, spanning residues 176–265). The N3 IDR (175–263) comprises a serine-arginine (SR) rich domain that is phosphorylated in virions, a modification that plays a role in both function and localization of SARS-CoV-1 N (Peng et al. 2008). N5 is thought to contribute to the oligomerization of N in SARS-CoV-1 (Lo et al. 2013).
Recent studies used single molecule FRET and molecular modelling (Cubuk et al. 2020) to describe the global conformational behaviour of N1 and N3. Peptides representing the SR region of N3 in its phosphorylated and non-phosphorylated forms were also described using NMR spectroscopy (Savastano et al. 2020). Mass spectrometry has also revealed a number of auto-catalytic sites in N, two of which are present in N3 (Lu et al. 2020; Lutomski et al. 2020).
N from SARS-CoV-2 has been shown to undergo liquid–liquid phase separation (LLPS) upon mixing with RNA (Cubuk et al. 2020; Savastano et al. 2020; Chen et al. 2020; Iserman et al. 2020; Perdikari et al. 2020; Carlson et al. 2020; Lu et al. 2020; Jack et al. 2020). The formation of membraneless condensates by colocalization of components involved in viral replication in liquid droplets is prevalent in negative sense single strand RNA viruses, such as rabies (Nikolic et al. 2017), measles (Zhou et al. 2019; Guseva et al. 2020), and Nipah (Ringel et al. 2019). It is not yet known which interactions are responsible for LLPS in SARS-CoV-2, but it is highly likely that the intrinsically disordered domains are involved in this process. Moreover, N2 and N3 were shown to be essential for phase separation with RNA (Lu et al. 2020).
Here, we present backbone resonance assignment and NMR relaxation of two domains (N1 and N3) of N, providing data prerequisite for characterisation of the functional modes of this enigmatic protein, screening of interaction partners and detailed mapping of interactions.
Methods and experiments
Sample preparation
The primary sequence of N from SARS-CoV-2 was extracted from NCBI genome entry NC_045512.2 [GenBank entry MN908947.3 (Wu et al. 2014)]. Commercially synthesized genes (Genscript Biotech) were codon-optimized for expression in Escherichia coli and subcloned in a pET21b(+) vector. Hexa-histidine and TEV-cleavage tags were included at the N-terminus to facilitate protein purification. After protease cleavage, the proteins contain N-terminal GRR- extensions. For 15N and 13C isotope labelling, cells were grown in M9-minimal medium supplemented with 15N–NH4–Cl and 13C6–d–glucose (1 and 2 g/L respectively).
Both nucleoprotein constructs (N123, comprising residues 1–263 and N3, comprising residues 175–263) were cloned into a pESPRIT vector between the AatII and NotI cleavage sites with His8-tag and TEV cleavage sites at the N-terminus (GenScript Biotech Netherlands). Transformation was performed by heat-shock and proteins were expressed in E. coli BL21(DE3) (Novagen) for 5 h at 37°C after induction at an optical density of 0.6 with 1 mM isopropyl–b–d–thiogalactopyranoside. Cells were harvested by centrifugating at 5000 rpm, resuspended in 20 mM Tris (pH 8.0) and 500 mM NaCl buffer, lysed by sonication, and centrifugated again at 18,000 rpm at 4°C. The supernatant was subjected to standard Ni purification. Proteins were eluted with 20 mM Tris (pH 8), 500 mM NaCl and 500 mM imidazole. Samples were then dialysed against 20 mM Tris (pH 8), 500 mM NaCl, 2 mM DTT at room temperature overnight. Following TEV cleavage, samples were concentrated and subjected to size exclusion chromatography (SEC, Superdex 75/200) in 50 mM Na-Phosphate (pH 6.5), 250 mM NaCl 2 mM DTT buffer (NMR buffer). Proteins were studied at 600 and 150 μM for N3 and 0.91 mM for N123.
NMR experiments
BEST and BEST-TROSY (BT) double and triple resonance assignment experiments, including BEST-HNCA, BEST HN(CO)CA, BT-HNCO (Lescop et al. 2007; Solyom et al. 2013), were recorded on 15N,13C-labeled samples at 298 K using Bruker Avance III spectrometers equipped with a cryoprobe at 1H frequencies of 600 and 850 MHz. R1rho relaxation experiments (Lakomek et al. 2012) were recorded at 150 μM protein concentration and 298 K in a 50 mM Na-Phosphate (pH 6.5), 250 mM NaCl 2 mM DTT buffer at 950 MHz.
Residues 217–224 and 231–235 of N3 were assigned using a deuterated 13C 15N labelled sample at 300 µM. BEST-HNCA, BEST-HN(CO)CA with one increment in 15N dimension were recorded at 850 MHz, allowing detection of weak additional peaks of the helical region and connection with those already assigned. All spectra were processed using NMRPipe (Delaglio et al. 1995) and analyzed using CCPNMR Analysis Assign (Skinner et al. 2016) and NMRFAM-SPARKY (Lee et al. 2015). Assignment was further assisted by comparison to BMRB entry 34511 (Dinesh et al. 2020) of the N-terminal RNA-binding domain of the SARS-CoV-2 nucleoprotein, specifically nucleoprotein residues 44 to 180 (N2).
Assignments and data deposition
The HSQC of N3 is typical of an intrinsically disordered protein (Fig. 1). Most peaks in N3 are reproduced in the spectrum of N123, that also reveals the presence of the folded RNA-binding domain N2, in addition to disordered peaks from N1 (intensity of peaks from the first 15 residues of N3 were reduced in N123, probably due to the proximity with N2). The assignment of N2 has been published (Dinesh et al. 2020), and assignment of N1 in N123 (Fig. 2) and N3 in its isolated form and in the context of N123 were accomplished using standard triple resonance approaches. A high percentage of resonances could be assigned in N3 (84% 1HN, 84% 15NH, 82% 13Cα and 76% 13C) and N1 in the context of N123 (94% 1HN, 94% 15NH, 100% 13Cα, 57% 13Cβ and 100% 13C). These assignments have been deposited in the Biological Magnetic Resonance Databank (BMRB ID: 50557, comprising backbone resonance assignment of N3 and 50558, comprising backbone resonance assignment of N1 and of resolved resonances in N3, both in the context of N123).
Secondary structural propensity and dynamics
Resonance assignment of the two domains confirms the disordered nature of N1 and N3 (Figs. 1 and 2). The linker region (N3) connecting the two folded domains (N2 and N4), comprises intrinsically disordered SR-rich and polar termini, flanking a central hydrophobic strand that exhibits a pronounced helical propensity (> 30% from 216 to 224 with near 100% helical population from position 220) (Fig. 3). Assignment of 5 residues at the centre of this region was not possible, possibly due to dimerization mediated by the helix. This region is also predicted to form a hydrophobic helix in SARS-CoV (Chang et al. 2009). Recent simulations (Cubuk et al. 2020) also proposed the presence of a weakly (< 20%) helical motif in the SR region at the N-terminal region of N (176–185), which is not seen experimentally, although there is overlap of predicted helical propensity (< 30% helix predicted to stretch from 213 to 225) in the vicinity of experimentally observed helical propensity. Studies of isolated peptides (Savastano et al. 2020) also suggested a helical propensity in the SR region which is not seen here.
Assignment of the backbone resonances of the N-terminal domain of N (N1) within the N123 construct (Fig. 2) reveals the presence of an intrinsically disordered chain with no detectable secondary structural propensity and only very slight differences around residue 248 compared to free N3. Note that a number of resonances that were visible in N3 were not assigned in N123, most probably due to short relaxation times experienced in the vicinity of the helical motif that limits transfer of magnetization in triple resonance experiments. Nevertheless putative transfer of assignment was proposed on the basis of the 15N–1H correlation spectra of N3 and N123 (starred resonances in Fig. 2). Spin relaxation measured in N123 confirms the highly dynamic nature of N1 and N3 in the context of the folded RNA binding domain N2 (Fig. 3), with relaxation rates in the expected range for a disordered domain (Adamski et al. 2019), with the exception of the helical element in N3 and three consecutive residues in N1 (R14-I15-T16) that both show elevated rates.
In conclusion, NMR backbone assignment and preliminary relaxation studies provide the basis for further NMR studies of this important drug target, providing the tools necessary for the identification of inhibitors and for detailed functional studies of this essential protein.
References
Adamski W, Salvi N, Maurin D et al (2019) A unified description of intrinsically disordered protein dynamics under physiological conditions using NMR spectroscopy. J Am Chem Soc 141:17817–17829. https://doi.org/10.1021/jacs.9b09002
Carlson CR, Asfaha JB, Ghent CM et al (2020) Phosphorylation modulates liquid-liquid phase separation of the SARS-CoV-2 N protein. Bio Rxiv. https://doi.org/10.1101/2020.06.28.176248
Chang C-K, Hsu Y-L, Chang Y-H et al (2009) Multiple nucleic acid binding sites and intrinsic disorder of severe acute respiratory syndrome coronavirus nucleocapsid protein: implications for ribonucleocapsid protein packaging. J Virol 83:2255–2264. https://doi.org/10.1128/JVI.02001-08
Chang C, Hou M-H, Chang C-F et al (2014) The SARS coronavirus nucleocapsid protein—forms and functions. Antivir Res 103:39–50. https://doi.org/10.1016/j.antiviral.2013.12.009
Chen H, Cui Y, Han X et al (2020) Liquid–liquid phase separation by SARS-CoV-2 nucleocapsid protein and RNA. Cell Res 30:1143–1145. https://doi.org/10.1038/s41422-020-00408-2
Cubuk J, Alston JJ, Incicco JJ et al (2020) The SARS-CoV-2 nucleocapsid protein is dynamic, disordered, and phase separates with RNA. Bio Rxiv. https://doi.org/10.1101/2020.06.17.158121
Delaglio FD, Grzesiek S, Vuister G et al (1995) NMRPIPE–A multidimensional spectral processing system based on unix pipes. J biomol nmr 6:277–293
Dinesh DC, Chalupska D, Silhan J et al (2020) Structural basis of RNA recognition by the SARS-CoV-2 nucleocapsid phosphoprotein. Bio Rxiv. https://doi.org/10.1101/2020.04.02.022194
Guseva S, Milles S, Jensen MR et al (2020) Measles virus nucleo- and phosphoproteins form liquid-like phase-separated compartments that promote nucleocapsid assembly. Sci Adv. https://doi.org/10.1126/sciadv.aaz7095
Iserman C, Roden C, Boerneke M et al (2020) Specific viral RNA drives the SARS CoV-2 nucleocapsid to phase separate. Bio Rxiv. https://doi.org/10.1101/2020.06.11.147199
Jack A, Ferro LS, Trnka MJ et al (2020) SARS CoV-2 nucleocapsid protein forms condensates with viral genomic RNA. Bio Rxiv. https://doi.org/10.1101/2020.09.14.295824
Jensen MR, Zweckstetter M, Huang J, Blackledge M (2014) Exploring free-energy landscapes of intrinsically disordered proteins at atomic resolution using NMR spectroscopy. Chem Rev 114:6632–6660. https://doi.org/10.1021/cr400688u
Kang S, Yang M, Hong Z et al (2020) Crystal structure of SARS-CoV-2 nucleocapsid protein RNA binding domain reveals potential unique drug targeting sites. Acta Pharm Sin B. https://doi.org/10.1016/j.apsb.2020.04.009
Keane SC, Giedroc DP (2013) Solution structure of mouse Hepatitis Virus (MHV) nsp3a and determinants of the interaction with MHV nucleocapsid (N) protein. J Virol 87:3502–3515
Lakomek N-A, Ying J, Bax A (2012) Measurement of 15N relaxation rates in perdeuterated proteins by TROSY-based methods. J Biomol NMR 53:209–221. https://doi.org/10.1007/s10858-012-9626-5
Lee W, Tonelli M, Markley JL (2015) NMRFAM-SPARKY: enhanced software for biomolecular NMR spectroscopy. Bioinformatics 31:1325–1327. https://doi.org/10.1093/bioinformatics/btu830
Lescop E, Schanda P, Brutscher B (2007) A set of BEST triple-resonance experiments for time-optimized protein resonance assignment. J Magn Reson 187:163–169. https://doi.org/10.1016/j.jmr.2007.04.002
Lo Y-S, Lin S-Y, Wang S-M et al (2013) Oligomerization of the carboxyl terminal domain of the human coronavirus 229E nucleocapsid protein. FEBS Lett 587:120–127. https://doi.org/10.1016/j.febslet.2012.11.016
Lu S, Ye Q, Singh D et al (2020) The SARS-CoV-2 nucleocapsid phosphoprotein forms mutually exclusive condensates with RNA and the membrane-associated M protein. Bio Rxiv. https://doi.org/10.1101/2020.07.30.228023
Lutomski CA, El-Baba TJ, Bolla JR, Robinson CV (2020) Autoproteolytic products of the SARS-CoV-2 nucleocapsid protein are primed for antibody evasion and virus proliferation. Bio Rxiv. https://doi.org/10.1101/2020.10.06.328112
McBride R, van Zyl M, Fielding BC (2014) The coronavirus nucleocapsid is a multifunctional protein. Viruses 6:2991–3018. https://doi.org/10.3390/v6082991
Nikolic J, Le Bars R, Lama Z et al (2017) Negri bodies are viral factories with properties of liquid organelles. Nat Commun 8:58. https://doi.org/10.1038/s41467-017-00102-9
Peng T-Y, Lee K-R, Tarn W-Y (2008) Phosphorylation of the arginine/serine dipeptide-rich motif of the severe acute respiratory syndrome coronavirus nucleocapsid protein modulates its multimerization, translation inhibitory activity and cellular localization. FEBS J 275:4152–4163. https://doi.org/10.1111/j.1742-4658.2008.06564.x
Peng Y, Du N, Lei Y et al (2020) Structures of the SARS-CoV-2 nucleocapsid and their perspectives for drug design. EMBO J 39(20):e105938. https://doi.org/10.15252/embj.2020105938
Perdikari TM, Murthy AC, Ryan VH et al (2020) SARS-CoV-2 nucleocapsid protein undergoes liquid-liquid phase separation stimulated by RNA and partitions into phases of human ribonucleoproteins. Bio Rxiv. https://doi.org/10.1101/2020.06.09.141101
Ringel M, Heiner A, Behner L et al (2019) Nipah virus induces two inclusion body populations: Identification of novel inclusions at the plasma membrane. PLoS Pathog 15:e1007733. https://doi.org/10.1371/journal.ppat.1007733
Savastano A, de Opakua AI, Rankovic M, Zweckstetter M (2020) Nucleocapsid protein of SARS-CoV-2 phase separates into RNA-rich polymerase-containing condensates. Bio Rxiv. https://doi.org/10.1101/2020.06.18.160648
Skinner SP, Fogh RH, Boucher W et al (2016) CcpNmr analysis assign: a flexible platform for integrated NMR analysis. J Biomol NMR 66:111–124. https://doi.org/10.1007/s10858-016-0060-y
Solyom Z, Schwarten M, Geist L et al (2013) BEST-TROSY experiments for time-efficient sequential resonance assignment of large disordered proteins. J Biomol NMR 55:311–321. https://doi.org/10.1007/s10858-013-9715-0
Wu C-H, Chen P-J, Yeh S-H (2014) Nucleocapsid phosphorylation and RNA helicase DDX1 recruitment enables coronavirus transition from discontinuous to continuous transcription. Cell Host Microbe 16:462–472. https://doi.org/10.1016/j.chom.2014.09.009
Ye Q, West AMV, Silletti S, Corbett KD (2020) Architecture and self-assembly of the SARS-CoV-2 nucleocapsid protein. Bio Rxiv. https://doi.org/10.1101/2020.05.17.100685
Zeng W, Liu G, Ma H et al (2020) Biochemical characterization of SARS-CoV-2 nucleocapsid protein. Biochem Biophys Res Commun 527:618–623. https://doi.org/10.1016/j.bbrc.2020.04.136
Zhou Y, Su JM, Samuel CE, Ma D (2019) Measles virus forms inclusion bodies with properties of liquid organelles. J Virol. https://doi.org/10.1128/JVI.00948-19
Zúñiga S, Sola I, Moreno JL et al (2007) Coronavirus nucleocapsid protein is an RNA chaperone. Virology 357:215–227. https://doi.org/10.1016/j.virol.2006.07.046
Acknowledgements
This work used the platforms of the Grenoble Instruct-ERIC center (ISBG; UMS 3518 CNRS-CEA-UGA-EMBL) within the Grenoble Partnership for Structural Biology (PSB), supported by FRISBI (ANR-10-INBS-05-02) and GRAL, financed within the University Grenoble Alpes graduate school (Ecoles Universitaires de Recherche) CBH-EUR-GS (ANR-17-EURE-0003). IBS acknowledges integration into the Interdisciplinary Research Institute of Grenoble (IRIG CEA). IBS acknowledges integration into the Interdisciplinary Research Institute of Grenoble (IRIG CEA). The authors acknowledge discussions and encouragement from the COVID-19 NMR consortium under the coordination of Prof. Harald Schwalbe and colleagues. Financial support from the TGIR-RMN-THC Fr3050 CNRS for conducting the research is gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Guseva, S., Perez, L.M., Camacho-Zarco, A. et al. 1H, 13C and 15N Backbone chemical shift assignments of the n-terminal and central intrinsically disordered domains of SARS-CoV-2 nucleoprotein. Biomol NMR Assign 15, 255–260 (2021). https://doi.org/10.1007/s12104-021-10014-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12104-021-10014-x