Abstract
Sparse isotopic labeling of proteins for NMR studies using single types of amino acid (15N or 13C enriched) has several advantages. Resolution is enhanced by reducing numbers of resonances for large proteins, and isotopic labeling becomes economically feasible for glycoproteins that must be expressed in mammalian cells. However, without access to the traditional triple resonance strategies that require uniform isotopic labeling, NMR assignment of crosspeaks in heteronuclear single quantum coherence (HSQC) spectra is challenging. We present an alternative strategy which combines readily accessible NMR data with known protein domain structures. Based on the structures, chemical shifts are predicted, NOE cross-peak lists are generated, and residual dipolar couplings (RDCs) are calculated for each labeled site. Simulated data are then compared to measured values for a trial set of assignments and scored. A genetic algorithm uses the scores to search for an optimal pairing of HSQC crosspeaks with labeled sites. While none of the individual data types can give a definitive assignment for a particular site, their combination can in most cases. Four test proteins previously assigned using triple resonance methods and a sparsely labeled glycosylated protein, Robo1, previously assigned by manual analysis, are used to validate the method and develop a criterion for identifying sites assigned with high confidence.
Similar content being viewed by others
References
Aleti A, Moser I (2016) A systematic literature review of adaptive parameter control methods for evolutionary algorithms. ACM Comput Surv 49:35. doi:10.1145/2996355
Barb AW, Meng L, Gao ZW, Johnson RW, Moremen KW, Prestegard JH (2012) NMR characterization of immunoglobulin G Fc glycan motion on enzymatic sialylation. Biochemistry 51:4618–4626. doi:10.1021/bi300319q
Bordoli L, Kiefer F, Arnold K, Benkert P, Battey J, Schwede T (2009) Protein structure homology modeling using SWISS-MODEL workspace. Nat Protoc 4:1–13. doi:10.1038/nprot.2008.197
Buchner L, Guntert P (2015) Systematic evaluation of combined automated NOE assignment and structure calculation with CYANA. J Biomol NMR 62:81–95. doi:10.1007/s10858-015-9921-z
Chao FA, Kim JG, Xia YL, Milligan M, Rowe N, Veglia G (2014) FLAMEnGO 2.0: an enhanced fuzzy logic algorithm for structure-based assignment of methyl group resonances. J Magn Reson 245:17–23. doi:10.1016/j.jmr.2014.04.012
Chen K, Tjandra N (2012) The use of residual dipolar coupling in studying proteins by NMR. Top Curr Chem 326:47–67. doi:10.1007/128_2011_215
Dashti H, Tonelli M, Lee W, Westler WM, Cornilescu G, Ulrich EL, Markley JL (2016) Probabilistic validation of protein NMR chemical shift assignments. J Biomol NMR 64:17–25. doi:10.1007/s10858-015-0007-8
Everett JK et al (2016) A community resource of experimental data for NMR/X-ray crystal structure pairs. Protein Sci 25:30–45. doi:10.1002/pro.2774
Feng L, Lee HS, Prestegard JH (2007) NMR resonance assignments for sparsely 15 N labeled proteins. J Biomol NMR 38:213–219. doi:10.1007/s10858-007-9159-5
Gao Q et al (2016) Structural aspects of heparan sulfate binding to Robo1-Ig1-2. ACS Chem Biol. doi:10.1021/acschembio.6b00692
Goto NK, Kay LE (2000) New developments in isotope labeling strategies for protein solution NMR spectroscopy. Curr Opin Struc Biol 10:585–592
Grishaev A, Steren CA, Wu B, Pineda-Lucena A, Arrowsmith C, Llinas M (2005) ABACUS, a direct method for protein NMR structure computation via assembly of fragments. Proteins 61:36–43. doi:10.1002/prot.20457
Han B, Liu YF, Ginzinger SW, Wishart DS (2011) SHIFTX2: significantly improved protein chemical shift prediction. J Biomol NMR 50:43–57. doi:10.1007/s10858-011-9478-4
Huang YJ, Mao B, Xu F, Montelione GT (2015) Guiding automated NMR structure determination using a global optimization metric, the NMR DP score. J Biomol NMR 62:439–451. doi:10.1007/s10858-015-9955-2
Ikura M, Kay LE, Tschudin R, Bax A (1990) Three-dimensional NOESY-HMQC spectroscopy of a 13 C-labeled protein. J Magn Reson 86:204–209
John M, Schmitz C, Park AY, Dixon NE, Huber T, Otting G (2007) Sequence-specific and stereospecific assignment of methyl groups using paramagnetic lanthanides. J Am Chem Soc 129:13749–13757. doi:10.1021/ja0744753
Kainosho M, Torizawa T, Iwashita Y, Terauchi T, Ono AM, Guntert P (2006) Optimal isotope labelling for NMR protein structure determinations. Nature 440:52–57. doi:10.1038/nature04525
Kato K, Yamaguchi Y, Arata Y (2010) Stable-isotope-assisted NMR approaches to glycoproteins using immunoglobulin G as a model system. Prog Nucl Magn Res spectrosc 56:346–359. doi:10.1016/j.pnmrs.2010.03.001
Kay LE, Gardner KH (1997) Solution NMR spectroscopy beyond 25 kDa. Curr Opin Struct Biol 7:722–731. doi:10.1016/s0959-440x(97)80084-x
Lange OF et al (2012) Determination of solution structures of proteins up to 40 kDa using CS-Rosetta with sparse NMR data from deuterated samples. Proc Natl Acad Sci U S A 109:10873–10878. doi:10.1073/pnas.1203013109
Li DW, Bruschweiler R (2015) PPM_One: a static protein structure based chemical shift predictor. J Biomol NMR 62:403–409. doi:10.1007/s10858-015-9958-z
Lin HN, Wu KP, Chang JM, Sung TY, Hsu WL (2005) GANA—a genetic algorithm for NMR backbone resonance assignment. Nucleic Acids Res 33:4593–4601. doi:10.1093/nar/gki768
Linge JP, Habeck M, Rieping W, Nilges M (2003) ARIA: automated NOE assignment and NMR structure calculation. Bioinformatics 19:315–316. doi:10.1093/bioinformatics/19.2.315
Lipsitz RS, Tjandra N (2004) Residual dipolar couplings in NMR structure analysis. Annu Rev Biophys Biomed 33:387–413. doi:10.1146/annurev.biophys.33.110502.140306
Lohr F, Tumulka F, Bock C, Abele R, Dotsch V (2015) An extended combinatorial N-15, C-13(alpha), and C-13′ labeling approach to protein backbone resonance assignment. J Biomol NMR 62:263–279. doi:10.1007/s10858-015-9941-8
Maslennikov I, Choe S (2013) Advances in NMR structures of integral membrane proteins. Curr Opin Struc Biol 23:555–562. doi:10.1016/j.sbi.2013.05.002
McAllister RG, Konermann L (2015) Challenges in the interpretation of protein H/D exchange data: a molecular dynamics simulation perspective. Biochemistry 54:2683–2692. doi:10.1021/acs.biochem.5b00215
Mishra SH, Frueh DP (2015) Assignment of methyl NMR resonances of a 52 kDa protein with residue-specific 4D correlation maps. J Biomol NMR 62:281–290. doi:10.1007/s10858-015-9943-6
Nietlispach D et al (1996) An approach to the structure determination of larger proteins using triple resonance NMR experiments in conjunction with random fractional deuteration. J Am Chem Soc 118:407–415. doi:10.1021/ja952207b
Nkari WK, Prestegard JH (2009) NMR resonance assignments of sparsely labeled proteins: amide proton exchange correlations in native and denatured states. J Am Chem Soc 131:5344–5349. doi:10.1021/ja8100775
Orts J, Walti MA, Marsh M, Vera L, Gossert AD, Guntert P, Riek R (2016) NMR-based determination of the 3D structure of the ligand-protein interaction site without protein resonance assignment. J Am Chem Soc 138:4393–4400. doi:10.1021/jacs.5b12391
Piai A et al. (2016) Amino acid recognition for automatic resonance assignment of intrinsically disordered proteins. J Biomol NMR 64:239–253. doi:10.1007/s10858-016-0024-2
Prestegard JH, Agard DA, Moremen KW, Lavery LA, Morris LC, Pederson K (2014) Sparse labeling of proteins: structural characterization from long range constraints. J Magn Reson 241:32–40. doi:10.1016/j.jmr.2013.12.012
Sahakyan AB, Vranken WF, Cavalli A, Vendruscolo M (2011) Structure-based prediction of methyl chemical shifts in proteins. J Biomol NMR 50:331–346. doi:10.1007/s10858-011-9524-2
Schmitt LM (2001) Theory of genetic algorithms. Theor Comput Sci 259:1–61
Shen Y, Bax A (2010) SPARTA plus: a modest improvement in empirical NMR chemical shift prediction by means of an artificial neural network. J Biomol NMR 48:13–22. doi:10.1007/s10858-010-9433-9
Skinner SP, Moshev M, Hass MAS, Ubbink M (2013) PARAssign-paramagnetic NMR assignments of protein nuclei on the basis of pseudocontact shifts. J Biomol NMR 55:379–389. doi:10.1007/s10858-013-9722-1
Sumathi S, Surekha P (2010) MATLAB-based genetic algorithm computational intelligence paradigms: theory and applications using Matlab, CRC Press, Boca Rotan. doi:10.1201/9781439809037$4
Tsoulos IG, Tzallas A, Tsalikakis D (2016) PDoublePop: An implementation of parallel genetic algorithm for function optimization. Comput Phys Commun 209:183–189. doi:10.1016/j.cpc.2016.09.006
Tugarinov V, Kanelis V, Kay LE (2006) Isotope labeling strategies for the study of high-molecular-weight proteins by solution NMR spectroscopy. Nat Protoc 1:749–754. doi:10.1038/nprot.2006.101
Tzakos AG, Grace CRR, Lukavsky PJ, Riek R (2006) NMR techniques for very large proteins and RNAs in solution Annu Rev Bioph Biom 35:319–342. doi:10.1146/annurev.biophys.35.040405.102034
Valafar H, Prestegard JH (2004) REDCAT: a residual dipolar coupling analysis tool. J Magn Reson 167:228–241. doi:10.1016/j.jmr.2003.12.012
Webb B, Sali A (2014) Comparative protein structure modeling using Modeller. Curr Protoc Bioinform. doi:10.1002/0471250953.bi0506s47
Williamson MP (2014) Using chemical shift perturbation to characterise ligand binding. Prog Nucl Mag Res Spectrosc. doi:10.1016/j.pnmrs.2014.05.001
Xiao Y, Warner LR, Latham MP, Ahn NG, Pardi A (2015) Structure-based assignment of Ile, Leu, and Val methyl groups in the active and inactive forms of the mitogen-activated protein kinase extracellular signal-regulated kinase. Biochemistry 54:4307–4319. doi:10.1021/acs.biochem.5b00506
Yang Y, Fritzsching KJ, Hong M (2013) Resonance assignment of the NMR spectra of disordered proteins using a multi-objective non-dominated sorting genetic algorithm. J Biomol NMR 57:281–296. doi:10.1007/s10858-013-9788-9
Zweckstetter M (2008) NMR: prediction of molecular alignment from structure using the PALES software. Nat Protoc 3:679–690. doi:10.1038/nprot.2008.36
Chipperfield A, Fleming P (1995) The MATLAB genetic algorithm toolbox. In: Applied control techniques using MATLAB, IEE Colloquium on, 1995. IET, pp 10/11–10/14
Acknowledgements
We would like to thank Drs. Kari Pederson and Alex Eletsky for their constructive comments during the course of this work. We gratefully acknowledge financial support from the National Institute of General Medical Sciences (U54GM094597, P41GM103390 and R01GM033225). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Author information
Authors and Affiliations
Corresponding author
Additional information
Qi Gao and Gordon R. Chalmers have contributed equally to the work.
Electronic supplementary material
Below is the link to the electronic supplementary material.
10858_2017_101_MOESM1_ESM.pdf
Detailed description of the genetic algorithm implementation and additional histograms showing assignment frequencies. (PDF 1122 KB)
Rights and permissions
About this article
Cite this article
Gao, Q., Chalmers, G.R., Moremen, K.W. et al. NMR assignments of sparsely labeled proteins using a genetic algorithm. J Biomol NMR 67, 283–294 (2017). https://doi.org/10.1007/s10858-017-0101-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10858-017-0101-1